Skip to content

Latest commit

 

History

History
149 lines (122 loc) · 8.55 KB

jcache-usage.md

File metadata and controls

149 lines (122 loc) · 8.55 KB

JCache integration

Bucket4j supports any GRID solution which compatible with JCache API (JSR 107) specification.

Do not forget to read JCache production checklist before using the Bucket4j over JCache cluster.

To use JCache extension you also need to add following dependency:

<dependency>
    <groupId>com.github.vladimir-bukhtoyarov</groupId>
    <artifactId>bucket4j-jcache</artifactId>
    <version>${bucket4j.version}</version>
</dependency>

Example 1 - limiting access to HTTP server by IP address

Imagine that you develop any Servlet based WEB application and want to limit access per IP basis. The limitation depends on country to which IP belongs, you want to allow less tokens to democratic countries and more tokens for normal countries, and a very small amount of tokens for Russia in order to protect your service from russian hackers. ServletFilter would be obvious place to check limits:

public class IpThrottlingFilter implements javax.servlet.Filter {

    // service to find alpha-3 country code by IP
    @Inject
    private GeoLocationService geoLocationService;
    
    // service to provide per country limits
    @Inject
    private LimitProvider limitProvider;
    
    // cache for storing token buckets, where IP is key.
    @Inject
    private javax.cache.Cache<String, GridBucketState> cache;
    
    private ProxyManager<String> buckets;
    
    @Override
    public void init(FilterConfig filterConfig) throws ServletException {
         // init bucket registry
         buckets = Bucket4j.extension(JCache.class).proxyManagerForCache(cache);
    }
    
    @Override
    public void doFilter(ServletRequest servletRequest, ServletResponse servletResponse, FilterChain filterChain) throws IOException, ServletException {
        HttpServletRequest httpRequest = (HttpServletRequest) servletRequest;
        String ip = IpHelper.getIpFromRequest(httpRequest);
        
        // prepare configuration supplier which will be called(on first interaction with proxy) if bucket was not saved yet previously. 
        Supplier<BucketConfiguration> configurationLazySupplier = getConfigSupplierForIp(ip);
        
        // acquire cheap proxy to bucket, the real  
        Bucket bucket = buckets.getProxy(ip, configurationLazySupplier);

        // tryConsume returns false immediately if no tokens available with the bucket
        if (bucket.tryConsume(1)) {
            // the limit is not exceeded
            filterChain.doFilter(servletRequest, servletResponse);
        } else {
            // limit is exceeded
            HttpServletResponse httpResponse = (HttpServletResponse) servletResponse;
            httpResponse.setContentType("text/plain");
            httpResponse.setStatus(429);
            httpResponse.getWriter().append("Too many requests");
        }
    }
    
    private Supplier<BucketConfiguration> getConfigSupplierForIp(String ip) {
         return () -> {
             String countryAlpha3 = geoLocationService.getCountryCodeByIp(ip);
             long tokensPerMinute = limitProvider.getPerMinuteRate(countryAlpha3);
             return Bucket4j.configurationBuilder()
                         .addLimit(Bandwidth.simple(tokensPerMinute, Duration.ofMinutes(1)))
                         .buildConfiguration()
         };
    }

}

Question: is the provided JCache integration safe across multiple JVMs? Does it ensure that two nodes creating a bucket simultaneously on a given Cache<K, V> will only actually create one single bucket (without resetting a previously created one with the same key)?
Answer: Yes. JCache integration is safe for multi node environment, Bucket4j never replaces bucket which already exists. This behavior is guaranteed by putIfAbsent method contract of javax.cache.Cache class.

Question: Does ProxyManager store buckets internally, could be this a reason of OutOfMemoryError?
Answer: No. ProxyManager stores nothing about buckets which it returns, the buckets actually stored in in-memory GRID outside client JVM. Think about proxy returned by ProxyManager#getBucket just about very cheap pointer to data which actually stored somewhere outside. So, independently of count of buckets ProxyManager will never be a reason of crash or extreme memory consumption.

Question: what will happen if bucket state will be lost in the GRID because of split-brain, human mistake or pragmatically errors introduced by GRID vendor?
Answer: ProxyManager automatically detect this kind of situations and save bucket yet another time, to reconstruct bucket it uses provided configuration supplier. Reconstructed bucket remembers nothing about previously consumed tokens, so limit can be exceeded in this kind of GRID failures.

Question: should I always work with JCache through ProxyManager?
Answer: It depends. When you have deal with potentially huge and unpredictable amount of buckets, it is always better to use ProxyManager. ProxyManager protects you from common performance pitfalls(like described in this issue). But when you have deal with one or few buckets which well known at development time, then it would be better to avoid ProxyManager and work directly with GridBucket as described in the next example.

Example 2 - working with JCache without ProxyManager abstraction

Imagine yet another time that you develop WEB application and want to protect the whole cluster by 1000 requests per second, independently from request source, in other words you need one single bucket which protects the system at whole. Lets create ServletFilter to check limits similar to Example 1:

public class GlobalThrottlingFilter implements javax.servlet.Filter {

    private static final String BUCKET_ID = "global-limit";
    
    @Inject
    private javax.cache.Cache<String, GridBucketState> cache;
    
    private Bucket bucket;
    
    @Override
    public void init(FilterConfig filterConfig) throws ServletException {
         // create bucket
         bucket = Bucket4j.extension(JCache.class).builder()
             .addLimit(Bandwidth.simple(1000, Duration.ofSeconds(1)))
             .build(cache, BUCKET_ID, RecoveryStrategy.RECONSTRUCT);
    }
    
    @Override
    public void doFilter(ServletRequest servletRequest, ServletResponse servletResponse, FilterChain filterChain) throws IOException, ServletException {
        // tryConsume returns false immediately if no tokens available with the bucket
        if (bucket.tryConsume(1)) {
            // the limit is not exceeded
            filterChain.doFilter(servletRequest, servletResponse);
        } else {
            // limit is exceeded
            HttpServletResponse httpResponse = (HttpServletResponse) servletResponse;
            httpResponse.setContentType("text/plain");
            httpResponse.setStatus(429);
            httpResponse.getWriter().append("Too many requests");
        }
    }

}

As you can see the code is simpler when you work with Bucket directly without ProxyManager, so use this way always when all buckets are known at development time.

Runnable examples of JCache integration

Bucket4j well tested with Hazelcast and Apache Ignite/GridGain, you can use integration tests from this folder as live examples.

Why JCache specification is not enough and since 3.0 were introduced the dedicated modules for Infinispan, Hazelcast and Ignite?

Asynchronous processing is very important for high-throughput applications, but JCache specification does not specify asynchronous API, because two early attempts to bring this kind functionality at spec level 307,312 were failed in absence of consensus. Sad, but true, if you need for asynchronous API, then JCache extension is useless, and you need to choose from following extensions:

Of course implementation the asynchronous support for any other JCache provider outside from the list above should be easy exercise, so feel free to return back the pull request addressed to cover your favorite JCache provider.