Tech & Dev 75% CONFIDENCE Dev.to Top 14 czerwca 2026 22:51

The Rate‑Limiter Jedi: Mastering CAP Like a Lightsaber Duel

AUTHOR · Timevolt

The Quest Begins (The "Why") Picture this: I’m knee‑deep in a microservice that powers a flash‑sale site. Black Friday is looming, traffic spikes like a dragon breathing fire, and every request that slips through threatens to melt our downstream inventory service. I slapped a simple counter in Redis, called it a day, and went to grab coffee. When I came back, the site was down—not because the inventory service crashed, but because my “rate limiter” was too strict. Under a brief network hiccup, the Redis node became unreachable, the limiter started denying everything , and users saw a sea of 429s. It felt like watching The Matrix when Neo first sees the code—except the code was a wall of red error messages, and I was the one who had just pulled the plug. I needed a limiter that could survive a partition, stay responsive, and still keep the bad guys at bay. That’s when I remembered the old CAP theorem from a university lecture I’d half‑listened to while scrolling memes. Turns out, the theorem isn’t just theory—it’s the very lightsaber we need to duel with distributed systems. The Revelation (The Insight) Here’s the treasure I uncovered: CAP isn’t a choice you make once and forget; it’s a lens you keep adjusting as you design each piece of your system. C onsistency – every node sees the same counter value at the same time. A vailability – every request gets a response (even if it’s stale). P artition tolerance – the system keeps working when the network splits. You can only have two of the three at any given moment. For a rate limiter, the question becomes: Which two do we actually need? If we chase strong consistency (C) and partition tolerance (P), we sacrifice availability (A). Think of a traditional Redis cluster with strict quorum writes: when a node is unreachable, the limiter blocks until the quorum reforms. That’s great for correctness, but during a brief network blip you end up denying legitimate traffic—exactly what happened to me. If we instead favor availability and partition tolerance (A + P), we accept that our counters may diverge for a short while. This is the eventual‑consistency path. The limiter might let a few extra requests slip through during a partition, but the service stays up and users keep getting responses. After the partition heals, the counters converge. For most public‑facing APIs—think Twitter’s rate limits, Cloudflare’s WAF, or even your own internal service‑to‑service calls— a few extra hits are far cheaper than a total outage . So we deliberately design an AP rate limiter. The magic insight? Design your limiter around a local token bucket that each node updates independently, then gossip the bucket state periodically. The bucket itself gives you the throttling logic; the gossip layer handles the AP trade‑off. Wielding the Power (Code & Examples) The Struggle: A Naïve Centralized Limiter // centralLimiter.go – the version that broke on Black Friday package main import ( "github.com/go-redis/redis/v8" "time" ) var rdb = redis . NewClient ( & redis . Options { Addr : "redis:6379" }) func allow ( userID string ) bool { key := fmt . Sprintf ( "rl:%s" , userID ) // INCR and EXPIRE are a single atomic script (Lua) script := redis . NewScript ( ` local val = redis.call('INCR', KEYS[1]) if val == 1 then redis.call('EXPIRE', KEYS[1], ARGV[1]) end return val ` ) count , _ := script . Run ( ctx , rdb , [] string { key }, strconv . Itoa ( limit )) . Int () return count <= limit } What went wrong? The INCR blocks until Redis replies. If the Redis node is unreachable (network partition), the call times out, we treat it as a denial, and the whole service throttles itself. It’s a classic CP choice that murdered availability when we needed it most. The Victory: An AP Token‑Bucket Limiter with Gossip First, the local bucket on each node: // bucket.go – each service instance holds its own bucket type Bucket struct { Capacity int64 Tokens int64 LastRefill time . Time Rate time . Duration // refill interva

CZYTAJ ŹRÓDŁOWY ARTYKUŁ → WIĘCEJ Z TECH & DEV