DevOps
fromMedium
12 hours agoFair Multitenancy-Beyond Simple Rate Limiting
Fair multitenancy ensures equitable infrastructure access for customers, balancing simplicity, performance, and safety in shared environments.
In recent blog posts, both Uber ( Uber's Rate Limiting System), and OpenAI ( Beyond rate limits: scaling access to Codex and Sora) discuss shifts in their approach to rate limiting: moving from counter-based, per-service limits to adaptive, policy-based systems. Both companies developed proprietary rate-limiting platforms implemented at the infrastructure layer. These systems feature soft controls that manage traffic by asserting pressure on clients rather than utilizing hard stops - either through probabilistic shedding or credit-based waterfalls - ensuring system resilience without sacrificing user momentum.
GitHub engineers recently traced user reports of unexpected "Too Many Requests" errors to abuse-mitigation rules that had accidentally remained active long after the incidents that prompted them. According to GitHub, the affected users were not generating high-volume traffic; they were "making a handful of normal requests" that still tripped protections. The investigation found that older incident rules were based on traffic patterns that were strongly associated with abuse at the time, but later began matching some legitimate, logged-out requests.
Beskar is a Rails engine that fills that gap with layered protection: WAF that detects vulnerability scanning patterns (WordPress, config files, path traversal) Impossible travel detection using geolocation and Haversine calculations Smart rate limiting that identifies attack patterns (brute force, credential stuffing, distributed attacks) Risk-based account locking with automatic responses Persistent IP banning with escalating durations Installation is deliberately simple - drop it in your Gemfile, run the installer, add one line to your User model. Runs in monitor-only mode by default so you can tune thresholds before blocking real traffic.