Overview
Satsuma implements rate limiting to ensure fair usage and maintain service quality for all users. This guide explains how rate limits work, how to handle them gracefully, and strategies to optimize your API usage.Rate Limit Structure
Request-based Limits
Rate limits are enforced per API key across different time windows:- Per Second: Short-term burst protection
- Per Minute: Medium-term usage smoothing
- Per Hour: Long-term quota management
- Per Day: Daily usage caps
Tiered Limits by Plan
Free Tier
Free Tier
- Requests: 1,000 per month
- Rate: 5 requests per second
- Burst: 10 requests per 10 seconds
- Quota Reset: Monthly on signup date
Developer - $49/month
Developer - $49/month
- Requests: 50,000 per month
- Rate: 25 requests per second
- Burst: 100 requests per 10 seconds
- Quota Reset: Monthly on billing date
Business - $199/month
Business - $199/month
- Requests: 500,000 per month
- Rate: 100 requests per second
- Burst: 500 requests per 10 seconds
- Quota Reset: Monthly on billing date
Enterprise - Custom
Enterprise - Custom
- Requests: Unlimited or custom limit
- Rate: Custom rates up to 1,000 RPS
- Burst: Negotiated burst capacity
- Quota Reset: Configurable
Rate Limit Headers
Every API response includes rate limit information in the headers:Header Definitions
X-RateLimit-Limit: Maximum requests allowed in current windowX-RateLimit-Remaining: Requests remaining in current windowX-RateLimit-Reset: Unix timestamp when the limit resetsX-RateLimit-Retry-After: Seconds to wait before retrying (429 responses only)
Handling Rate Limit Responses
429 Rate Limit Exceeded
When you exceed rate limits, the API returns a 429 status with details:Implementing Retry Logic
Optimization Strategies
Request Batching
Combine multiple operations into single requests when possible:Caching Strategies
Implement intelligent caching to reduce API calls:Connection Pooling
Reuse HTTP connections to reduce overhead:Rate Limit Monitoring
Tracking Usage
Monitor your rate limit usage to avoid surprises:Alerting on High Usage
Set up alerts when approaching rate limits:Plan Management
Upgrading Plans
When you consistently hit rate limits:1
Monitor Usage Patterns
Track your API usage over time to understand peak load requirements
2
Calculate ROI
Compare the cost of API rate limit delays vs. higher plan pricing
3
Upgrade Plan
Visit the billing section to upgrade
4
Verify Limits
Test that new limits meet your application’s needs
Custom Enterprise Limits
For high-volume applications, contact our sales team to discuss:- Custom rate limits above standard plans
- Burst capacity for traffic spikes
- Regional rate limits for global applications
- Dedicated infrastructure for consistent performance
Best Practices
Request Timing
- Spread requests evenly instead of bursts when possible
- Use jitter in retry delays to avoid thundering herd effects
- Queue non-urgent requests during high-traffic periods
- Implement circuit breakers to prevent cascade failures
Error Handling
- Always check rate limit headers before making next request
- Implement exponential backoff with a maximum delay
- Log rate limit events for monitoring and optimization
- Gracefully degrade functionality when limits are reached
Architecture Considerations
- Cache frequently accessed data to reduce API calls
- Use webhooks instead of polling for real-time updates
- Implement request queuing for batch operations
- Consider async processing for non-real-time operations
Troubleshooting
Common Rate Limit Issues
Unexpected 429 errors on low usage
Unexpected 429 errors on low usage
- Check if you have multiple processes/servers using the same API key
- Verify you’re not making concurrent requests that exceed per-second limits
- Review your request batching and caching implementation
- Check if your plan has sufficient rate limits for your usage pattern
Rate limits reset unexpectedly
Rate limits reset unexpectedly
- Different rate limit windows (per-second, per-minute, per-hour) have different reset times
- Monthly quotas reset on your billing date, not calendar month
- Upgrading plans may cause immediate limit increases
- Contact support if you see inconsistent reset behavior
High latency during rate limiting
High latency during rate limiting
- Implement proper backoff strategies instead of aggressive retries
- Use connection pooling to reduce connection establishment overhead
- Consider upgrading to a higher plan for better rate limits
- Monitor for server-side rate limiting vs. client-side queuing delays
Rate Limit Debugging
Use these tools to debug rate limit issues:Getting Help
For rate limit assistance:- Review your usage in the dashboard analytics
- Check our status page for any rate limiting incidents
- Contact support@satsuma.ai with your API key prefix and usage details
- For plan upgrades or custom limits, reach out to sales@satsuma.ai