Rate Limit Tiers
Different service tiers offer varying rate limits to accommodate different usage patterns and requirements. Below are the rate limits for each tier:Tier | Requests per Minute | Input Tokens per Minute | Output Tokens per Minute | Credits to Purchase | Max Monthly Spend |
---|---|---|---|---|---|
Default | 20 | 30,000 | 8,000 | $5 | $100 |
Tier 2 | 200 | 300,000 | 80,000 | $40 | $500 |
Tier 3 | 400 | 600,000 | 160,000 | $200 | $1,000 |
Default Tier
The default tier provides basic rate limits suitable for most development and testing scenarios. This tier is unlocked with$5
credits and has a maximum monthly spend of $100
.
Tier 2
Tier 2 offers 10x the default limits, ideal for production applications with moderate usage. This tier is unlocked with$40
credits and has a maximum monthly spend of $500
.
Tier 3
Tier 3 provides the highest limits at 20x the default, designed for high-volume production workloads. This tier is unlocked with$200
credits and has a maximum monthly spend of $1,000
.
Understanding Rate Limits
- Requests per Minute: The maximum number of API calls allowed per minute
- Input Tokens per Minute: The maximum number of tokens that can be sent as input per minute
- Output Tokens per Minute: The maximum number of tokens that can be generated as output per minute