For custom rate limits, contact Sales.
To ensure optimal service performance and fairness in resource allocation, our endpoints enforce the following rate limits.
  1. RPM (requests per minute): 400
  2. TPM (tokens per minute): 25,000

Best practices

Implement mechanisms in your applications to track and regulate the frequency of your requests to stay within the prescribed limits.
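One way to track usage on the client side is a rolling 60-second window that counts both requests and tokens. The sketch below is illustrative only: the class name `SlidingWindowLimiter` and its methods are hypothetical, it assumes the limits apply over a sliding window (the service may measure them differently), and it assumes you can estimate each request's token count before sending it.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Tracks requests and tokens sent during the last `window` seconds."""

    def __init__(self, max_requests=400, max_tokens=25_000, window=60.0,
                 clock=time.monotonic):
        self.max_requests = max_requests
        self.max_tokens = max_tokens
        self.window = window
        self.clock = clock            # injectable so the limiter can be tested
        self.events = deque()         # one (timestamp, tokens) pair per request
        self.tokens_in_window = 0

    def _evict(self, now):
        # Drop requests that have aged out of the window.
        while self.events and now - self.events[0][0] >= self.window:
            _, tokens = self.events.popleft()
            self.tokens_in_window -= tokens

    def wait_time(self, tokens):
        """Return the seconds to wait before a request of `tokens` fits."""
        if tokens > self.max_tokens:
            raise ValueError("request exceeds the per-window token budget")
        now = self.clock()
        self._evict(now)
        requests, used = len(self.events), self.tokens_in_window
        if requests < self.max_requests and used + tokens <= self.max_tokens:
            return 0.0
        # Find the earliest moment at which enough old requests have expired.
        for timestamp, spent in self.events:
            requests -= 1
            used -= spent
            if requests < self.max_requests and used + tokens <= self.max_tokens:
                return max(0.0, timestamp + self.window - now)
        return 0.0

    def record(self, tokens):
        """Register a request that was just sent."""
        self.events.append((self.clock(), tokens))
        self.tokens_in_window += tokens
```

Before each call, sleep for `limiter.wait_time(n)` seconds and then call `limiter.record(n)`, where `n` is your estimate of the tokens the request will consume.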
If you exceed these limits, retry with exponential backoff: wait longer after each failed attempt so that retries are spaced out and are less likely to breach the limits again.
Be prepared to handle HTTP 429 (Too Many Requests) responses by pausing or slowing down your request rate.
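The retry advice above can be sketched as a small wrapper that backs off exponentially, with random jitter, whenever it sees a 429. This is a minimal sketch under stated assumptions, not the service's prescribed method: `call_with_backoff` and the `send` callable are hypothetical names, `send` is assumed to return a `(status_code, headers, body)` tuple, and the code honors a `Retry-After` header given in whole seconds only if the service happens to send one.

```python
import random
import time


def call_with_backoff(send, max_attempts=6, base_delay=1.0, max_delay=60.0,
                      sleep=time.sleep, rng=random.random):
    """Call `send()` until it returns a non-429 status or attempts run out.

    `send` is a hypothetical callable returning (status_code, headers, body),
    where `headers` is a plain dict.
    """
    for attempt in range(max_attempts):
        status, headers, body = send()
        if status != 429:
            return status, body
        if attempt == max_attempts - 1:
            break  # no point sleeping after the final attempt
        retry_after = headers.get("Retry-After")
        if retry_after is not None and retry_after.isdigit():
            # Assumption: the server's hint is a whole number of seconds.
            delay = float(retry_after)
        else:
            # Full jitter: a random delay below an exponentially growing cap.
            delay = rng() * min(max_delay, base_delay * 2 ** attempt)
        sleep(delay)
    raise RuntimeError("still rate limited after %d attempts" % max_attempts)
```

The jitter spreads out retries from clients that were throttled at the same moment, so they do not all retry together and trigger another 429.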