SambaNova Cloud enforces rate limits on inference requests per model to ensure that developers are able to try the fastest inference.

Developer tier

Rate limits for the developer tier are described below.

Preview models

Preview models in SambaNova Cloud are offered as early access models primarily for trial purposes. During the Preview phase, these models are available exclusively in the playground.

For API access and higher rate limits for DeepSeek-R1, please complete this form to join the waitlist. Priority will be given to developers who are actively using SambaNova Cloud and have signed up with a payment method.

For those who have been granted access for DeepSeek-R1 from the waitlist, please refer to the Production models section under the Payment and credits tab for the allowed rate limits.

DeveloperModel IDRequests per minute (RPM)Requests per hour (RPH)Requests per day (RPD)
DeepSeek
DeepSeek-R12510

Production models

Production models meet our high standards for speed and quality and are intended for use in production environments.

Payment and credits limits are applied when a payment method is linked with the account. Credits only limits are applied when there is no additional payment method linked with the account. See more on the Billing page.

DeveloperModel IDRequests per minute (RPM)Requests per hour (RPH)Requests per day (RPD)
DeepSeek
DeepSeek-R1101002000
DeepSeek-R1-Distill-Llama-70B303003600
Ai2
Llama-3.1-Tulu-3-405B151501800
Meta
Meta-Llama-3.3-70B-Instruct303003600
Meta-Llama-3.2-3B-Instruct606007200
Meta-Llama-3.2-1B-Instruct606007200
Meta-Llama-3.1-405B-Instruct151501800
Meta-Llama-3.1-70B-Instruct303003600
Meta-Llama-3.1-8B-Instruct606007200
Meta-Llama-Guard-3-8B303003600
Llama-3.2-90B-Vision-Instruct550600
Llama-3.2-11B-Vision-Instruct151501800
Qwen
Qwen2.5-72B-Instruct202002400
Qwen2.5-Coder-32B-Instruct202002400
QwQ-32B101001200
Qwen2-Audio-7B-Instruct550600
Tokyotech-llm
Llama-3.1-Swallow-8B-Instruct-v0.3303003600
Llama-3.1-Swallow-70B-Instruct-v0.3202002400

Other tiers

For rate limits in the Managed Subscription and Dedicated tiers, reach out to sales or contact us on our Community page so we can accommodate your projects needs.