SambaNova Cloud API provides access to DeepSeek, Llama, and Qwen family of models listed below, all at full precision. SambaNova provides access to all models for all tiers.

Model details

A structured comparison of various models across the Qwen and Llama families is provided below. It describes key specifications such as model name, ID, developer, context length, and provides links to their respective model cards.

Preview models

Preview models in SambaNova Cloud are available as early-access offerings intended primarily for evaluation purposes. During the preview phase, these models have limited capacity but are fully functional in terms of accuracy and performance.

DeveloperModel IDContext lengthMax file size1View on Hugging Face
DeepSeek
DeepSeek-V3-032432k tokensN/AModel card
OpenAI
Whisper-Large-v3N/A25MBModel card
Meta
Llama-4-Scout-17B-16E-Instruct8k tokensN/AModel card
Llama-4-Maverick-17B-128E-Instruct128k tokensN/AModel card
Qwen
Qwen2-Audio-7B-InstructN/A25MBModel card
Qwen3-32B8kN/AModel card

1Audio models only.

Production models

Production models meet our high standards for speed and quality and are intended for use in production environments.

DeveloperModel IDContext lengthView on Hugging Face
DeepSeek
DeepSeek-R132k tokensModel card
DeepSeek-R1-Distill-Llama-70B128k tokensModel card
Meta
Meta-Llama-3.3-70B-Instruct128k tokensModel card
Meta-Llama-3.2-3B-Instruct4k tokensModel card
Meta-Llama-3.2-1B-Instruct16k tokensModel card
Meta-Llama-3.1-405B-Instruct16k tokensModel card
Meta-Llama-3.1-8B-Instruct16k tokensModel card
Meta-Llama-Guard-3-8B8k tokensModel card
Qwen
QwQ-32B16k tokensModel card
Tokyotech-llm
Llama-3.3-Swallow-70B-Instruct-v0.416k tokensModel card
Other
E5-Mistral-7B-Instruct4k tokensModel card