The SambaNova Cloud API provides access to DeepSeek models, the Llama 3.3, 3.2, and 3.1 model families, and the Qwen 2.5 family, all at full precision. All models are available on every tier.

Model details

The tables below list the available models, grouped by developer. Each entry gives the model ID, the context length, and a link to the model card on Hugging Face.

Preview models

Preview models in SambaNova Cloud are available as early-access offerings intended primarily for evaluation purposes. During the preview phase, these models have limited capacity but are fully functional in terms of accuracy and performance.

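| Developer | Model ID | Context length | View on Hugging Face |
|-----------|----------|----------------|----------------------|
| DeepSeek | DeepSeek-V3-0324 | 8k tokens | Model card |
| Meta | Llama-4-Scout-17B-16E-Instruct | 8k tokens | Model card |
| Meta | Llama-4-Maverick-17B-128E-Instruct | 8k tokens | Model card |
| Qwen | Qwen2-Audio-7B-Instruct | N/A | Model card |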

Production models

Production models meet our high standards for speed and quality and are intended for use in production environments.

| Developer | Model ID | Context length | View on Hugging Face |
|-----------|----------|----------------|----------------------|
| DeepSeek | DeepSeek-R1 | 16k tokens | Model card |
| DeepSeek | DeepSeek-R1-Distill-Llama-70B | 128k tokens | Model card |
| Meta | Meta-Llama-3.3-70B-Instruct | 128k tokens | Model card |
| Meta | Meta-Llama-3.2-3B-Instruct | 8k tokens | Model card |
| Meta | Meta-Llama-3.2-1B-Instruct | 16k tokens | Model card |
| Meta | Meta-Llama-3.1-405B-Instruct | 16k tokens | Model card |
| Meta | Meta-Llama-3.1-8B-Instruct | 16k tokens | Model card |
| Meta | Meta-Llama-Guard-3-8B | 8k tokens | Model card |
| Qwen | QwQ-32B | 16k tokens | Model card |
| Tokyotech-llm | Llama-3.1-Swallow-8B-Instruct-v0.3 | 16k tokens | Model card |
| Other | E5-Mistral-7B-Instruct | 4k tokens | Model card |
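
When choosing a production model for a given prompt size, it can help to keep the context lengths in a small lookup table. The following is a minimal sketch, not part of any SambaNova SDK: the dictionary copies the model IDs and context lengths from the production models table, while the helper name `models_with_context` and the constant `TOKENS_PER_K` are hypothetical. It assumes "k" means 1,024 tokens, which the table does not specify.

```python
from typing import Dict, List

# Context lengths copied from the production models table,
# expressed in thousands of tokens ("k").
PRODUCTION_CONTEXT_K: Dict[str, int] = {
    "DeepSeek-R1": 16,
    "DeepSeek-R1-Distill-Llama-70B": 128,
    "Meta-Llama-3.3-70B-Instruct": 128,
    "Meta-Llama-3.2-3B-Instruct": 8,
    "Meta-Llama-3.2-1B-Instruct": 16,
    "Meta-Llama-3.1-405B-Instruct": 16,
    "Meta-Llama-3.1-8B-Instruct": 16,
    "Meta-Llama-Guard-3-8B": 8,
    "QwQ-32B": 16,
    "Llama-3.1-Swallow-8B-Instruct-v0.3": 16,
    "E5-Mistral-7B-Instruct": 4,
}

# Assumption: "k" means 1,024 tokens. The table does not say whether
# it means 1,000 or 1,024, so adjust this constant if needed.
TOKENS_PER_K = 1024


def models_with_context(min_tokens: int) -> List[str]:
    """Return production model IDs whose listed context length is at least min_tokens."""
    return [
        model_id
        for model_id, context_k in PRODUCTION_CONTEXT_K.items()
        if context_k * TOKENS_PER_K >= min_tokens
    ]


if __name__ == "__main__":
    # List the models that could hold a prompt of roughly 100,000 tokens.
    print(models_with_context(100_000))
```

The lookup only reflects the listed context lengths. It does not account for tokens reserved for the model's response, so in practice leave some headroom below the limit.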