Skip to main content
SambaCloud currently supports the following models for all developer and enterprise accounts:

Production models

Production models are intended for use in production environments and meet our high standards for speed and quality.
DeveloperModel IDContext lengthView on Hugging FaceModel evaluation report
DeepSeek
DeepSeek-R1-052832k tokensModel cardLatticeFlow AI report
DeepSeek-V3-032432k tokensModel cardLatticeFlow AI report
DeepSeek-V3.132k tokensModel card
DeepSeek-R1-Distill-Llama-70B128k tokensModel card
Meta
Meta-Llama-3.3-70B-Instruct128k tokensModel cardLatticeFlow AI report
Meta-Llama-3.1-8B-Instruct16k tokensModel cardLatticeFlow AI report

Preview models

Preview models are intended for evaluation purposes and developer experimentation only, and should not be used in production environments. These models have limited capacity and may be removed at short notice.
DeveloperModel IDContext lengthMax file size1View on Hugging FaceModel evaluation report
Meta
Llama-4-Maverick-17B-128E-Instruct128k tokensUp to 5 images, each ≤ 20 MBModel cardLatticeFlow AI report
OpenAI
gpt-oss-120b128k tokensModel card
Whisper-Large-v3N/A25MBModel card
Qwen
Qwen3-32B8k tokensN/AModel cardLatticeFlow AI report
Tokyotech-llm
Llama-3.3-Swallow-70B-Instruct-v0.416k tokensN/AModel card
Other
E5-Mistral-7B-Instruct4k tokensN/AModel card
1Audio models only.

I