Skip to main content
SambaCloud currently supports the following models for all developer accounts.

Model Lanes

  • Fast Lane: These models are a showcase of our unique performance advantages and speeds.
  • High Volume Lane: These models run with continuous batching — able to sustain high-volume workloads with large batch sizes.

Production models

Production models are intended for use in production environments and meet our high standards for speed and quality.
DeveloperModel laneModel IDContext lengthView on Hugging FaceModel evaluation report
MiniMax
Fast LaneMiniMax-M2.5160k tokensModel card
DeepSeek
Fast LaneDeepSeek-V3.1128k tokensModel card
Meta
Fast LaneMeta-Llama-3.3-70B-Instruct128k tokensModel cardLatticeFlow AI report
OpenAI
Fast Lanegpt-oss-120b128k tokensModel card

Preview models

Preview models are intended for evaluation purposes and developer experimentation only, and should not be used in production environments. These models have limited capacity and may be removed at short notice.
DeveloperModel laneModel IDContext lengthMax file size1View on Hugging FaceModel evaluation report
DeepSeek
High Volume LaneDeepSeek-V3.28k tokensModel card
Meta
Fast LaneLlama-4-Maverick-17B-128E-Instruct128k tokensUp to 5 images, each ≤ 20 MBModel cardLatticeFlow AI report
1 Applies to multimodal (image/audio) inputs where supported.