-
Bundle template: A bundle template defines what can be deployed together on a single node. It contains one or more model templates, each listing supported configurations such as sequence length and batch size.
- Example - Model templates:
DeepSeek-R1-0528-Template
— supports sequence lengths 8k and 4k, with batch sizes 1, 2, 4, and 8 for DeepSeek-R1-0528.DeepSeek-V3-0324-Template
— supports sequence lengths 8k and 4k, with batch sizes 1, 2, 4, and 8 for DeepSeek-V3-0324.
- Example - Bundle template:
deepseek-r1-v3-fp8-32k-Template
— the bundle template includes both DeepSeek-R1-0528-Template and DeepSeek-V3-0324-Template.
- Example - Model templates:
-
Bundle: A bundle is created by binding actual checkpoints to the model templates defined in a bundle template.
- Example - Bundle:
deepseek-r1-v3-fp8-32k
— a Bundle that binds the R1-0528 checkpoint and the V3-0324 checkpoint to their respective Model Templates.
- Checkpoints: Saved states of a machine learning model at a particular point during training.
- PEFs: PEFs are compiler runtime files optimized for inference with specific sequence lengths, batch sizes, and model configurations.
Version management and private assets
Before reviewing specific update procedures, it is important to understand how SambaStack organizes and manages artifact versions. This ensures your deployments remain stable and compatible as new models and software revisions are introduced.Model and software updates
This section explains how SambaStack handles updates for models and software, helping keep your deployments current and compatible with minimal disruption.- Model updates are delivered by updating folders in GCS. Existing deployments remain unchanged until updated by customers.
- SambaStack software releases include all relevant bundle and model updates. Details of each release are provided in the SambaStack release notes.
- The update procedure involves:
- Reinstalling the SambaStack tarball from the Google Artifact Registry (GAR).
-
Using the included
values.yaml
to point to the correct Docker artifact paths and deployment YAMLs (details provided with each release). -
Applying changes using:
- New model and artifact versions are designed to maintain compatibility and avoid breaking existing deployments or APIs. Any exceptions will be communicated in release notes.
Private artifacts
Private artifacts are custom models and checkpoints managed separately to protect sensitive or proprietary data. This section explains how access to these assets is protected and controlled.- Custom models and checkpoints are managed in separate storage buckets.
- Access to private artifacts is restricted based on defined rules to ensure security and compliance.
- Access to private artifacts is restricted through segregation and access control policies to ensure their security and compliance.