Create and use endpoints
With the SambaStudio platform, you can generate predictions from your models on your data by deploying it to an endpoint.
This document describes how to create, view, and edit endpoints.
Create an endpoint using the GUI
Follow the sequence below to create an endpoint using the GUI to be used for prediction.
Base models do not support inference and cannot be deployed for endpoints. It is recommended to use Base models for training and not inference. |
-
Create a new project or use an existing one.
-
From a project window, click New endpoint. The Add an endpoint window (Figure 1) will open.
-
In the Add an endpoint window, enter a name for the endpoint into the Endpoint name field, as shown in Figure 1.
-
The Share settings drop-down provides options for which tenant to share your endpoint, as shown in Figure 1.
-
Share with <current-tenant> allows the endpoint to be shared with the current tenant you are using, identified by its name in the drop-down.
-
Share with all tenants allows the endpoint to be shared across all tenants.
-
Endpoint will be shared with all users in <current-tenant> identifies that the endpoint will be shared with other users in the tenant you are using.
If the Endpoint will be shared with all users in <current-tenant> option is displayed, the Share with <current-tenant> and Share with all tenants options described above will not be available. Share with all tenants is an optional feature of SambaStudio. Please contact your administrator or SambaNova representative for more information.
-
-
Select the ML App from the ML App drop-down, as shown in Figure 1.
-
From the Select model drop-down, choose My models, Shared models, SambaNova models, or Select from Model Hub.
-
My models displays a list of models that you have previously added to the Model Hub.
-
Shared models displays a list of models that have been shared with the selected active tenant.
-
SambaNova models displays a list of models provided by SambaNova.
Figure 1. Add an endpoint window -
Select from Model Hub displays a window with a list of downloaded models that correspond to a selected ML App, as shown in Figure 2, or a list of all the downloaded models if an ML App is not selected. The list can be filtered by selecting options under Field of application, ML APP, Architecture, and Owner. Additionally, you can enter a term or value into the Search field to refine the model list by that input. Choose the model you wish to use and confirm your choice by clicking Use model.
Figure 2. Select from Model Hub
-
-
Enter a value for the Number of instances to designate how many instances will be deployed, as shown in Figure 3. The Number of instances setting helps the endpoint scale to accommodate a high volume number of requests. You can adjust the value based on the expected volume number of requests and available resources.
-
Click Add an endpoint, as shown in Figure 3, to queue the endpoint for deployment. Once the endpoint is created, it will take a few minutes for the endpoint to be deployed.
Figure 3. Number of instances, add an endpointYour endpoint is deployed and will be available to generate predictions when the endpoint status displays Live.
Create an endpoint using the CLI
The example below demonstrates how to create an endpoint using the snapi endpoint create
command. You will need to provide the following:
-
A project to assign the endpoint. Create a new project or use an existing one.
-
A new endpoint name.
-
The model to be used for the endpoint.
-
The number of instances to be used for the endpoint.
$ snapi endpoint create \
--project <project-name> \
--endpoint_name <endpoint-name> \
--model-checkpoint <model-name> \
--instances <number-of-instances>
View an endpoint using the GUI
Open the Endpoint window to view detailed information about the endpoint.
-
Click Projects from the left menu to navigate to the Projects list.
Figure 4. Project list -
Click the Project that contains the endpoint you wish to view. That Project window will open displaying the Endpoints table, which is located beneath the Jobs table. The Endpoints table provides a summary of information about each endpoint, including:
-
Name displays the name of the endpoint.
-
Model displays the associated model of the endpoint.
-
Instances displays the number of instances of the endpoint.
-
Status displays the current status of the endpoint.
-
Date of creation displays the date and time the endpoint was created.
-
Owner displays the owner of the endpoint.
-
Actions provides additional interactions to the endpoint via a drop-down menu.
Figure 5. Endpoints table
-
-
For detailed information about the endpoint, click the endpoint you wish to view from the list. The Endpoint window will open. The Endpoint window displays the following information:
-
The status of the endpoint is displayed in the top Details bar.
-
The endpoint name is displayed in the top of the window and in the breadcrumb path at the top of the window.
-
URL provides the URL of the endpoint to use programmatically to make requests, such as generating predictions. Use the copy icon to temporarily store the URL (copy) to your computer’s clipboard.
-
ML App displays the ML App that was selected when the endpoint was created.
-
Model/Checkpoint displays the endpoint’s associated model/checkpoint.
-
The API key allows you to make authenticated requests of the endpoint URL, such as generating predictions. Use the copy icon to temporarily store the endpoint API key (copy) to your computer’s clipboard.
The endpoint API key and the platform API key have two distinct implementations.
-
The endpoint API key is a unique API key generated by the platform for each endpoint.
-
The platform API key is used as an authorization key for authenticated access to the platform. See SambaStudio resources for information about the platform API key.
Figure 6. Endpoint window -
-
View endpoint info using the CLI
The example below demonstrates how to use the snapi endpoint info
command to view the info of an endpoint. You will need to provide the following:
-
The project the endpoint is assigned.
-
The endpoint name.
snapi endpoint info \
--project <project-name> \
--endpoint <endpoint-name>
Edit an endpoint using the GUI
The platform provides the ability to edit an existing endpoint.
-
From an Endpoint window, click Edit, as shown in Figure 7. Or, from the Endpoints table, click Edit from the Actions drop-down, as shown in Figure 8. The Update the endpoint window will open.
Figure 7. Edit endpoint from the endpoint windowFigure 8. Edit endpoint from the endpoints table -
As shown in Figure 9, from the Update the endpoint window you can:
-
Enter a new description into the Description field.
-
Update the tenant to share your endpoint from the Share settings drop-down.
-
Choose a different model from the Select Model drop-down.
-
Update the Number of instances.
-
-
Click Edit endpoint to complete process and send the endpoint to the queue.
Figure 9. Update the endpoint
The endpoint’s new description, model, and number of instances will be reflected in the endpoints table. |
Edit an endpoint using the CLI
The example below demonstrates how to use the snapi endpoint update
command to edit and update an endpoint. You will need to provide the following:
-
The project the endpoint is assigned.
-
The endpoint name.
Similar to the GUI, you can:
-
Change the number of instances by using the
--instances
option. -
Change the description by using the
--description
option.
$ snapi endpoint update \
--project <project-name> \
--description <new-description> \
--endpoint <endpoint-name> \
--instances <number-of-instances>
Stop and restart an endpoint using the GUI
To free up Reconfigurable Dataflow Unit™ (RDU) resources in the platform, you may wish to stop an endpoint from running.
Stop an endpoint
Do the following to stop an existing endpoint from running.
-
From an Endpoint window, click Stop as shown in Figure 10. An alert box will open..
Figure 10. Stop endpoint -
The alert box (Figure 11) displays a list of the active models in the endpoint that will stop running.
-
As shown in Figure 11, click Yes to confirm that you want to stop the endpoint from running. Click Cancel to return the Endpoint window.
Figure 11. This endpoint will stop running box
Restart an endpoint
From an Endpoint window, click Restart, as shown in Figure 12. The platform will restart the endpoint and will notify you that the endpoint is restarting.

The status of a restarted endpoint is displayed in the endpoint’s top Details bar. |
Delete an endpoint using the GUI
Do the following to delete an existing endpoint.
-
From an Endpoint window, click Delete as shown in Figure 13. An alert window box open.
Figure 13. Delete endpointThe alert box (Figure 14) displays a message that the endpoint will be deleted and no longer available. The alert also lists the endpoint that will be deleted.
-
Click Yes to finalize deleting the endpoint, as shown in Figure 14. Click Cancel to return the Endpoint window.
Figure 14. Delete alert window
Delete an endpoint using the CLI
The example below demonstrates how to use the snapi endpoint delete
command to delete an endpoint. You will need to provide the following:
-
The project the endpoint is assigned.
-
The endpoint name.
$ snapi endpoint delete \
--project <project-name> \
--endpoint <endpoint-name> \
Endpoint new-endpoint-snapi successfully marked delete.