This tutorial walks you through building low-latency Conversational AI agents using ElevenLabs and SambaNova Cloud's high-speed LLM inference engine. Low latency is crucial for smooth voice conversations, and SambaNova's specialized hardware delivers world-class inference speeds on open-source models.

Prerequisites

Before starting, ensure you have:

  1. An ElevenLabs account with access to Conversational AI agents.
  2. A SambaNova Cloud account for generating an API key.

Setup

Follow these steps to set up your AI agent.

Access the Agent in ElevenLabs

  1. Go to the Agents page on ElevenLabs.
  2. Create a new agent, or select an existing agent to edit.

Configure the LLM settings

  1. Scroll to the LLM section of your agent settings.
  2. Select Custom LLM from the dropdown menu.

Retrieve SambaNova endpoint and model

  1. Open the SambaNova Cloud Playground.
  2. Select View Code in the top-right to get your model endpoint URL and model name.
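The values from the View Code panel slot into an OpenAI-compatible chat-completions request, which is what ElevenLabs sends to a custom LLM. A minimal sketch of that request shape (the base URL and model name below are placeholders; substitute the exact values shown in your View Code panel):

```python
import json

def build_chat_request(base_url: str, model: str, user_message: str,
                       api_key: str, max_tokens: int = 1024):
    """Assemble the pieces of an OpenAI-compatible chat-completions request.

    base_url and model are placeholders here; copy the real values from the
    View Code panel in the SambaNova Cloud Playground.
    """
    url = base_url.rstrip("/") + "/chat/completions"
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": user_message}],
        "max_tokens": max_tokens,
    })
    return url, headers, body

url, headers, body = build_chat_request(
    "https://api.sambanova.ai/v1",     # assumed base URL; confirm in View Code
    "Meta-Llama-3.1-8B-Instruct",      # example model name; confirm in View Code
    "Hello!",
    api_key="YOUR_SAMBANOVA_API_KEY",  # placeholder; never hard-code real keys
)
print(url)
```

ElevenLabs constructs this request for you once the endpoint, model, and key are configured; the sketch is only to show where each value ends up.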

Generate your SambaNova API key

  1. Go to your SambaNova Cloud account.
  2. Generate an API key from the portal.

Add API key to ElevenLabs

  1. Return to the ElevenLabs agent settings page.
  2. Under Workspace Secrets, add a new secret.
  3. Name: SAMBANOVA_API_KEY.
  4. Value: paste the API key generated in the previous step.

This enables ElevenLabs to access your SambaNova model.
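If the agent later fails to respond, it helps to confirm the key works outside ElevenLabs first. A hedged stdlib-only sketch (the URL and model name are assumptions; use your View Code values) that only fires a live request when SAMBANOVA_API_KEY is set in your environment:

```python
import json
import os
import urllib.request

API_URL = "https://api.sambanova.ai/v1/chat/completions"  # assumed; confirm in View Code
MODEL = "Meta-Llama-3.1-8B-Instruct"                      # example; use your model name

def check_key(api_key: str) -> str:
    """Send one short completion request to verify the key is accepted."""
    req = urllib.request.Request(
        API_URL,
        data=json.dumps({
            "model": MODEL,
            "messages": [{"role": "user", "content": "Say OK."}],
            "max_tokens": 8,
        }).encode(),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]

key = os.environ.get("SAMBANOVA_API_KEY")
if key:
    print(check_key(key))
else:
    print("SAMBANOVA_API_KEY not set; skipping live check")
```

A 401 response here means the key is wrong or revoked, and the same key will fail inside ElevenLabs too.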

Set token limit

In the Limit token usage section, set the maximum number of tokens to 1024. Capping response length keeps replies concise, which suits conversational flow.
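For a rough sense of what the 1024-token cap means in speech, a back-of-envelope calculation (the words-per-token ratio and speaking rate below are common heuristics, not exact tokenizer figures):

```python
# Back-of-envelope sizing for the 1024-token cap.
MAX_TOKENS = 1024
WORDS_PER_TOKEN = 0.75          # heuristic: ~0.75 English words per token
SPOKEN_WORDS_PER_MINUTE = 150   # typical conversational speaking rate

max_words = MAX_TOKENS * WORDS_PER_TOKEN
max_minutes = max_words / SPOKEN_WORDS_PER_MINUTE
print(f"~{max_words:.0f} words, ~{max_minutes:.1f} minutes of speech")
```

So 1024 tokens is a generous ceiling (on the order of several hundred words), not a target; typical conversational turns stay far below it.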

Save and test

  1. Select Save to apply changes.
  2. Test your setup by selecting Test AI agent followed by Call AI agent.
  3. See video walkthrough for details.

Video walkthrough