Real-time voice
ElevenLabs
This tutorial walks you through building low-latency Conversational AI agents using ElevenLabs and SambaNova Cloud’s high-speed LLM inference engine. Low latency is crucial for smooth voice conversations, and SambaNova delivers this with specialized hardware optimized for world-class inference speeds using open-source models.
Pre-requisites
Before starting, ensure you have:
- Free account on SambaNova Cloud.
- Free account on Eleven Labs.
Setup
Follow these steps to set up your AI agent.
Access the Agent in ElevenLabs
- Go to the Agents page on ElevenLabs.
- Create a new agent, or select an existing agent to edit.
Configure the LLM settings
- Scroll to the LLM section of your agent settings.
- Select Custom LLM from the dropdown menu.
Retrieve SambaNova endpoint and model
- Open the SambaNova Cloud Playground.
- Select View Code in the top-right to get your model endpoint URL and model name.
Generate your SambaNova API key
- Go to your SambaNova Cloud account.
- Generate an API key from the portal.
Add API key to ElevenLabs
- Return to the ElevenLabs agent settings page.
- Under Workspace Secrets, add name and value.
- Name: SAMBANOVA_API_KEY.
- Value: Paste the API key from previous step.
This enables ElevenLabs to access your SambaNova model.
Set token limit
In the Limit token usage section, set maximum tokens to 1024
. This helps control the response length for optimal conversational flow.
Save and test
- Select Save to apply changes.
- Test your setup by selecting Test AI agent followed by Call AI agent.
- See video walkthrough for details.