Skip to content

TEN-framework/ten-framework

Repository files navigation


Table of Contents

Table of Contents


👋 Welcome to TEN

TEN is a comprehensive open-source ecosystem for creating, customizing, and deploying real-time conversational AI agents with multimodal capabilities including voice, vision, and avatar interactions.

TEN includes TEN Framework, TEN Turn Detection, TEN VAD, TEN Agent, TMAN Designer, and TEN Portal. Check out 🌍 TEN Ecosystem for more details.


Community Channel Purpose
Follow on X Follow TEN Framework on X for updates and announcements
Follow on LinkedIn Follow TEN Framework on LinkedIn for updates and announcements
Discord TEN Community Join our Discord community to connect with developers
Hugging Face Space Join our Hugging Face community to explore our spaces and models
WeChat Join our WeChat group for Chinese community discussions

Important

Star TEN Repositories ⭐️

Get instant notifications for new releases and updates. Your support helps us grow and improve TEN!


TEN star us gif


Star History


🎨 TMAN Designer

TMAN-Designer.mp4

TMAN Designer

TMAN Designer is a low/no-code option to create voice agents with an easy-to-use workflow UI. It can load apps and graphs, and includes an online editor, log viewer, and much more.

Check out this blog for more details.


✨ Features

TEN Agent with Trulience

1️⃣ Real-time Avatar

Build engaging AI avatars with TEN Agent using Trulience's diverse collection of free avatar options. To get it up and running, you only need 2 steps:

  1. Follow the README to finish setting up and running the Playground
  2. Enter the avatar ID and token you get from Trulience


TEN with MCP servers

2️⃣ Real-time voice with MCP servers

TEN Agent now integrates seamlessly with MCP servers, expanding its LLM capabilities. To get started:

  1. Open the Module Picker in Playground
  2. Add the MCP server tool for LLM integration
  3. Paste a URL from your MCP server in the extension
  4. Start a realtime conversation with TEN Agent

This integration allows you to leverage MCP's diverse servers offerings while maintaining TEN Agent's powerful conversational abilities.


esp32.mov

3️⃣ Real-time communication with hardware

TEN Agent is now running on the Espressif ESP32-S3 Korvo V3 development board, an excellent way to integrate realtime communication with LLM on hardware.

Check out the integration guide for more details.


Real-time Vision

4️⃣ Real-time vision and real-time screenshare detection

Try Google Gemini Multimodal Live API with realtime vision and realtime screenshare detection capabilities, it is a ready-to-use extension, along with powerful tools like Weather Check and Web Search integrated perfectly into TEN Agent.


TEN with Dify

5️⃣ TEN with other LLM platforms

TEN Agent + Dify

TEN offers a great support to make the realtime interactive experience even better on other LLM platform as well, check out docs for more.


TEN StoryTeller

6️⃣ StoryTeller - TEN image generation

Experience the real-time image generation with StoryTeller, it is a ready-to-use extension, along with powerful tools like Weather Check and Web Search integrated perfectly into TEN.


👩‍💻 Get TEN Agent up and running

🅰️ Run TEN Agent in localhost

Step ⓵ - Prerequisites

Category Requirements
Keys • Agora App ID and App Certificate (free minutes every month)
OpenAI API key (any LLM that is compatible with OpenAI)
Deepgram ASR (free credits available with signup)
Elevenlabs TTS (free credits available with signup)
Installation Docker / Docker Compose
Node.js(LTS) v18
Minimum System Requirements • CPU >= 2 Core
• RAM >= 4 GB

Note

macOS: Docker setting on Apple Silicon

Uncheck "Use Rosetta for x86/amd64 emulation" in Docker settings, it may result in slower build times on ARM, but performance will be normal when deployed to x64 servers.


Step ⓶ - Build agent in VM

1. Clone down the repo,cd to ai-agents and create .env file from .env.example
cd ai_agents
cp ./.env.example ./.env
2. Setup Agora App ID and App Certificate in .env
AGORA_APP_ID=
AGORA_APP_CERTIFICATE=

# In case you are running default voice-assistant example
# Deepgram (required for speech-to-text)
DEEPGRAM_API_KEY=your_deepgram_api_key_here

# OpenAI (required for language model)
OPENAI_API_KEY=your_openai_api_key_here
OPENAI_MODEL=gpt-4o

# ElevenLabs (required for text-to-speech)
ELEVENLABS_TTS_KEY=your_elevenlabs_api_key_here
3. Start agent development containers
docker compose up -d
4. Enter container
docker exec -it ten_agent_dev bash
5. Build agent with the default example ( ~5min - ~8min)

check the /examples folder for more examples

# use the chained voice assistant
cd agents/examples/voice-assistant

# or use the speech-to-speech voice assistant realtime
cd agents/examples/voice-assistant-realtime
6. Start the web server
task install
# run task build if you changed any local source code, this is necessary if you are working on languages which require compilation like TypeScript or Golang, while not needed for languages like Python
# task build

task run
7. Access the agent

Step ⓷ - Customize your agent with TMAN Designer

  1. Open localhost:49483.
  2. Right-click on the STT, LLM, and TTS extensions.
  3. Open their properties and enter APIs respectively.
  4. Right-click the canvas and select 'Manage Apps' to open the Apps Manager.
  5. Right under the Actions, click the ▶ to run the App.
  6. Check the 'Run with TEN Agent' option and click the Run button.


🅱️ Run TEN Agent in Codespace(no docker)

GitHub offers free Codespace for each repository, you can run the playground in Codespace without using Docker.Also, the speed of Codespace is much faster than localhost.

Check out this guide for more details.


🛳️ TEN Agent Self Hosting

🅰️ Deploying with Docker

Once you have customized your agent (either by using the TMAN Manager, Playground, or editing property.json directly), you can deploy it by creating a release Docker image for your service.

Release as Docker image

Note: The following commands need to be executed outside of any Docker container.

Build image
cd ai_agents
docker build -f agents/examples/<example-name>/Dockerfile -t example-app .
Run
docker run --rm -it --env-file .env -p 3000:3000 example-app

🅱️ Deploying with other cloud services

coming soon


🌍 TEN Ecosystem

Project Preview
🏚️ TEN Framework
TEN is an open-source framework for real-time, multimodal conversational AI.

️🔂 TEN Turn Detection
TEN is for full-duplex dialogue communication.

🔉 TEN VAD
TEN VAD is a low-latency, lightweight and high-performance streaming voice activity detector (VAD).

🎙️ TEN Agent
TEN Agent is a showcase of TEN Framewrok.

🎨 TMAN Designer
TMAN Designer is low/no code option to make a voice agent with easy to use workflow UI.

📒 TEN Portal
The official site of TEN framework, it has documentation and blog.



❓ Ask Questions

TEN Framework is available on these AI-powered Q&A platforms. They can help you find answers quickly and accurately in multiple languages, covering everything from basic setup to advanced implementation details.

Service Link
DeepWiki Ask DeepWiki
ReadmeX ReadmeX

🥰 Contributing

We welcome all forms of open-source collaboration! Whether you're fixing bugs, adding features, improving documentation, or sharing ideas - your contributions help advance personalized AI tools. Check out our GitHub Issues and Projects to find ways to contribute and show your skills. Together, we can build something amazing!


Tip

Welcome all kinds of contributions 🙏

Join us in building TEN better! Every contribution makes a difference, from code to documentation. Share your TEN Agent projects on social media with to inspire others!

Connect with one of the TEN maintainers @elliotchen100 on 𝕏 or @cyfyifanchen on GitHub for project updates, discussions and collaboration opportunities.


Code Contributors

TEN

Contribution Guidelines

Contributions are welcome! Please read the contribution guidelines first.

License

  1. The entire TEN framework (except for the folders explicitly listed below) is released under the Apache License, Version 2.0, with additional restrictions. For details, please refer to the LICENSE file located in the root directory of the TEN framework.

  2. The components within the packages directory are released under the Apache License, Version 2.0. For details, please refer to the LICENSE file located in each package's root directory.

  3. The third-party libraries used by the TEN framework are listed and described in detail. For more information, please refer to the third_party folder.

About

Open-source framework for conversational voice AI agents

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors 58