Brainbase Voice Template

This assignment requires you to create and deploy the Brainbase Voice service.

Introduction

At Brainbase, one of our most popular workers is our Voice worker which can make and receive natural sounding, low-latency calls.

Your first task at Brainbase is to create and deploy a simple version of this service.

Installing the template

Prerequisites

Before you begin, ensure you have Node.js and Python installed on your machine. If not, you can download and install it from Node.js official website and Python official website.

Installation

Clone the repository to your local machine:

git clone https://github.com/BrainbaseHQ/brainbase-voice-template
cd brainbase-voice-template

Using ngrok

To make your local endpoints publicly available, you will need to use ngrok. You can learn more about ngrok here but for the most part, it will be enought to use

ngrok http PORT

to create a public endpoint for your service at localhost:PORT.

Creating and connecting a Twilio phone number

For this project, you will need to create a phone number on Twilio, here's a rundown of the necessary steps:

Sign up for a Twilio account at twilio.com
Buy a phone number: Twilio usually charges around $1/mo for a phone number + usage cost which is a couple cents in most cases. We will be able to reimburse you for any charges up to $50 on Twilio after the take-home project.

3. Point the phone number to the public endpoint for the server

Components

The assignment has the following components:

server.js: Primary server that will receive the call details from Twilio
sockets.js/sockets.py: The websocket server to send and receive voice stream

Milestones

This is a challenging assignment. Therefore you're given the following milestones that get progressively more difficult, and provide necessary structure for how to implement the entire system.

Milestone 1: Echophone

For Milestone 1, you need to implement a service that will repeat the caller's speech back to them, by setting up the Websocket server and the server which will route the Twilio call here.

Criteria

[THIS HAS BEEN PROVIDED FOR YOU] server.js/server.py is successfully set up to receive call from Twilio and reroute to websocket server.
sockets.js/sockets.py successfully receives audio from the Twilio connection and can repeat the speech back to the caller.
Every Twilio number has its own unique identifier which gets passed into the websocket endpoint (the websocket should know which call_id each call is when processing).

Milestone 2: AI voice

For Milestone 2, you need to modify your sockets.js service to be able to have a conversation with the user. Speech coming into websocket is transcribed to text, sent to OpenAI, response is returned, response is converted to speech, and response is finally sent back to the Twilio client.

**TIP: While you can implement all of these from scratch, you can use Pipecat for some/most/all. Pipecat is an open-source, free AI package that has a lot of these services built in and makes it easy to create pipelines.

Criteria

TTS (Deepgram) is correctly implemented
LLM (OpenAI) is correctly implemented
STT (ElevenLabs) is correctly implemented
<1 second latency from end of caller speech to start of AI speech
User can have a call with the AI on the Twilio number
The system can run an unlimited number of these turns

Milestone 3: Scale up

For Milestone 3, you need to scale this voice service up to be able to take over 100 concurrent calls, by Dockerizing the service and deploying on Kubernetes.

Criteria

Service Dockerized
Running on Kubernetes locally with correct ports and permissions
>100 concurrency

Final run

Once all three milestones are succesfully completed, you should now have an AI worker that you can have a conversation with by calling the provided number!

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
server.js		server.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Brainbase Voice Template

Introduction

Installing the template

Prerequisites

Installation

Using ngrok

Creating and connecting a Twilio phone number

Components

Milestones

Milestone 1: Echophone

Criteria

Milestone 2: AI voice

Criteria

Milestone 3: Scale up

Criteria

Final run

About

Releases

Packages

Languages

BrainbaseHQ/brainbase-voice-template

Folders and files

Latest commit

History

Repository files navigation

Brainbase Voice Template

Introduction

Installing the template

Prerequisites

Installation

Using ngrok

Creating and connecting a Twilio phone number

Components

Milestones

Milestone 1: Echophone

Criteria

Milestone 2: AI voice

Criteria

Milestone 3: Scale up

Criteria

Final run

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages