Voice Chat with PDFs

This is a LlamaIndex project using Next.js.

This is an example based on openai/openai-realtime-console, extending it with a simple RAG system using LlamaIndexTS.

Prerequisites

The project requires an OpenAI API key (user key or project key) that has access to the Realtime API. Set the key in the .env file or as an environment variable OPENAI_API_KEY.
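A missing key is easiest to diagnose when it is checked once at startup. The following is a minimal sketch of such a check, not code from this project: the helper name `resolveOpenAIKey` and the `Env` type are hypothetical, and the function takes the environment as a parameter so it can be tried without a real key.

```typescript
// Hypothetical helper (not part of this project): read OPENAI_API_KEY from an
// environment-like object and fail early with a readable message.
type Env = Record<string, string | undefined>;

function resolveOpenAIKey(env: Env): string {
  // Treat an unset variable and a whitespace-only value the same way.
  const key = (env["OPENAI_API_KEY"] ?? "").trim();
  if (key === "") {
    throw new Error(
      "OPENAI_API_KEY is not set; add it to .env or export it in your shell.",
    );
  }
  return key;
}
```

In a Node.js process this would typically be called as `resolveOpenAIKey(process.env)`, assuming the .env file has already been loaded into the environment.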

Getting Started

First, install the dependencies:

npm install

Second, generate the embeddings of the documents in the ./data directory:

npm run generate

The example PDF is about physical letter standards, but you can use your own documents instead.
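The embeddings are generated ahead of time so that, at query time, the passages most similar to a question can be looked up and handed to the model. The sketch below illustrates that lookup with plain cosine similarity over toy vectors. It is not LlamaIndexTS code and not necessarily how this project retrieves passages; the `Chunk` type and the `cosineSimilarity` and `topK` functions are hypothetical names chosen for illustration.

```typescript
// Illustrative only: a chunk of document text paired with its embedding vector.
interface Chunk {
  text: string;
  embedding: number[];
}

// Cosine similarity between two vectors of equal length.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("dimension mismatch");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  // A zero vector has no direction; treat it as not similar to anything.
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Return the k chunks whose embeddings are closest to the query embedding.
function topK(query: number[], chunks: Chunk[], k: number): Chunk[] {
  return chunks
    .map((chunk) => ({ chunk, score: cosineSimilarity(query, chunk.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map((scored) => scored.chunk);
}
```

Real embeddings have hundreds or thousands of dimensions and are usually stored in a vector index rather than scanned linearly, but the ranking idea is the same.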

Third, run the development server:

npm run dev

Open http://localhost:3000 with your browser to see the result.

Using the console

You'll be prompted on startup to enter the API key again (this needs to be fixed).

To start a session you'll need to connect. This will require microphone access. You can then choose between manual (Push-to-talk) and vad (Voice Activity Detection) conversation modes, and switch between them at any time.
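The two conversation modes differ in who decides when a turn ends: in manual mode the user does, and in vad mode the server detects speech boundaries. The sketch below shows one way the mode could be translated into a session setting, assuming the session is configured through a `session.update` event with a `turn_detection` field, as in the OpenAI Realtime API. The function names are hypothetical, and how this project wires the switch internally is not shown here.

```typescript
// The two modes named in the console UI.
type ConversationMode = "manual" | "vad";

// Assumption: null disables server-side turn detection (push-to-talk),
// while { type: "server_vad" } lets the server detect the end of speech.
type TurnDetection = { type: "server_vad" } | null;

function turnDetectionFor(mode: ConversationMode): TurnDetection {
  return mode === "vad" ? { type: "server_vad" } : null;
}

// Build the event payload that would be sent when the user switches modes.
function buildSessionUpdate(mode: ConversationMode) {
  return {
    type: "session.update",
    session: { turn_detection: turnDetectionFor(mode) },
  };
}
```

Keeping the mapping in a pure function means the mode can be switched mid-session by sending a fresh payload, without touching the rest of the session configuration.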

You can freely interrupt the model at any time in push-to-talk or VAD mode.

Learn More

To learn more about LlamaIndex, take a look at the following resources:

LlamaIndex Documentation - learn about LlamaIndex (Python features).
LlamaIndexTS Documentation - learn about LlamaIndex (TypeScript features).

You can check out the LlamaIndexTS GitHub repository - your feedback and contributions are welcome!
