Azure AI Speech Application

Azure AI Speech Application, an interactive application designed for Speech-to-Text (STT) and Text-to-Speech (TTS) functionalities using Azure Cognitive Services and Gradio. This application combines the power of Azure's APIs with an intuitive interface provided by Gradio, enabling efficient speech processing for developers and end-users.

Features

Speech-to-Text (STT): Convert audio inputs (e.g., microphone recordings) into accurate text transcriptions using Azure's Speech-to-Text API.
Text-to-Speech (TTS): Generate natural-sounding audio from text input using Azure's Text-to-Speech API with support for customizable voices.
Interactive Gradio Interface: A user-friendly interface for real-time testing and interaction.
Demo Examples: Includes example outputs (demo.png, demo.mp4) to showcase the platform's capabilities.

File Structure

project/
├── app.py              # Main application script
├── requirements.txt    # List of dependencies
├── README.md           # Project documentation
├── .env                # Environment variables (not shared in version control)
├── demo/               # Demo files showcasing platform capabilities
│   ├── demo.png        # Screenshot of the interface
│   ├── demo.mp4        # Video demonstration
└── utils/              # Utility functions for Azure APIs
    └── azure_speech.py # Helper functions for STT and TTS

Setup Instructions

1. Clone the Repository

Clone this repository to your local system:

git clone <repository-url>
cd project

2. Install Dependencies

Install the required Python libraries using the requirements.txt file:

pip install -r requirements.txt

3. Set Up Environment Variables

Create a .env file in the root directory with the following content:

AZURE_SPEECH_REGION=<your-region>
AZURE_SPEECH_KEY=<your-api-key>

4. Run the Application

Start the application with:

python app.py

Usage

Speech-to-Text (STT):
- Record or upload an audio file using the microphone input.
- View the transcribed text in real-time.
Text-to-Speech (TTS):
- Enter the text you want to convert to speech.
- Download or play the generated audio file directly from the interface.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Azure AI Speech Application

Features

File Structure

Setup Instructions

1. Clone the Repository

2. Install Dependencies

3. Set Up Environment Variables

4. Run the Application

Usage

Interface Screenshot

Demo Video

Dependencies

License

Acknowledgments

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
demo		demo
utils		utils
LICENSE		LICENSE
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

License

seonokkim/azure-ai-speech

Folders and files

Latest commit

History

Repository files navigation

Azure AI Speech Application

Features

File Structure

Setup Instructions

1. Clone the Repository

2. Install Dependencies

3. Set Up Environment Variables

4. Run the Application

Usage

Interface Screenshot

Demo Video

Dependencies

License

Acknowledgments

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages