msword-Azure-tts

This Python script converts a Microsoft Word document (.docx) into an MP3 audio file using Azure Cognitive Services Text-to-Speech API. It now supports conversion of long documents, overcoming the 10-minute limit of Azure TTS API by splitting the text and combining the resulting audio files.

Prerequisites

Python 3.6 or higher
An Azure subscription key for the Text-to-Speech service. Follow the instructions at BobTranslate to obtain an API key.
The region for the Azure Text-to-Speech service.
A voice shortname for the Text-to-Speech service. A list of available voices can be found at Language and voice support for the Speech service.
ffmpeg software package. This is required for splitting long documents into smaller chunks and combining the resulting audio files.

Installation

Clone the repository or download the source code. You can clone the repository by using the command:
```
git clone https://github.com/lancer1911/msword-Azure-tts.git
```
If you don't have Git installed, you can download the source code directly. Go to the repository's main page on GitHub, click on the "Code" button, and then click "Download ZIP". Once the ZIP file is downloaded, extract it to access the source code.
Install the required dependencies:
```
pip install -r requirements.txt
```
Install ffmpeg:

macOS:
```
brew install ffmpeg
```
Linux (Ubuntu/Debian):
```
sudo apt-get update
sudo apt-get install ffmpeg
```
Windows: Download a static build from the official site. Unzip the downloaded file and add the bin directory from the unzipped file to your system PATH.
Open the settings.cfg file and add your Azure subscription key, region, voice shortname, and speech recognition language:
```
[Azure]
subscription_key = your_subscription_key
region = your_region
voice_shortname = voice_shortname # e.g. en-US-EricNeural
speech_recognition_language = your_recognition_language # e.g. en-US
```
Replace YOUR_SUBSCRIPTION_KEY, YOUR_REGION, YOUR_VOICE_SHORTNAME, and YOUR_SPEECH_RECOGNITION_LANGUAGE with the appropriate values.

Usage

You can run the script by providing a .docx file path as a command-line argument or by selecting the file using a file dialog.

Using Command Line

python msword-Azure-tts.py sample.docx

Replace sample.docx with the path to your Word document.

Using File Dialog

Run the script without any command-line arguments:

python msword-Azure-tts.py

A file dialog will appear, allowing you to select the Word document to convert.

The script will save the generated MP3 audio file in the same directory as the input file with the same name and an -Azure-tts.mp3 suffix. For example, if the input file is named sample.docx, the output file will be named sample-Azure-tts.mp3.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
README-zh.md		README-zh.md
README.md		README.md
msword-Azure-tts.py		msword-Azure-tts.py
requirements.txt		requirements.txt
sample-Temple of Earth-CN.docx		sample-Temple of Earth-CN.docx
sample-Temple of Earth.docx		sample-Temple of Earth.docx
settings.cfg.template		settings.cfg.template

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

msword-Azure-tts

Prerequisites

Installation

Usage

Using Command Line

Using File Dialog

About

Uh oh!

Releases

Packages

Uh oh!

Languages

lancer1911/msword-Azure-tts

Folders and files

Latest commit

History

Repository files navigation

msword-Azure-tts

Prerequisites

Installation

Usage

Using Command Line

Using File Dialog

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages