Speaker Recognizer

This project is a simple speaker recognition application that uses machine learning to identify speakers from audio recordings. The application is built with Python and utilizes various libraries for audio processing and visualization.

Features

Record audio and identify the speaker.
Import WAV files and identify the speaker.
Visualize audio spectrum during recording.

Requirements

Python 3.7 or higher.
Required libraries: numpy, matplotlib, librosa, scikit-learn, sounddevice, pyaudio, tkinter, scipy

Installation

Clone the repository:

git clone https://github.com/yourusername/speaker-recognizer.git
cd speaker-recognizer

Install the required Python libraries:

pip install numpy matplotlib librosa scikit-learn sounddevice pyaudio scipy

Ensure you have the following files in the project directory:
- microphone.ico
- background.png
- background2.png
- img0.png
- img1.png
- img2.png
Create a directory named Voices in the project directory. This folder should contain WAV files for training the model. The filenames should start with the speaker's name (e.g., alice_01.wav, bob_02.wav).

Usage

Run the Application:
```
python main.py
```
Main Window:
- Click on the microphone icon to start recording.
- Click on the upload icon to import a WAV file.
Recording Interface:
- Recording starts automatically after a 1-second delay.
- The predicted speaker is displayed after recording.
- The audio spectrum is visualized in real-time.

Code Overview

main.py: Main application script.
Voices/: Directory containing training audio files.
extract_features(): Function to extract features from audio files.
import_wav(): Function to import a WAV file and identify the speaker.
change_colors(): Function to switch to the recording interface and start recording.

Training the Model

The model is trained using the K-Nearest Neighbors (KNN) algorithm. Training data is loaded from the Voices directory. MFCC features are extracted from the audio files for training.

Example Training Data Structure

Voices/
├── alice_01.wav
├── alice_02.wav
├── bob_01.wav
└── bob_02.wav

Acknowledgements

This project uses the following open-source libraries:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speaker Recognizer

Features

Requirements

Installation

Usage

Code Overview

Training the Model

Example Training Data Structure

Acknowledgements

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
Speaker Recognition.py		Speaker Recognition.py
background.png		background.png
background2.png		background2.png
img0.png		img0.png
img1.png		img1.png
img2.png		img2.png
microphone.ico		microphone.ico

iiiiOreo/Speaker-Recognition

Folders and files

Latest commit

History

Repository files navigation

Speaker Recognizer

Features

Requirements

Installation

Usage

Code Overview

Training the Model

Example Training Data Structure

Acknowledgements

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages