Sariqat-al-Lahzat

YouTube Transcription and Audio Clipping

This repository provides tools and scripts for transcribing YouTube videos, extracting timestamps, generating subtitles, and clipping audio based on those subtitles. It aims to automate the extraction of information from YouTube videos and make that information easy to access.

Features

Automatic speech recognition to transcribe audio from YouTube videos.
Extraction of timestamps from the transcribed text.
Generation of subtitles in SRT format.
Searching for specific words in the subtitles and extracting matching subtitles.
Conversion of subtitles to CSV for further analysis.
Clipping audio files based on subtitle timestamps.
Powered by AI: speech recognition runs on models loaded through the torch and transformers libraries.
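
As a rough illustration of the timestamp and SRT features above, the sketch below turns timestamped text chunks into SRT text. It assumes the transcription step yields (start_seconds, end_seconds, text) tuples; that input shape and the function names `format_srt_time` and `chunks_to_srt` are hypothetical and are not taken from the repository's scripts.

```python
def format_srt_time(seconds: float) -> str:
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def chunks_to_srt(chunks) -> str:
    """Build SRT text from (start_seconds, end_seconds, text) tuples.

    The tuple layout is an assumption made for this example.
    """
    blocks = []
    for index, (start, end, text) in enumerate(chunks, start=1):
        # Each SRT cue: running number, time range, then the subtitle text.
        blocks.append(
            f"{index}\n"
            f"{format_srt_time(start)} --> {format_srt_time(end)}\n"
            f"{text.strip()}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [(0.0, 2.5, "first sentence"), (2.5, 4.0, "second sentence")]
    print(chunks_to_srt(demo))
```

Times are kept as integer milliseconds before formatting so that rounding happens once, rather than separately for each field.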

Requirements

To use the tools and scripts in this repository, you need the following:

Python 3.7 or higher
torch and transformers libraries for speech recognition
pydub library for audio processing
pandas library for data manipulation
youtube-dl library for downloading YouTube videos

Make sure you have these dependencies installed before running the scripts.

Usage

Clone the repository to your local machine:

git clone https://github.com/your-username/Sariqat-al-Lahzat.git

Install the required dependencies:

pip install torch transformers pydub pandas youtube-dl

Run the scripts in the repository to perform different tasks such as transcribing, generating subtitles, searching for specific words, and clipping audio. Make sure to provide the necessary input files and parameters as required by each script.
Customize and extend the functionality of the scripts according to your specific needs.
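
The search, CSV, and clipping tasks mentioned above could be sketched roughly as follows. This is a minimal illustration and not the repository's own code: the function names (`parse_srt`, `find_word`, `write_csv`, `clip_audio`) are hypothetical, the CSV step uses the standard-library csv module instead of pandas so that the example runs without extra installs, and the clipping step assumes pydub (with ffmpeg available) and a WAV output format.

```python
import csv
import re

# Matches SRT timestamps such as 00:01:02,345.
SRT_TIME = re.compile(r"(\d+):(\d{2}):(\d{2})[,.](\d{3})")


def srt_time_to_ms(stamp: str) -> int:
    """Convert an SRT timestamp to integer milliseconds."""
    match = SRT_TIME.fullmatch(stamp.strip())
    if match is None:
        raise ValueError(f"not an SRT timestamp: {stamp!r}")
    hours, minutes, seconds, millis = (int(g) for g in match.groups())
    return ((hours * 60 + minutes) * 60 + seconds) * 1000 + millis


def parse_srt(srt_text: str):
    """Parse SRT text into dicts with start_ms, end_ms, and text keys."""
    entries = []
    for block in re.split(r"\n\s*\n", srt_text.strip()):
        lines = block.splitlines()
        if len(lines) < 3 or "-->" not in lines[1]:
            continue  # skip malformed cues
        start, end = (srt_time_to_ms(part) for part in lines[1].split("-->"))
        text = " ".join(lines[2:]).strip()
        entries.append({"start_ms": start, "end_ms": end, "text": text})
    return entries


def find_word(entries, word: str):
    """Return the entries whose text contains the whole word, ignoring case."""
    pattern = re.compile(rf"\b{re.escape(word)}\b", re.IGNORECASE)
    return [entry for entry in entries if pattern.search(entry["text"])]


def write_csv(entries, path: str) -> None:
    """Write subtitle entries to a CSV file for further analysis."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=["start_ms", "end_ms", "text"])
        writer.writeheader()
        writer.writerows(entries)


def clip_audio(audio_path: str, entry: dict, out_path: str) -> None:
    """Cut one subtitle's time range out of an audio file (assumes pydub)."""
    from pydub import AudioSegment  # imported here so the rest runs without pydub

    audio = AudioSegment.from_file(audio_path)
    # pydub slices audio by milliseconds.
    audio[entry["start_ms"]:entry["end_ms"]].export(out_path, format="wav")
```

A typical flow under these assumptions would be to call `parse_srt` on a subtitle file's contents, pass the result to `find_word`, save the matches with `write_csv`, and call `clip_audio` once per match.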
