title | emoji | colorFrom | colorTo | sdk | app_port | app_file |
---|---|---|---|---|---|---|
AI Audio-to-Video Converter |
🎥 |
blue |
green |
docker |
7860 |
app.py |
Convert audio to video with synchronized images using Hugging Face models.
- Transcription:
openai/whisper-large-v3
- Image Acquisition: Stable Diffusion or Google scraping with caching
- Video Assembly: MoviePy with FFmpeg
- Enhancements: Robust error handling, image caching, temporary file cleanup
- Upload audio (max 50 min).
- Select video format (9:16, 16:9, or 1:1).
- Choose image method (Scraper or Generator).
- Generate and download the video.
- Pinned
huggingface_hub==0.20.3
fordiffusers
compatibility. - Updated
gr.Audio
for Gradio 4.19.2 compatibility. - Added logging and cleanup for improved debugging and resource management.