New Pipeline: Talking Head — Turn raw footage into polished social videos #2
calesthio
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Talking Head Pipeline is here
We just shipped the talking-head pipeline — a complete system that takes your raw talking-head footage (webcam, phone, camera) and transforms it into a polished, social-ready video with animated captions, enhancements, background music, and multi-clip assembly.
Think: record yourself talking for 3 minutes, paste a prompt, and get back a finished Instagram Reel / TikTok / YouTube Short — with jump cuts, eye enhancement, face-tracked reframing, word-by-word animated captions, and background music that fades in only during your speech sections.
What can you do with it?
Single talking-head video:
Multi-clip showcase reel:
Podcast clip extraction:
8 New Tools Built
remove,speed_up,mark.Plus updates to
subtitle_genwith acorrectionsdictionary for fixing common ASR mistakes (e.g. "cloud" → "Claude").The Enhancement Chain
The pipeline runs enhancements in this order:
Every step is optional — the agent checks what tools are available and adapts.
Try It — Sample Prompt
Drop your raw footage into a project folder and use this prompt:
Or for a multi-clip reel:
What you need
We'd love to see what you make with it. Share your results in Show and tell!
Beta Was this translation helpful? Give feedback.
All reactions