Nodes

Input Nodes

Load Audio

Load an audio file from the input/ folder.

Load Audio from Video

Load the audio component of a video file from the input/ folder.

Load Audio from Batch

Load a batch of audio files for processing. Files can be selected from the input folder or any arbitrary path on the system. Supports simultaneous audio and video input as well as regex filtering for file selection.

Model Nodes

Load Whisper Model

Create a new whisper pipeline to transcribe audio. Supports selecting the input language.

Note: Automatic language detection can occaisionally produce errors. If you're having trouble, try specifying the language manually.

Processing Nodes

Whisper Transcribe

Transcribe the audio using the whisper model. Includes options for saving files to the output/transcription folder.

Whisper Transcribe Batch

Transcribe a batch of audio files using the whisper model. Includes options for saving files to the output/transcription folder.

Output Nodes

Audio Sink

Saves wav_bytes to a file in the output/audio folder. I only created this for debugging purposes.

Utility Nodes

Convert VHS Audio to WAV bytes

Converts audio from Video Helper Suite audio type to wav_bytes type. VHS is using wav_bytes under the hood, but currently the way ComfyUI handles type checking, it's necessary to convert it by name using this node.

Conversion like this is not recommended for even mildly long input videos. The VHS node is primarily to load the frames of videos, requiring significant memory usage for the image data. Long videos may OOM or at least take a LONG time.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nodes.md

nodes.md

Nodes

Input Nodes

Load Audio

Load Audio from Video

Load Audio from Batch

Model Nodes

Load Whisper Model

Processing Nodes

Whisper Transcribe

Whisper Transcribe Batch

Output Nodes

Audio Sink

Utility Nodes

Convert VHS Audio to WAV bytes

Files

nodes.md

Latest commit

History

nodes.md

File metadata and controls

Nodes

Input Nodes

Load Audio

Load Audio from Video

Load Audio from Batch

Model Nodes

Load Whisper Model

Processing Nodes

Whisper Transcribe

Whisper Transcribe Batch

Output Nodes

Audio Sink

Utility Nodes

Convert VHS Audio to WAV bytes