Skip to content

Conversation

@ronjer30
Copy link
Contributor

@ronjer30 ronjer30 commented Oct 2, 2025

Description

Adds support for the Nemotron Nano 12B V2 VLM

Usage

  1. Download model checkpoints
  2. Run
python tutorials/video/getting-started/video_split_clip_example.py \
  --video-dir </path/to/videos_directory/> \
  --output-clip-path ./outputs \
  --generate-captions \
  --captioning-algorithm nemotron \
  --nemotronh-vl-model-path </path/to/checkpoints/> \
  --captioning-batch-size 8 \
  --no-generate-embeddings 

Checklist

  • I am familiar with the Contributing Guide.
  • New or Existing tests cover these changes.
  • The documentation is up to date with these changes.

@copy-pr-bot
Copy link

copy-pr-bot bot commented Oct 2, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

…in favor of a unified llm_inputs structure. Update related tests and examples to reflect this change
@ronjer30
Copy link
Contributor Author

ronjer30 commented Oct 7, 2025

Thanks @suiyoubi, I've addressed the issues, added tests and updated existing ones as well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants