-
Notifications
You must be signed in to change notification settings - Fork 184
add enhanced captioning to rn; bump versions on json files #1154
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
- **Semantic deduplication**: [K-means clustering and pairwise similarity](../../curate-video/process-data/dedup.md) for near-duplicate clip removal | ||
- **Content filtering**: [Motion-based filtering](../../curate-video/process-data/filtering.md) and [aesthetic filtering](../../curate-video/process-data/filtering.md) for quality improvement | ||
- **Embedding generation**: InternVideo2 and Cosmos-Embed1 models for clip-level embeddings | ||
- **Enhanced captioning**: [Qwen-VL caption generation with optional Qwen-LM enhancement](../../curate-video/process-data/captions-preview.md) for detailed video descriptions |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can we keep this generic? VL based caption generation with optional LLM based re-write? or something ..; Qwen can be mentioned in brackets as supported today ...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@suiyoubi fyi
"version": "0.25.7", | ||
"url": "../0.25.7" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can this version be renamed to 25.07?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes i plan to fix this but it's a kinda slow process because i need to download the old docs and re-upload them to a different s3 directory
[like] Arham Mehta reacted to your message:
________________________________
From: Sarah Yurick ***@***.***>
Sent: Wednesday, October 1, 2025 4:02:16 PM
To: NVIDIA-NeMo/Curator ***@***.***>
Cc: Arham Mehta ***@***.***>; Review requested ***@***.***>
Subject: Re: [NVIDIA-NeMo/Curator] add enhanced captioning to rn; bump versions on json files (PR #1154)
@sarahyurick commented on this pull request.
________________________________
In docs/versions1.json<#1154 (comment)>:
"version": "0.25.7",
"url": "../0.25.7"
Can this version be renamed to 25.07?
—
Reply to this email directly, view it on GitHub<#1154 (review)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/BBVYZYXBRDO6PT4FPA6OBNT3VP3IRAVCNFSM6AAAAACIATVMZKVHI2DSMVQWIX3LMV43YUDVNRWFEZLROVSXG5CSMV3GSZLXHMZTEOBZHE2DSOBTGQ>.
You are receiving this because your review was requested.Message ID: ***@***.***>
|
No description provided.