Skip to content

Conversation

lbliii
Copy link
Contributor

@lbliii lbliii commented Oct 1, 2025

No description provided.

@lbliii lbliii requested a review from arhamm1 October 1, 2025 15:45
@lbliii lbliii self-assigned this Oct 1, 2025
Signed-off-by: Lawrence Lane <[email protected]>
@lbliii lbliii requested a review from sarahyurick October 1, 2025 15:47
- **Semantic deduplication**: [K-means clustering and pairwise similarity](../../curate-video/process-data/dedup.md) for near-duplicate clip removal
- **Content filtering**: [Motion-based filtering](../../curate-video/process-data/filtering.md) and [aesthetic filtering](../../curate-video/process-data/filtering.md) for quality improvement
- **Embedding generation**: InternVideo2 and Cosmos-Embed1 models for clip-level embeddings
- **Enhanced captioning**: [Qwen-VL caption generation with optional Qwen-LM enhancement](../../curate-video/process-data/captions-preview.md) for detailed video descriptions
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we keep this generic? VL based caption generation with optional LLM based re-write? or something ..; Qwen can be mentioned in brackets as supported today ...

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@suiyoubi fyi

Comment on lines 9 to 10
"version": "0.25.7",
"url": "../0.25.7"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can this version be renamed to 25.07?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes i plan to fix this but it's a kinda slow process because i need to download the old docs and re-upload them to a different s3 directory

@arhamm1
Copy link
Contributor

arhamm1 commented Oct 1, 2025 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants