-
Notifications
You must be signed in to change notification settings - Fork 307
Open
Description
It seems not good to inference on the autonomous driving scene with Grounded-SAM-2 Video Object Tracking with Continuous ID (with Grounding DINO)
or reverse tracking. Is using the API the only solution?
The prompt: 'car. suv. bus.'
nusc1.mp4
As we can see, the model ignores the black suv :(
Metadata
Metadata
Assignees
Labels
No labels