Exploring using BLIP to caption videos and using embeddings to identify important video frames.
- CUDA 11.7
- Python 3.10
git clone https://github.com/tsugg/VideoCaptioningWithBLIP.git
cd VideoCaptioningWithBLIP
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt