Aoyu Gong, Sepehr Mousavi, Yiting Xia, Savvas Zannettou
This repository contains the code for our paper:
ClipMind: A Framework for Auditing Short-Format Video Recommendations Using Multimodal AI Models
💡 If you have Conda installed, you may skip this section and proceed to the next one.
Follow these steps to set up a reproducible environment:
```bash
wget https://github.com/conda-forge/miniforge/releases/download/24.11.0-0/Miniforge3-24.11.0-0-Linux-x86_64.sh
bash Miniforge3-24.11.0-0-Linux-x86_64.sh -b -p ~/miniforge3
~/miniforge3/bin/conda init bash
source ~/.bashrc
```

To ensure Conda is initialized in login shells, add the following to `~/.bash_profile`:

```bash
echo 'source ~/.bashrc' >> ~/.bash_profile
```

If `~/.bash_profile` already exists, make sure it includes this line:

```bash
source ~/.bashrc
```

Then verify the Conda installation, create the project environment, and install the pinned dependencies:

```bash
conda --version
conda env create -f environment.yml
conda activate clipmind
pip install torch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu117
conda install -c conda-forge ffmpeg=4.3.1 git-lfs
```

💡 FFmpeg version 4 is required for compatibility.
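After installation, you can sanity-check that the pinned FFmpeg is the one resolved inside the environment (a quick check, assuming the `clipmind` environment is active):

```bash
# Should report the conda-forge 4.3.1 build installed above
ffmpeg -version | head -n 1
```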
Update the following fields in `configuration.yaml`:

- `openai.api_key`: Insert your OpenAI API key.
- `working_trace`: Path to your short-format video trace directory.

💡 A default `test` trace is provided for demonstration purposes.
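As a rough sketch, the two fields might look like this (this assumes `openai.api_key` is a nested key; check the shipped `configuration.yaml` for the exact layout and any additional fields):

```yaml
# Hypothetical excerpt of configuration.yaml; only the two fields
# described above are shown, and the shipped file may contain more.
openai:
  api_key: "sk-..."          # your OpenAI API key
working_trace: "data/test"   # active trace directory (the bundled test trace)
```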
To analyze your short-format video traces, organize your data using the following folder structure:
```
./ClipMind/
└── data/
    └── your_trace_name/
        ├── metadata/        # Video metadata
        ├── videos/          # Video files
        └── viewing.json     # A JSON file with timestamped viewing history
```
💡 A default `test` trace is provided for demonstration purposes.
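The exact schema of `viewing.json` is defined by the repository; the sketch below is purely illustrative, and the field names (`video_id`, `watch_time`) are placeholders rather than the actual schema:

```json
[
  {"video_id": "7301234567890123456", "watch_time": "2025-01-15T14:32:07Z"},
  {"video_id": "7301234567890123457", "watch_time": "2025-01-15T14:32:41Z"}
]
```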
The `working_trace` field in `configuration.yaml` specifies the active data directory used by the framework.

- **Phase 1 – Calibration Trace:** Start by setting `working_trace` to a trace you want to use for sampling and annotation. This trace is used to identify the best feature combination and similarity threshold. After running the notebook `identify_best_features_threshold.ipynb`, the best parameters will be written back into `configuration.yaml`.
- **Phase 2 – Analysis Traces:** You can now switch `working_trace` to other traces you wish to analyze (see the example after this list). The notebook `video_sequence_analysis.ipynb` will apply the identified parameters to audit short-format video recommendations in those traces.
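Switching traces between phases is just an edit to `configuration.yaml`; for example, with GNU `sed` (assuming `working_trace` is a top-level key holding the trace path, as in the sketch above):

```bash
# Phase 1: calibrate on the bundled test trace
sed -i 's|^working_trace:.*|working_trace: "data/test"|' configuration.yaml
# ...run the calibration notebooks, then switch to the trace you want to analyze...
sed -i 's|^working_trace:.*|working_trace: "data/your_trace_name"|' configuration.yaml
```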
The following list outlines the recommended notebook execution order across the two phases:
1. `setup.ipynb`
2. `convert_video_to_audio.ipynb`
3. `llm_generated_description.ipynb`
4. `user_defined_metadata.ipynb`
5. `llm_generated_keywords.ipynb`
6. `text_embedding.ipynb`
7. `sampling.ipynb`
8. `annotation.ipynb`
9. `identify_best_features_threshold.ipynb`
10. `video_sequence_analysis.ipynb`
💡 Use Jupyter or VSCode to execute notebooks interactively.
For the two phases, run different subsets of notebooks depending on whether you are identifying the best parameters or analyzing new traces:
- **Prepare AI Models:** Run notebook 1
- **Phase 1 – Calibration Trace:** Run notebooks 2 → 9
- **Phase 2 – Analysis Traces:** Run notebooks 2 → 6 (prepare embeddings), then notebook 10 (analyze new traces)
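If you prefer not to step through each notebook by hand, they can also be executed headlessly with `jupyter nbconvert` (assuming Jupyter is installed in the `clipmind` environment; note that the sampling and annotation steps may expect manual input, so interactive execution is the safer default):

```bash
# Phase 2 example: rebuild embeddings (notebooks 2-6), then analyze (notebook 10)
for nb in convert_video_to_audio llm_generated_description user_defined_metadata \
          llm_generated_keywords text_embedding video_sequence_analysis; do
    jupyter nbconvert --to notebook --execute --inplace "${nb}.ipynb"
done
```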
If you find the codebase helpful, please consider giving a ⭐ and citing our paper:
```bibtex
@inproceedings{gong2025clipmind,
  title={ClipMind: A Framework for Auditing Short-Format Video Recommendations Using Multimodal AI Models},
  author={Gong, Aoyu and Mousavi, Sepehr and Xia, Yiting and Zannettou, Savvas},
  booktitle={Proceedings of the International AAAI Conference on Web and Social Media},
  volume={19},
  pages={671--687},
  year={2025}
}
```

If you run into problems or have suggestions, feel free to open an issue or reach out to us.
