Screening, Rectifying, and Re-Screening: A Unified Framework for Tuning Vision-Language Models with Noisy Labels [IJCAI 2025]

Official PyTorch implementation for the paper:

Screening, Rectifying, and Re-Screening: A Unified Framework for Tuning Vision-Language Models with Noisy Labels

Chaowei Fang, Hangfei Ma, Zhihao Li, De Cheng, Yue Zhang, Guangbin Li

Environment

Set up the conda environment using the provided environment.yml:

# Establish a new conda environment with dependencies
conda env create -f environment.yml

# Activate the new environment
conda activate srrs

Datasets

Please follow the CoOp dataset instructions to install the datasets.

How to run

Training

All training runs use the same script: scripts/train.sh.

CUDA_VISIBLE_DEVICES=0 bash scripts/train.sh dtd 2 symflip False Final

Parameters Config

dtd: dataset name, such as caltech101, food101 and so on.
2: number of false positive training samples per class.
symflip: type of noise (symflip, pairflip).
False: whether to use the GCE loss, False or True.
Final: a tag that identifies this run of the training pipeline.
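To make the two noise types concrete, here is a minimal sketch of how such label noise is commonly generated in the noisy-label literature. It assumes that symflip replaces a label with a uniformly chosen other class and that pairflip replaces it with the next class index. The function name inject_label_noise and its arguments are hypothetical and are not taken from this repository. As a simplification, it flips a fixed number of labels drawn from each clean class, which yields exactly that many false positives per class only in the pairflip case.

```python
import random
from collections import defaultdict


def inject_label_noise(labels, num_classes, num_fp, noise_type="symflip", seed=0):
    """Return a copy of `labels` with `num_fp` labels per clean class replaced.

    Hypothetical helper for illustration; not the repository's implementation.
    """
    rng = random.Random(seed)
    noisy = list(labels)

    # Group sample indices by their clean class.
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)

    for y, indices in by_class.items():
        # Pick which samples of this class receive a wrong label.
        flipped = rng.sample(indices, min(num_fp, len(indices)))
        for idx in flipped:
            if noise_type == "symflip":
                # Symmetric flip: any other class, chosen uniformly.
                noisy[idx] = rng.choice([c for c in range(num_classes) if c != y])
            elif noise_type == "pairflip":
                # Pair flip: always the next class index, wrapping around.
                noisy[idx] = (y + 1) % num_classes
            else:
                raise ValueError(f"unknown noise type: {noise_type}")
    return noisy
```

For example, inject_label_noise(labels, num_classes=47, num_fp=2, noise_type="pairflip") would correspond to the arguments "2 pairflip" under these assumptions, with the class count supplied by the caller.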

After the prompt learners are trained with scripts/train.sh, the results are written to output/Final/.
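The fourth argument of the training command toggles a GCE loss. Assuming GCE here refers to the generalized cross-entropy loss often used for training with noisy labels, a per-sample sketch in plain Python looks like the following. The function names and the default q = 0.7 are illustrative assumptions; the repository's own implementation and hyperparameters may differ.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def gce_loss(logits, target, q=0.7):
    """Generalized cross-entropy for one sample: (1 - p_target ** q) / q.

    Illustrative sketch; q = 0.7 is an assumed default, not a value
    confirmed by this repository.
    """
    if not 0 < q <= 1:
        raise ValueError("q must be in (0, 1]")
    p_target = softmax(logits)[target]
    return (1.0 - p_target ** q) / q
```

With q = 1 the loss reduces to 1 - p_target, and as q approaches 0 it approaches the standard cross-entropy -log(p_target), so q trades off noise robustness against the stronger gradients of cross-entropy.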

Ablation

We also provide ablation code for all submodules, run through scripts/ablation/train.sh.

CUDA_VISIBLE_DEVICES=0 bash scripts/ablation/train.sh dtd 2 pairflip False ablation CoOpDC

Parameters Config

dtd: dataset name, such as caltech101, food101 and so on.
2: number of false positive training samples per class.
pairflip: type of noise (symflip, pairflip).
False: whether to use the GCE loss, False or True.
ablation: a tag that identifies this run of the training pipeline.
CoOpDC: name of the submodule to ablate; its training code is fetched from the analysis/ directory.

The results are written to output_ablation/CoOpDC, which keeps the ablation outputs separate and well organized.

Parsing

The modified code in parse_test_res.py loads the training results, parses the accuracy metrics, and prints a summary.

CUDA_VISIBLE_DEVICES=0 bash scripts/parse.sh dtd Final

Parameters Config

dtd: dataset name, such as caltech101, food101 and so on.
Final: a tag that identifies the training run to parse.

The script searches all files under output/Final/ for accuracy results and prints their average.
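The search-and-average step can be sketched as follows. This is not the code in parse_test_res.py: the log line format ("* accuracy: 63.5%"), the choice to keep the last match per log, and the function names are all assumptions made for illustration.

```python
import re
from statistics import mean, pstdev

# Assumed log line format, e.g. "* accuracy: 63.5%"; adjust to the real logs.
ACCURACY_PATTERN = re.compile(r"accuracy:\s*([0-9]+(?:\.[0-9]+)?)%")


def last_accuracy(log_text):
    """Return the last accuracy value found in one log, or None if absent."""
    matches = ACCURACY_PATTERN.findall(log_text)
    return float(matches[-1]) if matches else None


def summarize(log_texts):
    """Average the final accuracy across runs, skipping logs without a match."""
    values = [a for a in map(last_accuracy, log_texts) if a is not None]
    if not values:
        raise ValueError("no accuracy values found")
    return {"runs": len(values), "mean": mean(values), "std": pstdev(values)}
```

In practice the log texts would be read from the files under output/Final/, for example one log per random seed; the exact file names depend on the training setup.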

Analysis

analysis.py provides a class for analyzing the results, for example recording how the noise varies and visualizing the noise distribution. Its usage is the same as for Parsing.

Acknowledgement

We would like to thank the authors of CoOp and PTNL for open-sourcing their code.
