VLM-CPL

Overall Framework

There are three steps to train the classification network with selected high-quality pseudo labels.

Data prepare

Download the HPH dataset here
Download the LC25K dataset here
Download the CRC100K dataset here
Download the DigestPath dataset here

Using a 4:1 split for training and testing.

Training process

First, use the on-the-shelf VLM for zero-shot inference with our proposed method to filter out noisy samples on the training set.
In the vlm_cpl_LC25K.py file, there are two main functions, MVCandPrompt_feature_consensus.
You can use the combination of MVCandPrompt_feature_consensus or either one alone. You can also adjust the order of these two filters.

python vlm_cpl_LC25K.py --gpu 0

Second, after obtaining high-quality pseudo-labels, you can train a classification network.

python train_pseudo.py --gpu 0 --pseudo_csv <your_csv>

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
dataset		dataset
model		model
README.md		README.md
evaluate_util.py		evaluate_util.py
fig1.png		fig1.png
overall-1.png		overall-1.png
train_pseudo.py		train_pseudo.py
vlm_cpl_LC25K.py		vlm_cpl_LC25K.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

VLM-CPL

Overall Framework

Data prepare

Training process

About

Uh oh!

Releases

Packages

Languages

HiLab-git/VLM-CPL

Folders and files

Latest commit

History

Repository files navigation

VLM-CPL

Overall Framework

Data prepare

Training process

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages