Thank you for this amazing project!
As I am preparing to run the QwenVL classification task, I have some questions about the parameters used in finetune_cls.sh:
- What global batch size was used with the default learning rate?
- Why is head_lr different from learning_rate? Is this an empirically chosen setting?
- Why do you freeze the LLM while keeping the vision tower and merger module trainable? This is the opposite of the usual SFT setup for MLLMs, where typically only the LLM part is finetuned. (See the sketch after this list for how I currently understand the setup.)
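
To make the questions concrete, here is a minimal PyTorch-style sketch of how I understand the setup: the LLM frozen, the vision tower and merger trainable, and the classification head given its own (larger) learning rate. The attribute names `model.llm`, `model.vision_tower`, `model.merger`, and the `cls_head` prefix are placeholders of mine, not the repository's actual identifiers. (For the first question, I assume global batch size = per-device batch size × gradient-accumulation steps × number of GPUs.)

```python
import torch

def build_optimizer(model, learning_rate=1e-5, head_lr=1e-4):
    # Freeze the LLM backbone; leave the vision tower and merger trainable.
    # (Attribute names here are hypothetical stand-ins for the real modules.)
    for p in model.llm.parameters():
        p.requires_grad = False
    for module in (model.vision_tower, model.merger):
        for p in module.parameters():
            p.requires_grad = True

    # Split the trainable parameters into two groups so the classification
    # head can use a larger learning rate (head_lr) than the rest.
    backbone_params = [
        p for n, p in model.named_parameters()
        if p.requires_grad and not n.startswith("cls_head")
    ]
    head_params = [
        p for n, p in model.named_parameters()
        if p.requires_grad and n.startswith("cls_head")
    ]
    return torch.optim.AdamW([
        {"params": backbone_params, "lr": learning_rate},
        {"params": head_params, "lr": head_lr},
    ])
```

If this matches what finetune_cls.sh actually does, I would like to understand the reasoning behind freezing the LLM here, since for classification I would have expected the language side to need adaptation as well.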