Question about LoRA alpha #5

Description

@vishaal27

Hi, thanks for your great work. I noticed that in your scripts you hard-code the LoRA alpha to 128 and the rank r to 4, which gives a scaling factor of alpha / r = 128 / 4 = 32:

'''
LoRA setting
'''
self.lora_moe_lambda = 1.0
self.lora_moe_act = 'linear'
self.lora_r_dropout = None
self.lora_attn_dim = 4
self.lora_moe = 0
self.lora_attn_alpha=128
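
For context, here is my understanding of how these two settings interact (a minimal PyTorch-style sketch with made-up names, not your actual implementation):

import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    # Illustrative LoRA layer: frozen base weight plus a low-rank update
    # scaled by alpha / r, as in the LoRA paper.
    def __init__(self, in_features, out_features, r=4, alpha=128):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_features, in_features))
        self.weight.requires_grad = False              # frozen pretrained weight
        self.lora_A = nn.Parameter(torch.randn(r, in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(out_features, r))
        self.scaling = alpha / r                       # 128 / 4 = 32

    def forward(self, x):
        # y = x W^T + (alpha / r) * x (B A)^T
        return x @ self.weight.T + self.scaling * (x @ self.lora_A.T @ self.lora_B.T)

layer = LoRALinear(768, 768)   # r=4, alpha=128
print(layer.scaling)           # 32.0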

Was there a principled justification for these choices? I am wondering whether you did any tuning on these values, and whether you can suggest good values to use.
