Skip to content

Commit 9d825eb

Browse files
committed
fix: reduce capacity again
1 parent 8a8dd82 commit 9d825eb

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

moe_benchmarks/megablocks_yamoe/torch_profile.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1026,7 +1026,7 @@ def build_megablocks_model(device: torch.device):
10261026
# Attach loaded expert weights to the experts container
10271027
e = model.experts
10281028
e.alpha = 1.702
1029-
e.capacity_factor = 64
1029+
e.capacity_factor = 32
10301030
e.gate_up_proj = torch.nn.Parameter(gate_up_proj.clone().to(device))
10311031
e.gate_up_proj_bias = torch.nn.Parameter(gate_up_proj_bias.clone().to(device))
10321032
e.down_proj = torch.nn.Parameter(down_proj.clone().to(device))

0 commit comments

Comments
 (0)