A detailed list of TODOs Mamba repo - [x] create a Mamba-MoE branch in Mamba repo @fabianlim FMS-FSDP repo - [x] add mamba moe configs - [x] modify loss to moe-load-balancing-loss Job yaml file - [x] add extra dependencies for mamba moe - [x] switch to the mamba moe fork Prepare config - [x] 30b, 8 active experts, 64 total experts - [x] 120b, 16 active experts, 256 total experts