Thanks for the great work ! I wonder why the in_channels of decode_head is [ 224, 368, 480, 480 ] rather than [ 128, 224, 368, 480 ] for the MPViT-Base ? Looking forward to your reply. Thanks again.