Hello,
Thank you for releasing the code for training the behavior model recently. I'm currently trying to reproduce the performance of ctrl-sim and I was wondering if you can share some training results (such as the validation loss value) as reference to let me know if the training went well. In your paper, you showed the results in table 5 but scanning through your code, it doesn't look like there's any way to reproduce it.
Any guidance would be greatly appreciated. Thank you!