
Reproducing results on Llama-3.1-8B-Inst #8

@chtmp223

Thank you for your great work!

I tried to reproduce the results for meta-llama/Llama-3.1-8B-Instruct, but I noticed several discrepancies between my numbers and those reported in the public result file under Llama-3.1-8B-Inst.

You can view my results here: Results. I obtained them by running scripts/run_eval_slurm.sh on both the short and long configs for a subset of the benchmarks, then compiled everything with scripts/collect_results.py.
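For reference, this is roughly the sequence of commands I ran. It is only a sketch of my setup: I am not passing any extra flags here, and the short/long split simply reflects running the script once per config set rather than any option the repo necessarily exposes.

```bash
# Rough sketch of my reproduction steps (no extra flags; configs as shipped in my checkout).
# Submit the evaluation jobs, once for the short configs and once for the long configs.
bash scripts/run_eval_slurm.sh

# After all SLURM jobs finish, aggregate the per-benchmark outputs into one table.
python scripts/collect_results.py
```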

Do you have any suggestions on what I could adjust to align my results with yours?
