Open
Description
It's quite difficult to interpret the values in the Test Metrics table, produced after inference. What is a mean word overlap of 21.19? What is a summary length of 570?
Describe the solution you'd like
An extra column in the test metric table, to contextualize the metrics and give their units, and maybe their minimum and maximum.
I can't assign this myself, but I would like to fix this one. Please assign to @SinclairHudson 🎸