How to test performance of dense retrieval model #5555
Replies: 3 comments
-
@bogdankostic do you have any ideas to help?
-
The performance measures displayed at the end of DPR training are in-batch metrics. This means that, for each batch of samples in your development set, and for each question in each batch, we check whether the model can identify the relevant passage (positive label) among all passages in the batch (hard-negative labels and in-batch negatives). The final metrics are the averages over all batches and depend heavily on the batch size you set when training the model. These metrics are useful for quickly checking whether training works as expected (i.e., the model is converging) and for comparing different hyperparameter values. However, the best practice for evaluating retrieval models is to check their performance against a large collection of documents. I'd recommend reviewing the Evaluation Guide in our documentation and our Evaluation Tutorial.
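For reference, here is a minimal sketch of such a collection-level evaluation, assuming Haystack 1.x. The document collection, the saved-model path (`saved_models/my_dpr`), and the (question, gold document ID) pairs are placeholders you would replace with your own data:

```python
# Minimal sketch: evaluate a trained DPR model against a full document
# collection by computing recall@k manually. Assumes Haystack 1.x; the
# documents, model path, and eval pairs below are placeholders.
from haystack.document_stores import InMemoryDocumentStore
from haystack.nodes import DensePassageRetriever

# Index the document collection you want to search over.
document_store = InMemoryDocumentStore(embedding_dim=768)
docs = [
    {"content": "Berlin is the capital of Germany.", "meta": {"doc_id": "d1"}},
    {"content": "Paris is the capital of France.", "meta": {"doc_id": "d2"}},
    # ... your full collection ...
]
document_store.write_documents(docs)

# Load your trained DPR model and embed all documents with it.
retriever = DensePassageRetriever.load(
    load_dir="saved_models/my_dpr",  # placeholder path to your trained model
    document_store=document_store,
)
document_store.update_embeddings(retriever)

# (question, id of the gold/positive document) pairs from your dev/test set.
eval_pairs = [
    ("What is the capital of Germany?", "d1"),
    ("What is the capital of France?", "d2"),
]

top_k = 10
hits = 0
for question, gold_id in eval_pairs:
    retrieved = retriever.retrieve(query=question, top_k=top_k)
    if any(d.meta.get("doc_id") == gold_id for d in retrieved):
        hits += 1

print(f"recall@{top_k}: {hits / len(eval_pairs):.3f}")
```

Computing recall@k this way against the full index is usually more informative than the in-batch numbers, because the model has to find the gold passage among every document in the store rather than only among the passages that happen to share its batch.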
-
I see, thank you for your help!
-
Hey,
I'd like to train my own dense retrieval model on my own dataset. As a starting point, however, I tried to replicate the results of the Haystack example. At the end, it reports several DPR performance measures for the trained model. It looks like this:

[screenshot of the reported performance measures]
However, the example does not say how these measures are obtained. Is there a simple way to compute them given the trained model and the data, or do I have to implement this myself?
Thank you very much in advance!
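For context, the measures referred to here are the ones printed at the end of DPR training when dev/test files are supplied. A minimal training sketch that produces such a report, assuming Haystack 1.x and placeholder file and directory names, might look like this:

```python
# Minimal sketch of a DPR training run whose dev/test metrics are the
# measures discussed above. Assumes Haystack 1.x; all paths and file names
# are placeholders for your own DPR-format training data.
from haystack.document_stores import InMemoryDocumentStore
from haystack.nodes import DensePassageRetriever

retriever = DensePassageRetriever(
    document_store=InMemoryDocumentStore(),
    query_embedding_model="facebook/dpr-question_encoder-single-nq-base",
    passage_embedding_model="facebook/dpr-ctx_encoder-single-nq-base",
    max_seq_len_query=64,
    max_seq_len_passage=256,
)

# The dev/test files use the DPR training-data format (question, positive
# passages, hard negatives). The metrics reported at the end of training
# are the in-batch metrics computed on these held-out files.
retriever.train(
    data_dir="data/dpr_training",   # placeholder path
    train_filename="train.json",    # placeholder file names
    dev_filename="dev.json",
    test_filename="dev.json",
    n_epochs=1,
    batch_size=16,
    num_positives=1,
    num_hard_negatives=1,
    embed_title=True,
    evaluate_every=3000,
    save_dir="saved_models/my_dpr",  # placeholder save directory
)
```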