How to evaluate a pipeline on entire data as context? #6645

tobias-mack · 2023-12-26T00:35:44Z

tobias-mack
Dec 26, 2023

Hi 👋 , is there a way to evaluate a pipeline on FAQ data without the given context in the SQuAD format, so that it tests how well the retrieval actually works with the entire data written to the documentstore and not just a small piece of context?

Since the evaluation gave the same results for a prewritten and empty documentstore (adding eval data with document_store.add_eval_data() ), the retrieval is only based on the given context field it seems.

Do I have to put the entire text corpus into every single context field or is there a simpler way to achieve this ?

So i basically want to use the context field in the SQUaD like dataset as reference for comparison and not as a basis for retrieval:
A question is given -> the retriever gets context pieces -> context pieces are compared with given context -> reader takes top k -> reader generates answer -> generated answer is compared with given answer

I am thankful for any help!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to evaluate a pipeline on entire data as context? #6645

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

How to evaluate a pipeline on entire data as context? #6645

Uh oh!

Uh oh!

tobias-mack Dec 26, 2023

Replies: 0 comments

tobias-mack
Dec 26, 2023