Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Showcase Haystack evaluations on an industry dataset #7407

Closed
2 tasks done
Tracked by #6786
mrm1001 opened this issue Mar 22, 2024 · 0 comments
Closed
2 tasks done
Tracked by #6786

Showcase Haystack evaluations on an industry dataset #7407

mrm1001 opened this issue Mar 22, 2024 · 0 comments
Labels
epic P2 Medium priority, add to the next sprint if no P1 available topic:eval

Comments

@mrm1001
Copy link

mrm1001 commented Mar 22, 2024

User story

I would like to learn how to apply Haystack core evaluations to improve my RAG pipeline, with an example on how to improve my retriever component, on an industry dataset.

Sub-tasks:

  • create an example of how to improve Haystack evaluation metrics by tweaking: chunk size and/or embedding model (e.g. context size). Using: semantic similarity on answers and LLM-based metric context relevance.

More context here: https://www.notion.so/deepsetai/Evaluation-1521712b928d4142828232f2df136856?pvs=4

Tasks

  1. P1 topic:eval
    vblagoje
  2. 2.x P2 topic:eval
    TuanaCelik
@masci masci added the P2 Medium priority, add to the next sprint if no P1 available label Mar 22, 2024
@mrm1001 mrm1001 added this to the 2.1.0 milestone Mar 25, 2024
@mrm1001 mrm1001 added P1 High priority, add to the next sprint and removed P2 Medium priority, add to the next sprint if no P1 available labels Mar 27, 2024
@masci masci added P2 Medium priority, add to the next sprint if no P1 available and removed P1 High priority, add to the next sprint labels Mar 28, 2024
@masci masci removed this from the 2.1.0 milestone Apr 7, 2024
@masci masci changed the title Create example of using core Haystack evaluations on industry dataset Showcase Haystack evaluations on an industry dataset Apr 7, 2024
@masci masci added the epic label Apr 7, 2024
@masci masci closed this as completed May 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
epic P2 Medium priority, add to the next sprint if no P1 available topic:eval
Projects
None yet
Development

No branches or pull requests

2 participants