Evaluate
This module contains functionality related to the the evaluate
script.
Evaluate
This script is used to evaluate RAG system using langfuse datasets. To add a new item to datasets, visit Langfuse UI. Qdrant vector storage should be running with ready collection of embeddings. To run the script execute the following command from the root directory of the project:
python src/evaluate.py
main(injector)
Execute RAG system evaluation workflow.
Parameters: |
|
---|
Note
Evaluates both feedback and manual datasets Results are recorded in Langfuse
Source code in src/evaluate.py
18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 |
|