add acc tuning script. (#121)

* add acc tuning script. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * change directory. * update acc tuning doc. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update README.md * update ragas version. --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
opea-project · Sep 11, 2024 · a6fd418 · a6fd418
1 parent a9b087f
commit a6fd418
Show file tree

Hide file tree

Showing 25 changed files with 645 additions and 1 deletion.
diff --git a/evals/auto_tuning/README.md → evals/benchmark/auto_tuning/README.md b/evals/auto_tuning/README.md → evals/benchmark/auto_tuning/README.md
@@ -149,3 +149,42 @@ Example of a strategy file:
 Both the K8S manifests and strategy files are generated in the current directory, providing everything needed for deployment.
 
 Deployment methods: simply run `kubectl apply -f` on the newly generated *_run.yaml files and the chatqna_config_map.
+
+# Auto-Tuning for ChatQnA: Optimizing Accuracy by Tuning Model Related Parameters
+
+The ChatQnA pipeline contains many components, such as `data_prep/embedding/retrieval/reranking/llm`, and each component has some hyper-parameters that have an impact on accuracy. So, we can create a tuning script to search the best accuracy config.
+
+Most of the hyper-parameters listed below:
+- embedding models
+- reranking models
+- large language models (llms)
+- data_prep hyper-parameters
+    - chunk_size
+    - chunk_overlap
+- retrieval hyper-parameters
+    - search_types
+    - top_k
+    - fetch_k
+- llms hyper-parameters
+    - top_k
+    - top_p
+    - temperature
+
+## Prepare Dataset
+
+We use the evaluation dataset from [CRUD-RAG](https://github.com/IAAR-Shanghai/CRUD_RAG) repo, use the below command to prepare the dataset.
+
+```
+git clone https://github.com/IAAR-Shanghai/CRUD_RAG
+mkdir data/
+cp CRUD_RAG/data/crud_split/split_merged.json data/
+cp -r CRUD_RAG/data/80000_docs/ data/
+python ../../evaluation/rag_eval/examples/process_crud_dataset.py
+```
+
+## Run the Tuning script
+
+```
+python3 acc_tuning.py --tuning_config acc_tuning_config.json --hardware_info hardware_info_gaudi.json --service_info chatqna_neuralchat_rerank_latest.yaml
+
+```