Image-Included RAG Evaluation

This repository contains a Python script designed to evaluate the performance of an image included Retrieval-Augmented Generation (RAG) system. The script analyzes the presence and accuracy of image links within markdown text, comparing them against ground truth data to calculate various performance metrics.

Files in the Repository

image_included_rag_evaluation.py: Main script that performs the evaluation.
requirements.txt: List of Python dependencies required to run the script.
test_image_included_rag_evaluation.py: Contains unit tests for the main script.

Prerequisites

Ensure you have Python installed on your system. This script requires Python 3.10 or higher.

Setup

Install Dependencies

Use pip to install the required packages:
```
pip install -r requirements.txt
```
Prepare Your Data

Ensure your CSV file follows the structure expected by the script. An example row in the CSV might look like this:

inputs.ground_truth inputs.answer inputs.documents

This is an image. This is an image. This is an image.

or

inputs.ground_truth inputs.answer inputs.documents

Usage

Run the image_included_rag_evaluation.py script to evaluate the images in your CSV file.

python image_included_rag_evaluation.py path/to/your/csvfile.csv

The script will load the specified CSV file and perform the following tasks:

Load the CSV file into a DataFrame.
Extract image links from the markdown text in the ground truths, input answers, and input documents.
Check if the links are valid Azure Blob Storage URLs and if they are accessible.
Calculate precision, recall, and other metrics.
Categorize hallucinations into broken links, non-existing resources, and others.
Output detailed results for each row and average metrics.

Output

The script will print out the detailed results for each row and the average metrics, which include:

Retrieval Score
Precision
Recall
Number of Hallucination Links
Hallucination Ratio
Number of Broken Links
Number of Resource-Not-Existing Links
Number of Other Hallucination Links

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
image_included_rag_evaluation.py		image_included_rag_evaluation.py
requirements.txt		requirements.txt
test_image_included_rag_evaluation.py		test_image_included_rag_evaluation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image-Included RAG Evaluation

Files in the Repository

Prerequisites

Setup

Usage

Output

About

Releases

Packages

Languages

aykhara/image-included-rag-evaluation

Folders and files

Latest commit

History

Repository files navigation

Image-Included RAG Evaluation

Files in the Repository

Prerequisites

Setup

Usage

Output

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages