GraphRAG QA Example #1036

akollegger · 2024-10-28T17:11:04Z

Description

An example of GraphRAG using the SEC Edgar dataset with tool-based access over unstructured, structured, and semistructured data.

Issues

n/a

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)
Others (enhancement, documentation, validation, etc.)

Dependencies

custom Streamlit UI
calls out to OpenAI for embedding + chat completion

Tests

Describe the tests that you ran to verify your changes.

…dalone GraphRAG API & UI

for more information, see https://pre-commit.ci

rbrugaro · 2024-10-28T19:20:08Z

@akollegger great to see you contribution! looking forward to trying it out.

I have been working on the below PRs to integrate GraphRAG (microsoft pipeline: graph extraction from text, clustering, community summaries, partial answers, final answer…) using llama-index and neo4j. Please check it out!
#1007
opea-project/GenAIComps#793

What do you think about these two approaches used jointly with an agent in real deployments? I am thinking: agent based on query type can decide if this is better answered by KG query or "microsoft graphRAG" for broader corpus theme/multihop queries. This can also be an efficient cost optimization strategy since "microsoft graphRAG" is more expensive with many more LLM calls.

@arun-gupta

chensuyue · 2024-10-29T01:25:12Z

We can't distribute a dataset in this repo. Better to ask user download from public. If it's a new dataset, we need to distribute somewhere else after the legal process done.

akollegger · 2024-11-03T21:31:45Z

We can't distribute a dataset in this repo. Better to ask user download from public. If it's a new dataset, we need to distribute somewhere else after the legal process done.

Understood. This data is the last stage of processing public data set from https://www.sec.gov/search-filings/edgar-search-assistance/accessing-edgar-data . I will look for a reasonable public host for the files.

akollegger added 2 commits October 28, 2024 11:58

duplicated AgentQnA into GraphRAG; added Edgar data, notebooks + stan…

2ef5df5

…dalone GraphRAG API & UI

(GraphRAG) added api and ui sub-projects to standalone

909607a

akollegger requested a review from lvliang-intel as a code owner October 28, 2024 17:11

[pre-commit.ci] auto fixes from pre-commit.com hooks

723c972

for more information, see https://pre-commit.ci

hshen14 requested review from ftian1, XinyuYe-Intel and chensuyue October 28, 2024 23:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GraphRAG QA Example #1036

GraphRAG QA Example #1036

akollegger commented Oct 28, 2024

rbrugaro commented Oct 28, 2024

chensuyue commented Oct 29, 2024

akollegger commented Nov 3, 2024

GraphRAG QA Example #1036

Are you sure you want to change the base?

GraphRAG QA Example #1036

Conversation

akollegger commented Oct 28, 2024

Description

Issues

Type of change

Dependencies

Tests

rbrugaro commented Oct 28, 2024

chensuyue commented Oct 29, 2024

akollegger commented Nov 3, 2024