This repository contains a worked example using the GraphRAG Python package for Neo4j. The example demonstrates the end-to-end workflow, starting from unstructured documents (in this case pdfs), to knowledge graph construction, knowledge graph retriever design, and a working GraphRAG pipeline. Research papers on Lupus are used as the data source.
- The end-to-end-lupus notebook contains the worked example.
- The corresponding blog post has a full write-up walking through the example with more details, explanations, and resources.
- The truncated-pdfs directory contains the pdf source files. They were obtained from NIH PubMed. Some pages at the end containing references have been truncated to better focus the knowledge graph on medical information rather than citations and other publications.