Inspired by the paper "Searching for Best Practices in Retrieval-Augmented Generation" by Wang et al., this repository is dedicated to searching for the best RAG strategy on a tight budget.
You can run the repository on your local machine to experiment with different RAG pipelines.
- Create a virtual environment and activate it (the activation command below is for Windows; on Linux/macOS use source venv/bin/activate):
python -m venv venv
./venv/Scripts/Activate
- Install the requirements:
pip install -r requirements.txt
- Specify the different API keys in .env:
GOOGLE_API_KEY: for Google models
OPENAI_API_KEY: for OpenAI models
HF_TOKEN: for open-source models hosted on Hugging Face
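A minimal .env could look like this (placeholder values, replace them with your own keys):
GOOGLE_API_KEY=your-google-api-key
OPENAI_API_KEY=your-openai-api-key
HF_TOKEN=your-huggingface-token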
- Put the documents that you want to query into a documents folder (the example below reads from ./documents).
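For example, with hypothetical file names:
documents/
    annual_report.pdf
    meeting_notes.txt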
- Play around with the code in test.py. Example using gemini-pro and BAAI/bge-small-en-v1.5:
from llama_index.core import Settings  # global LLM / embedding configuration from LlamaIndex

# get_llm, get_embedding, data_loader, ChunkerStrategy, Faiss, Retriever,
# fusion_retriever, Reranker and QueryEngine are helpers defined in this repository.

if __name__ == '__main__':
    llm = get_llm(model="gemini-pro")
    Settings.llm = llm
    Settings.embed_model = get_embedding(sentence_transformer="BAAI/bge-small-en-v1.5")
    print("Embed model and LLM loaded successfully")

    documents = data_loader(file_path='./documents')
    print("Documents loaded successfully")

    chunker = ChunkerStrategy(strategy='sentence')
    chunker = chunker.parser(chunk_size=512, chunk_overlap=50)
    print("Chunker loaded successfully")

    # 384 is the embedding dimension of BAAI/bge-small-en-v1.5
    index, doc = Faiss(documents=documents, dimension=384, transformation=[chunker])
    print("Index loaded successfully")

    # Hybrid retrieval: BM25 + dense vector search, fused with weights 0.7 / 0.3
    rt1 = Retriever(vector=index, method='BM25')
    rt2 = Retriever(vector=index, method='vector')
    retriever1 = rt1.parser(docstore=doc)
    retriever2 = rt2.parser(docstore=doc)
    fusion = fusion_retriever(top_k=5, retrievers=[retriever1, retriever2], weights=[0.7, 0.3])
    print("Retriever loaded successfully")

    # Rerank the fused results with a cross-encoder and keep the top 3 nodes
    reranker = Reranker(strategy='custom').parser(top_n=3, model_name="cross-encoder/ms-marco-MiniLM-L-2-v2")
    query_engine = QueryEngine(retriever=fusion, transform_mode='none', llm=llm, node_processor=reranker)
    print("Query engine created successfully")

    query = 'The process of a GEN AI Cycle?'

    # Inspect the retrieved nodes before running the full query
    nodes = fusion.retrieve(query)
    print('\n--------------------------\n')
    for i, node in enumerate(nodes):
        print(f'{node.text}')
        print(f'{node.score}')
        print(f'\n----------Node {i}----------------\n')

    response = query_engine.query(query)
    print(response.get_formatted_sources(length=50))
Each RAG strategy was evaluated with the TruLens RAG Triad benchmark. The RAG Triad consists of three evaluations: context relevance, groundedness, and answer relevance. Satisfactory scores on all three give us confidence that our LLM app is free from hallucination.
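For reference, a minimal sketch of the RAG Triad wired to the query engine above, written against an older trulens_eval quickstart with an OpenAI feedback provider; exact class and method names differ between TruLens versions, and the app_id is a hypothetical label:

import numpy as np
from trulens_eval import Feedback, Tru, TruLlama
from trulens_eval.feedback import Groundedness
from trulens_eval.feedback.provider.openai import OpenAI

tru = Tru()
provider = OpenAI()

# Groundedness: is the answer supported by the retrieved context?
grounded = Groundedness(groundedness_provider=provider)
f_groundedness = (
    Feedback(grounded.groundedness_measure_with_cot_reasons, name="Groundedness")
    .on(TruLlama.select_source_nodes().node.text)
    .on_output()
    .aggregate(grounded.grounded_statements_aggregator)
)

# Answer relevance: does the answer address the question?
f_answer_relevance = Feedback(provider.relevance, name="Answer Relevance").on_input_output()

# Context relevance: is each retrieved chunk relevant to the question?
f_context_relevance = (
    Feedback(provider.qs_relevance, name="Context Relevance")
    .on_input()
    .on(TruLlama.select_source_nodes().node.text)
    .aggregate(np.mean)
)

recorder = TruLlama(
    query_engine,
    app_id="fusion_rag",  # hypothetical app name
    feedbacks=[f_groundedness, f_answer_relevance, f_context_relevance],
)
with recorder as recording:
    query_engine.query("The process of a GEN AI Cycle?")

print(tru.get_leaderboard(app_ids=["fusion_rag"]))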
@inproceedings{Wang2024SearchingFB,
title={Searching for Best Practices in Retrieval-Augmented Generation},
author={Xiaohua Wang and Zhenghua Wang and Xuan Gao and Feiran Zhang and Yixin Wu and Zhibo Xu and Tianyuan Shi and Zhengyuan Wang and Shizheng Li and Qi Qian and Ruicheng Yin and Changze Lv and Xiaoqing Zheng and Xuanjing Huang},
year={2024},
url={https://api.semanticscholar.org/CorpusID:270870251}
}