Advanced RAG With Semantic Caching, Semantic Routing, and Observability (Langchain, Ollama, Milvus, Redis, Langfuse, and Uptrain)

LLM App Stack (architecture diagram)

The diagram above illustrates an emerging LLM app stack that many GenAI enterprises have adopted, combining LangChain for orchestration, Ollama for model serving, Milvus for vector storage, Redis for caching, and Langfuse and UpTrain for observability and evaluation.

Three Important RAG Techniques for a GenAI Platform

  1. Semantic Caching:

    • Semantic caching in Retrieval-Augmented Generation (RAG) systems stores query–response pairs keyed by embedding similarity rather than exact string match. A new query that is semantically close to a previous one can be answered straight from the cache (Redis in this stack) without re-running retrieval and generation, cutting latency and compute cost.
  2. LLM Routing:

    • LLM routing with LiteLLM directs each user query to the most suitable language model based on factors such as query complexity, domain specificity, and required response quality. This conserves compute and improves response accuracy.
  3. Guardrails:

    • Guardrails are mechanisms or guidelines implemented to ensure that a system, especially an AI or machine learning model, operates within safe, ethical, and intended boundaries, preventing unintended outcomes or misuse.
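The semantic-caching idea in item 1 can be sketched in a few lines. This is a minimal, self-contained illustration: the bag-of-letters `embed` function is a stand-in for a real embedding model (in this stack, one served via Ollama), and the in-memory list stands in for Redis — neither reflects the repo's actual implementation.

```python
import math

def embed(text: str) -> list:
    # Toy bag-of-letters "embedding" — a placeholder for a real
    # embedding model; only for demonstrating the cache mechanics.
    vec = [0.0] * 26
    for ch in text.lower():
        if ch.isalpha():
            vec[ord(ch) - ord("a")] += 1.0
    return vec

def cosine(a: list, b: list) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

class SemanticCache:
    """Cache answers by embedding similarity instead of exact string match."""

    def __init__(self, threshold: float = 0.95):
        self.threshold = threshold
        self.entries = []  # list of (embedding, answer) pairs

    def get(self, query: str):
        qvec = embed(query)
        for vec, answer in self.entries:
            if cosine(qvec, vec) >= self.threshold:
                return answer  # cache hit: a semantically similar query was seen
        return None  # cache miss: caller runs the full RAG pipeline

    def put(self, query: str, answer: str) -> None:
        self.entries.append((embed(query), answer))
```

The `threshold` trades recall for precision: a lower value serves cached answers for looser paraphrases but risks returning a stale or off-topic response.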
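The routing step from item 2 amounts to choosing a model name per query. Below is a heuristic sketch — the keyword rules and model names are illustrative assumptions, not the repo's actual routing policy; in practice the chosen name would be handed to LiteLLM's `completion(model=..., messages=...)` call.

```python
def route_query(query: str) -> str:
    """Pick a model for a query using simple keyword heuristics.

    Model names are illustrative; a real router would pass the chosen
    name to LiteLLM rather than hard-code these rules.
    """
    text = query.lower()
    if any(k in text for k in ("code", "function", "traceback", "bug")):
        return "ollama/codellama"   # code questions -> code-tuned local model
    if len(text.split()) > 50 or "analyze" in text:
        return "gpt-4o"             # long or analytical queries -> stronger model
    return "ollama/llama3"          # everything else -> cheap local default
```

Routing most traffic to a small local model and escalating only hard queries is what makes this pattern save compute.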
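A guardrail (item 3) typically wraps the pipeline on both sides: screen the input before it reaches the LLM, and sanitize the output before it reaches the user. The patterns below are illustrative examples, not a production policy.

```python
import re

# Illustrative sensitive-topic patterns; a real deployment would use a
# maintained policy or a dedicated guardrails library.
BLOCKED_PATTERNS = [
    re.compile(r"\bssn\b", re.IGNORECASE),
    re.compile(r"\bcredit card\b", re.IGNORECASE),
]

def check_input(query: str):
    """Return (allowed, reason); block queries matching sensitive patterns."""
    for pat in BLOCKED_PATTERNS:
        if pat.search(query):
            return False, f"blocked by pattern {pat.pattern!r}"
    return True, "ok"

def check_output(answer: str, max_chars: int = 2000) -> str:
    """Mask email addresses and clamp length before returning to the user."""
    masked = re.sub(r"[\w.+-]+@[\w-]+\.[\w.]+", "[email removed]", answer)
    return masked[:max_chars]
```

Keeping both checks outside the model means the policy can be updated without retraining or re-prompting anything.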

Evaluating RAG Pipeline

  1. UpTrain:

    • UpTrain is an open-source platform for evaluating LLM applications, with pre-built checks for qualities such as response relevance and factual accuracy.
  2. Langfuse:

    • Langfuse is an open-source LLM engineering platform for tracing, monitoring, and evaluating LLM applications in production.
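To give a flavor of what such evaluations measure, here is a crude, self-contained stand-in for a faithfulness-style check. Real UpTrain and Langfuse evaluations are model-graded, not bare word overlap — this function is only a toy proxy for the idea.

```python
def context_overlap(answer: str, context: str) -> float:
    """Fraction of answer words that also appear in the retrieved context.

    A toy proxy for faithfulness: a low score flags an answer that may
    not be grounded in what the retriever returned.
    """
    answer_words = set(answer.lower().split())
    context_words = set(context.lower().split())
    if not answer_words:
        return 0.0
    return len(answer_words & context_words) / len(answer_words)
```

Scoring every (question, context, answer) triple this way, then inspecting the low scorers, is the basic loop both tools automate at scale.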

Installation

  1. Clone the Repository:

    git clone https://github.com/Major-wagh/Advanced-RAG.git
    cd Advanced-RAG
    
  2. Install the requirements:

    pip install -r requirements.txt
    
  3. Run the Notebook:

    Launch Jupyter in your environment and open the project notebooks to explore and experiment with the RAG techniques.

Contributing

We welcome contributions to this project! Please read our contributing guidelines to get started.

About

An advanced RAG pipeline to summarize PDFs.
