RAG: Reranking to improve results #39

MrCsabaToth · 2024-08-19T06:45:17Z

Right now the vector DB is working (#7) and the ANN distance thresholds are configurable (#35), but for proper RAG it'd be great to re-rank the retrieved results. Doing this with Gemini could mean many extra API calls. Maybe we could run a Gemma 2B model (FP16, int4, instruction tuned) locally with MediaPipe or something similar? That's not a dedicated re-ranker model though. And how would we do that with Flutter in a platform-independent way?
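
To make the idea concrete, here is a minimal sketch of the two-stage flow: filter the ANN hits by the distance threshold, then reorder the survivors with a relevance scorer. Everything named here (`retrieve_then_rerank`, `word_overlap`, the `(text, distance)` tuple shape) is hypothetical and only for illustration; the toy scorer stands in for whatever model we end up using.

```python
from typing import Callable, List, Tuple

# Hypothetical shape: a candidate is (text, ann_distance); smaller distance = closer.
Candidate = Tuple[str, float]


def retrieve_then_rerank(
    query: str,
    candidates: List[Candidate],
    score_fn: Callable[[str, str], float],
    max_distance: float,
    top_n: int,
) -> List[Tuple[str, float]]:
    """Filter ANN hits by a distance threshold, then reorder them by relevance."""
    # Stage 1: keep only hits within the configurable ANN distance threshold.
    kept = [text for text, distance in candidates if distance <= max_distance]
    # Stage 2: score each surviving passage against the query (one scorer call each).
    scored = [(text, score_fn(query, text)) for text in kept]
    # Highest relevance first; the sort is stable, so ties keep retrieval order.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


def word_overlap(query: str, passage: str) -> float:
    """Toy stand-in scorer: fraction of query words that appear in the passage."""
    q = set(query.lower().split())
    p = set(passage.lower().split())
    return len(q & p) / len(q) if q else 0.0


hits = [("cats sleep a lot", 0.2), ("how to feed a cat", 0.3), ("dogs bark", 0.9)]
reranked = retrieve_then_rerank("feed a cat", hits, word_overlap, max_distance=0.5, top_n=2)
```

The scorer is called once per surviving passage, which is why a remote LLM as the scorer gets expensive quickly and a tight distance threshold helps keep the candidate set small.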

MrCsabaToth · 2024-08-19T07:11:19Z

MediaPipe GenAI Flutter package by Google: https://pub.dev/packages/mediapipe_genai
Unfortunately it's only at v0.0.1.

MrCsabaToth · 2024-08-21T05:40:49Z

An open reranker model that performs well (besides the closed-source Cohere reranker / embedding): mxbai https://huggingface.co/mixedbread-ai/mxbai-rerank-large-v1, see the reference post https://www.rungalileo.io/blog/mastering-rag-how-to-select-a-reranking-model

Takeaways from "A Thorough Comparison of Cross-Encoders and LLMs for Reranking SPLADE":

Cross-Encoders vs. LLMs: Effective cross-encoders, when paired with strong retrievers, have been shown to outperform most LLMs in reranking tasks, except for GPT-4 on some datasets. Notably, cross-encoders deliver this performance while being more efficient, making them an attractive option for reranking.
LLM-based Rerankers: Zero-shot LLM-based rerankers, including those based on OpenAI and open models, exhibit competitive effectiveness, with some even matching the performance of GPT-3.5 Turbo. However, the inefficiency and high cost of these models currently limit their practical use in retrieval systems, despite their promising performance.
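
For reference, a rough sketch of what cross-encoder reranking could look like server-side, assuming the `sentence-transformers` package is installed (the mxbai model card documents loading the model through its `CrossEncoder` class). The helper names `rerank_with_cross_encoder` and `load_mxbai_reranker` are made up for this sketch, and this does not address the on-device / Flutter question.

```python
from typing import List, Sequence, Tuple


def rerank_with_cross_encoder(
    model, query: str, passages: Sequence[str], top_n: int = 3
) -> List[Tuple[str, float]]:
    """Score (query, passage) pairs jointly and return the top_n passages."""
    if not passages:
        return []
    # A cross-encoder reads the query and the passage together, one pair at a time.
    pairs = [(query, passage) for passage in passages]
    scores = model.predict(pairs)
    ranked = sorted(
        zip(passages, (float(s) for s in scores)),
        key=lambda item: item[1],
        reverse=True,
    )
    return ranked[:top_n]


def load_mxbai_reranker():
    """Lazy import so the helper above can be tried without the heavy dependency."""
    from sentence_transformers import CrossEncoder  # pip install sentence-transformers

    return CrossEncoder("mixedbread-ai/mxbai-rerank-large-v1")
```

Usage would be along the lines of `rerank_with_cross_encoder(load_mxbai_reranker(), query, passages)`; any object with a compatible `predict(pairs)` method can be passed in for experiments.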

MrCsabaToth · 2024-09-03T15:21:23Z

Potential reranking code on Vertex AI: https://cloud.google.com/generative-ai-app-builder/docs/ranking#rank_or_rerank_a_set_of_records_according_to_a_query

We'll potentially need a cloud function for this.
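
A rough sketch of what the cloud function would need to send, based on my reading of the REST example on the linked docs page. The endpoint path, the `semantic-ranker-512@latest` model name, and the `topN` / `records` field names are assumptions to double-check against the docs before use; the helper names are made up, and authentication plus the actual HTTP call are left out.

```python
from typing import Any, Dict, List, Sequence, Tuple

# Assumed endpoint shape, taken from the REST example in the linked docs.
RANK_URL_TEMPLATE = (
    "https://discoveryengine.googleapis.com/v1/projects/{project}"
    "/locations/global/rankingConfigs/default_ranking_config:rank"
)


def build_rank_request(
    project: str,
    query: str,
    passages: Sequence[str],
    top_n: int = 5,
    model: str = "semantic-ranker-512@latest",  # assumed model id, verify in the docs
) -> Tuple[str, Dict[str, Any]]:
    """Return the URL and JSON body for a single rank call covering all passages."""
    url = RANK_URL_TEMPLATE.format(project=project)
    body = {
        "model": model,
        "query": query,
        "topN": top_n,
        # Use the list index as the record id so results can be mapped back.
        "records": [{"id": str(i), "content": text} for i, text in enumerate(passages)],
    }
    return url, body


def passages_in_ranked_order(
    response: Dict[str, Any], passages: Sequence[str]
) -> List[Tuple[str, float]]:
    """Map the (assumed) response records back to the original passages."""
    return [(passages[int(r["id"])], r["score"]) for r in response.get("records", [])]
```

The appeal over prompting Gemini per passage is that all candidates go out in one request.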

MrCsabaToth added the enhancement New feature or request label Aug 19, 2024

MrCsabaToth changed the title ~~Reranking for RAG~~ RAG: Reranking to improve results Aug 21, 2024

MrCsabaToth added the RAG Retrieval Augmented Generation related label Aug 21, 2024

This was referenced Aug 30, 2024

Upgrade text embedding from text-embedding-004 to text-embedding-preview-0815 #46

Closed

Perform dimensionality reduction on the embeddings #47

Closed

MrCsabaToth mentioned this issue Sep 5, 2024

RAG: Upgrade text embedding from text-embedding-004 to text-multilingual-embedding-002 #48

Closed

MrCsabaToth self-assigned this Sep 14, 2024

MrCsabaToth added a commit that referenced this issue Oct 18, 2024

Upgrading pypi package versions for new embedding #48 and reranking #39 functions

4675929

MrCsabaToth added a commit that referenced this issue Oct 18, 2024

Adding reranking function #39

a841d6f

MrCsabaToth added a commit that referenced this issue Oct 18, 2024

Correction after renaming embedding #48 and reranking #39 functions

be4f0cc

MrCsabaToth added a commit that referenced this issue Oct 20, 2024

Refactor programmatic function memory capacity configuration, compiles & runs, still no effect tho #39

4c52313

MrCsabaToth added a commit that referenced this issue Oct 20, 2024

Needs processing of the reranking results because the native class is not JSON serializable #39

4e31f25
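
For context on the serialization problem named in this commit title, a minimal sketch of the kind of processing involved: copy the fields of each result object into plain dicts before returning them from the function. `NativeRecord` is a hypothetical stand-in for the client library's result class, and the `id` / `score` / `content` attribute names are assumptions, not the actual code from the commit.

```python
import json
from typing import Any, Dict, Iterable, List


class NativeRecord:
    """Hypothetical stand-in for a client-library result object."""

    def __init__(self, id: str, score: float, content: str) -> None:
        self.id = id
        self.score = score
        self.content = content


def records_to_jsonable(records: Iterable[Any]) -> List[Dict[str, Any]]:
    """Copy the fields we need into plain dicts that json.dumps accepts."""
    return [
        {"id": str(r.id), "score": float(r.score), "content": str(r.content)}
        for r in records
    ]


results = [NativeRecord("1", 0.75, "hello")]
payload = json.dumps(records_to_jsonable(results))
```

Calling `json.dumps(results)` directly raises `TypeError` because the encoder does not know how to handle arbitrary objects, which matches the symptom in the commit title.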

MrCsabaToth mentioned this issue Oct 20, 2024

Switch from google_generative_ai package to firebase_vertexai (and from BYO API key to BYO Firebase project) #53

Closed
