An interactive visualization tool for understanding Retrieval-Augmented Generation (RAG) pipelines. This project helps developers and researchers explore and debug each step of the RAG process through hands-on experiments and real-time demonstrations.
- Text Splitting Visualization: Watch how documents are split into meaningful chunks while preserving semantic coherence and context
- Vector Embedding Analysis: See how text is transformed into numerical vectors and visualize their relationships in high-dimensional space
- Semantic Search Demo: Experience real-time vector similarity search and understand how relevant context is retrieved from your knowledge base
- Context Generation: Observe how LLMs combine retrieved context with user queries to generate accurate, contextual responses
- Framework: Next.js 14 with App Router
- Language: TypeScript
- Styling: TailwindCSS
- UI Components: shadcn/ui
- Vector Operations: Transformers.js
- Text Processing: LangChain
- Markdown Rendering: markdown-it
- Clone the repository:
git clone https://github.com/Kain-90/RAG-Play
cd RAG-Play
- Install dependencies:
npm install
# or
yarn install
# or
pnpm install
- Set up environment variables:
cp .env.example .env
- Start the development server:
npm run dev
# or
yarn dev
# or
pnpm dev
- Open http://localhost:3000 with your browser to see the result.
src/
├── app/
│ ├── experiment/ # RAG experiment pages and components
│ │ ├── components/ # Experiment-specific components
│ │ ├── types/ # TypeScript types for RAG operations
│ │ └── workers/ # Web Workers for embedding computation
│ ├── components/ # Shared components
│ └── stores/ # Global state management
├── components/ # UI components
└── lib/ # Utility functions and helpers
- Text Splitting Tab: Visualizes document chunking with different strategies
- Embedding Tab: Demonstrates vector embedding generation and similarity computation
- Generation Tab: Shows how context and queries combine for response generation
- Text chunking tab
- Add configurable chunk size and overlap settings
- Support multiple splitting strategies (character, recursive character)
- Enable real-time preview of splitting results
- Add custom delimiter-based splitting
- Implement markdown-aware splitting
- Embedding tab
- Basic vector similarity visualization
- Real-time similarity score display
- Implement UMAP for high-dimensional vector visualization
- Support real-time visualization updates during embedding generation
- Implement advanced retrieval and reranking features
- Basic semantic search functionality
- Real-time similarity scoring
- Support multiple reranking models (e.g., Cohere, BGE)
- Implement hybrid search (combining sparse and dense retrievers)
- Visualize reranking process and score changes
- Enable A/B testing of different reranking strategies
- Generation tab
- Real-time response generation
- Context display
- Add configurable generation model settings (e.g., temperature, max_tokens)
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
- LangChain for text splitting utilities
- Transformers.js for embedding models
- shadcn/ui for the component library