Skip to content

Latest commit

 

History

History
50 lines (39 loc) · 1.84 KB

README.md

File metadata and controls

50 lines (39 loc) · 1.84 KB

LLMs Training Datasets Manager

LLMs Training Datasets Manager is a web application for creating and managing training datasets with different formats for training Large Language Models (LLMs) or for using them in Retrieval Augmented Generation (RAG) applications.

Features ✨

  • 🗂️ Creating datasets and adding instructions into it.
  • ⚙️ Managing the instructions of the datasets (updating or deleting the instructions).
  • 📑 Browse the instructions of datasets esaly in pagination model.
  • 📥 Export datasets into your machine (Actually download datasets).
  • 🤗 Huggingface integration via Huggingface OAuth flow. And the gained features will be 👇
    • 📤 Linking datasets with Huggingface dataset repository and pushing local datasets to.
    • 🆕 Creating new Huggingface dataset repository if needed
    • 🖲️ Syncing local datasets with their linked repositories after updates.
    • 🔌 Unlinking a local dataset with its linked repository (with options to delete the entire repository or just the dataset file)

Tech Stack

  • React + Next.js
  • Typescript
  • Tailwind + Shadcn/ui
  • Clerk (for users management and authenticating)

Installation & Getting Started

Note ! You must have Node.js installed on your machine to be able to run LLMs TDM Application

  1. Clone the repository and open project's directory
git clone https://github.com/AbdulrhmanGoni/LLMs-TDM-App.git
cd LLMs-TDM-App
  1. Install dependencies
npm install
  1. Copy .env.example file into .env.development file and modify the variables in this file by following the guidelines inside .env.example file.
cp .env.example .env.development
  1. Run LLMs TDM in development mode
npm run dev