AbdulrhmanGoni/LLMs-TDM-App

LLMs Training Datasets Manager

LLMs Training Datasets Manager is a web application for creating and managing training datasets with different formats for training Large Language Models (LLMs) or for using them in Retrieval Augmented Generation (RAG) applications.

Features ✨

  • 🗂️ Create datasets and add instructions to them.
  • ⚙️ Manage the instructions of a dataset (update or delete them).
  • 📑 Browse the instructions of a dataset easily with pagination.
  • 📥 Export datasets to your machine (download them as files).
  • 🤗 Hugging Face integration via the Hugging Face OAuth flow, which enables the following features 👇
    • 📤 Link datasets to Hugging Face dataset repositories and push local datasets to them.
    • 🆕 Create a new Hugging Face dataset repository if needed.
    • 🖲️ Sync local datasets with their linked repositories after updates.
    • 🔌 Unlink a local dataset from its linked repository (with options to delete the entire repository or just the dataset file).
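
To make the browsing and export features above more concrete, here is a minimal TypeScript sketch. It is not taken from the application's source: the `Instruction` field names, the `paginate` and `toJsonl` helpers, and the choice of JSON Lines as an export format are all assumptions made for illustration, and the formats the app actually supports may differ.

```typescript
// Hypothetical shape of a single instruction in a dataset.
// These field names are assumptions, not the app's real schema.
interface Instruction {
  systemMessage?: string;
  prompt: string;
  response: string;
}

// Return one page of items; `page` is 1-based.
function paginate<T>(items: T[], page: number, pageSize: number): T[] {
  const start = (page - 1) * pageSize;
  return items.slice(start, start + pageSize);
}

// Serialize instructions as JSON Lines: one JSON object per line.
// JSONL is used here only as an example of a common training-data format.
function toJsonl(instructions: Instruction[]): string {
  return instructions.map((item) => JSON.stringify(item)).join("\n");
}
```

For example, `paginate(instructions, 2, 10)` would return instructions 11 to 20, and the string returned by `toJsonl(instructions)` could be written to a `.jsonl` file for download.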

Tech Stack

  • React + Next.js
  • TypeScript
  • Tailwind + shadcn/ui
  • Clerk (for user management and authentication)

Installation & Getting Started

Note: You must have Node.js installed on your machine to run the LLMs TDM application.

  1. Clone the repository and open the project's directory
     git clone https://github.com/AbdulrhmanGoni/LLMs-TDM-App.git
     cd LLMs-TDM-App
  2. Install dependencies
     npm install
  3. Copy the .env.example file to .env.development, then set the variables in it by following the guidelines inside .env.example
     cp .env.example .env.development
  4. Run LLMs TDM in development mode
     npm run dev
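
When filling in .env.development, it is easy to miss a variable. The following TypeScript sketch compares the variable names in two dotenv-style texts and reports the ones that are missing. It is an illustration only and not part of the project: `envKeys` and `missingKeys` are hypothetical helpers, and the parser assumes simple `KEY=value` lines with `#` comments.

```typescript
// Extract variable names from dotenv-style text.
// Assumes simple KEY=value lines; blank lines and '#' comments are skipped.
function envKeys(text: string): string[] {
  return text
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line !== "" && !line.startsWith("#") && line.includes("="))
    .map((line) => line.slice(0, line.indexOf("=")).trim());
}

// List the variables that appear in the example text but not in the actual one.
function missingKeys(exampleText: string, actualText: string): string[] {
  const present = new Set(envKeys(actualText));
  return envKeys(exampleText).filter((key) => !present.has(key));
}
```

To use it, read both files (for instance with `readFileSync` from `node:fs`) and pass their contents to `missingKeys`. An empty result means every variable named in .env.example also appears in .env.development. It does not check that the values are valid.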
