LLMs Training Datasets Manager is a web application for creating and managing training datasets with different formats for training Large Language Models (LLMs) or for using them in Retrieval Augmented Generation (RAG) applications.
- 🗂️ Creating datasets and adding instructions into it.
- ⚙️ Managing the instructions of the datasets (updating or deleting the instructions).
- 📑 Browse the instructions of datasets esaly in pagination model.
- 📥 Export datasets into your machine (Actually download datasets).
- 🤗 Huggingface integration via Huggingface OAuth flow. And the gained features will be 👇
- 📤 Linking datasets with Huggingface dataset repository and pushing local datasets to.
- 🆕 Creating new Huggingface dataset repository if needed
- 🖲️ Syncing local datasets with their linked repositories after updates.
- 🔌 Unlinking a local dataset with its linked repository (with options to delete the entire repository or just the dataset file)
- React + Next.js
- Typescript
- Tailwind + Shadcn/ui
- Clerk (for users management and authenticating)
Note ! You must have Node.js installed on your machine to be able to run LLMs TDM Application
- Clone the repository and open project's directory
git clone https://github.com/AbdulrhmanGoni/LLMs-TDM-App.git
cd LLMs-TDM-App
- Install dependencies
npm install
- Copy
.env.example
file into.env.development
file and modify the variables in this file by following the guidelines inside.env.example
file.
cp .env.example .env.development
- Run LLMs TDM in development mode
npm run dev