This repository has been archived by the owner on Oct 27, 2024. It is now read-only.
Building LLM architectures from scratch using C++

svarunid/llm.cpp

NOTE: Repository archived. Will be continued soon.

llm.cpp

Building LLM architectures from scratch using C++ (Inspired by llm.c)

Overview

I'm new to low-level languages like C/C++. As a software engineer working on deep learning, I want to learn them so that I can write more efficient, performant software.

The goal of this project is for me to become comfortable writing software in C/C++, learn more about system memory, and learn to program with the CUDA APIs.

Once I have accomplished the above, I'll try to train the small (124M-parameter) version of GPT-2 on the FineWeb dataset by HuggingFace. I don't intend this model to compete with existing LLMs; I want the experience of orchestrating the training workflow for a model of this kind.

Structure

The code for the different layers of a neural network is declared in include/nn.hpp, and the implementation can be found in src/nn.cpp. The library is tested with the gtest framework, and the tests live in the tests/ directory.
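
To make the header/implementation split concrete, here is a minimal sketch of how a single layer could be divided between the two files. The class name Linear, its members, and the forward signature are assumptions made up for illustration; they are not taken from the repository's actual nn.hpp or nn.cpp, which may be organized differently. Both halves are shown in one block so the sketch compiles on its own.

```cpp
// Hypothetical sketch: Linear, weight, bias, and forward are illustrative
// names, not the repository's real API.
#include <cstddef>
#include <stdexcept>
#include <vector>

// --- the kind of declaration that could live in include/nn.hpp ---
class Linear {
public:
    Linear(std::size_t in_features, std::size_t out_features);

    // Computes y = W x + b for a single input vector x.
    std::vector<float> forward(const std::vector<float>& x) const;

    std::vector<float> weight;  // row-major, shape [out_features, in_features]
    std::vector<float> bias;    // shape [out_features]

private:
    std::size_t in_features_;
    std::size_t out_features_;
};

// --- the kind of definition that could live in src/nn.cpp ---
Linear::Linear(std::size_t in_features, std::size_t out_features)
    : weight(in_features * out_features, 0.0f),  // zero-initialized for simplicity
      bias(out_features, 0.0f),
      in_features_(in_features),
      out_features_(out_features) {}

std::vector<float> Linear::forward(const std::vector<float>& x) const {
    if (x.size() != in_features_) {
        throw std::invalid_argument("Linear::forward: input size mismatch");
    }
    // Start from the bias, then accumulate one dot product per output row.
    std::vector<float> y(bias);
    for (std::size_t o = 0; o < out_features_; ++o) {
        for (std::size_t i = 0; i < in_features_; ++i) {
            y[o] += weight[o * in_features_ + i] * x[i];
        }
    }
    return y;
}
```

A test in tests/ could then construct a small Linear, set its weight and bias by hand, and check the output of forward against values computed on paper. Keeping the weights in a flat row-major vector is one possible design choice here; it maps directly onto the contiguous buffers that CUDA kernels expect.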
