NOTE: Repository archived. Will be continued soon.

llm.cpp

Building LLM architectures from scratch using C++ (Inspired by llm.c)

Overview

I'm new to low-level languages like C/C++. Being a software engineer working on deep learning, I wanted to learn them to be able to better develop softwares that are efficient & performant.

The goal of this project is, for me to able to write softwares comfortably in C/C++, learn more about system memory and also learn to program using CUDA APIs.

Once I have accomplished above, I'll try to train a small version (124M) of GPT-2 on the FineWeb dataset by HuggingFace. Though I don't intend this model to compete with LLMs, I want the experience of orchestrating a workflow of training such large models.

Structure

The code for different layer of a neural network lives in include/nn.hpp file while the implementation can be found in src/nn.cpp. The library is tested using gtest framework and the test live in tests/ directory.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
include		include
src		src
tests		tests
.clang-format		.clang-format
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NOTE: Repository archived. Will be continued soon.

llm.cpp

Overview

Structure

About

Releases

Packages

Languages

svarunid/llm.cpp

Folders and files

Latest commit

History

Repository files navigation

NOTE: Repository archived. Will be continued soon.

llm.cpp

Overview

Structure

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages