We will be learning what are large language models and how they can be used as knowledge bases. At the end we will be building and training a BERT and a miniGPT entirely from scratch though our miniGPT won't be as powerful as the GPT models out there, we will learn how we can improve our model and some of the techniques that can be used to align them towards instructions at the very end of the project, if time permits :). The aim of the project is to make you so well versed in LLMs that you can build and train one from scratch on the go.
Basics of probability theory, statistical machine learning and python
- Gentle Introduction to NLP with word2vec, word embeddings, Distributional sementics
- Introduction to pytorch and neural networks, convolutional layers and pooling, building cnns and training them on dummy datasets
- Text classification, building generative and discriminative models
- Language Modeling, N gram LMs, Neural LMs, evaluating LMs
- Building encoder-decoder models, autoencoder and inferencing
- Introducing attention(Transformer: Attention is All You Need) in encoder-decoders, building a transformer from scratch, Seq2Seq
- Transfer learning, replacing pre trained word embeddings in GPT and BERT
- Building and training BERT and miniGPT in pytorch from scratch
- building and training cnns in pytorch
- building and training a bayesian classifier
- building and training autoencoders for image generation
- building and training a transformer from scratch
- building and training BERT and miniGPT in pytorch from scratch
To get started with python : https://docs.python.org/3.11/tutorial/index.html
Python and Numpy Tutorial : https://cs231n.github.io/python-numpy-tutorial/