A project to build and train a Large Language Model (LLM) from scratch, implementing core components and training procedures to understand how modern language models work.
The end goal is a complete LLM trained from scratch, scaled to whatever size your hardware allows. Along the way, the project covers the fundamentals of transformer architectures, tokenization, training loops, and model optimization.
This project is a learning exercise aimed at understanding LLMs at a fundamental level; the implementation prioritizes clarity and educational value over raw performance.
- Python 3.8+
- CUDA-capable GPU (recommended for training)
- Sufficient RAM/VRAM for your target model size
- Tokenizer implementation (BPE/WordPiece); rough sketches of this and the other roadmap items follow after this list
- Transformer architecture (attention, feed-forward, layer norm)
- Positional encoding
- Training loop with gradient accumulation
- Data loading and preprocessing pipeline
- Model checkpointing and resuming
- Inference engine
- Model quantization (for deployment)
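The sketches below walk through each roadmap item. They assume PyTorch as the training framework (the repo itself only requires Python 3.8+ and, ideally, a CUDA GPU); hyperparameter values, variable names, and paths are illustrative placeholders rather than a fixed design. First, a minimal byte-pair-encoding (BPE) trainer in pure Python: it repeatedly merges the most frequent adjacent symbol pair, which is the core idea behind GPT-style tokenizers.

```python
# Minimal BPE training sketch: learn merge rules from a toy corpus.
from collections import Counter

def get_pair_counts(words):
    """Count adjacent symbol pairs across the corpus, weighted by word frequency."""
    pairs = Counter()
    for symbols, freq in words.items():
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return pairs

def merge_pair(words, pair):
    """Replace every occurrence of `pair` with a single merged symbol."""
    merged = {}
    for symbols, freq in words.items():
        out, i = [], 0
        while i < len(symbols):
            if i < len(symbols) - 1 and (symbols[i], symbols[i + 1]) == pair:
                out.append(symbols[i] + symbols[i + 1])
                i += 2
            else:
                out.append(symbols[i])
                i += 1
        merged[tuple(out)] = freq
    return merged

def train_bpe(corpus, num_merges):
    """Return the ordered list of merge rules learned from `corpus`."""
    words = Counter(tuple(w) for w in corpus.split())
    merges = []
    for _ in range(num_merges):
        pairs = get_pair_counts(words)
        if not pairs:
            break
        best = max(pairs, key=pairs.get)
        words = merge_pair(words, best)
        merges.append(best)
    return merges

print(train_bpe("low lower lowest low low", num_merges=5))
```

A production tokenizer would add byte-level fallback, special tokens, and a precompiled merge table for fast encoding.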
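Next, a minimal pre-norm transformer block covering the attention, feed-forward, and layer-norm pieces, again assuming PyTorch; `n_embd` and `n_head` are illustrative defaults.

```python
# A minimal pre-norm transformer block sketch (PyTorch assumed).
import math
import torch
import torch.nn as nn

class CausalSelfAttention(nn.Module):
    def __init__(self, n_embd, n_head):
        super().__init__()
        assert n_embd % n_head == 0
        self.n_head = n_head
        self.qkv = nn.Linear(n_embd, 3 * n_embd)   # joint projection to queries, keys, values
        self.proj = nn.Linear(n_embd, n_embd)      # output projection

    def forward(self, x):
        B, T, C = x.shape
        q, k, v = self.qkv(x).split(C, dim=2)
        # reshape to (B, n_head, T, head_dim)
        q, k, v = (t.view(B, T, self.n_head, C // self.n_head).transpose(1, 2) for t in (q, k, v))
        att = (q @ k.transpose(-2, -1)) / math.sqrt(k.size(-1))
        mask = torch.triu(torch.ones(T, T, dtype=torch.bool, device=x.device), diagonal=1)
        att = att.masked_fill(mask, float("-inf"))   # causal mask: no attending to future positions
        y = att.softmax(dim=-1) @ v
        return self.proj(y.transpose(1, 2).contiguous().view(B, T, C))

class Block(nn.Module):
    """Pre-norm block: LayerNorm -> attention -> residual, then LayerNorm -> MLP -> residual."""
    def __init__(self, n_embd=256, n_head=4):
        super().__init__()
        self.ln1 = nn.LayerNorm(n_embd)
        self.attn = CausalSelfAttention(n_embd, n_head)
        self.ln2 = nn.LayerNorm(n_embd)
        self.mlp = nn.Sequential(nn.Linear(n_embd, 4 * n_embd), nn.GELU(), nn.Linear(4 * n_embd, n_embd))

    def forward(self, x):
        x = x + self.attn(self.ln1(x))
        x = x + self.mlp(self.ln2(x))
        return x
```

The pre-norm arrangement (LayerNorm before each sub-layer) is the variant used by GPT-2 and minGPT and tends to train more stably than the original post-norm layout.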
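For positional encoding, one option is the fixed sinusoidal scheme from "Attention Is All You Need"; a learned `nn.Embedding` over positions (as in minGPT) works just as well. A sketch of the sinusoidal version, assuming an even embedding width:

```python
# Sinusoidal positional encoding sketch (assumes n_embd is even).
import math
import torch

def sinusoidal_positions(seq_len, n_embd):
    """Return a (seq_len, n_embd) tensor of fixed sinusoidal position embeddings."""
    pos = torch.arange(seq_len).unsqueeze(1).float()                                   # (seq_len, 1)
    div = torch.exp(torch.arange(0, n_embd, 2).float() * (-math.log(10000.0) / n_embd))
    pe = torch.zeros(seq_len, n_embd)
    pe[:, 0::2] = torch.sin(pos * div)    # even dimensions
    pe[:, 1::2] = torch.cos(pos * div)    # odd dimensions
    return pe

# Added to token embeddings before the first transformer block:
# x = token_embeddings + sinusoidal_positions(T, C).to(token_embeddings.device)
```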
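A training-loop sketch with gradient accumulation follows; `model`, `train_loader`, and the hyperparameters are placeholders. Dividing the loss by `accum_steps` makes the accumulated gradient an average over the effective batch.

```python
# Training loop with gradient accumulation (PyTorch assumed).
import torch
import torch.nn.functional as F
from torch.nn.utils import clip_grad_norm_

def train_epoch(model, train_loader, optimizer, accum_steps=8, device="cuda"):
    model.train()
    optimizer.zero_grad(set_to_none=True)
    for step, (inputs, targets) in enumerate(train_loader):
        inputs, targets = inputs.to(device), targets.to(device)
        logits = model(inputs)                                   # (B, T, vocab_size)
        loss = F.cross_entropy(logits.view(-1, logits.size(-1)), targets.view(-1))
        (loss / accum_steps).backward()                          # scale so grads average over the window
        if (step + 1) % accum_steps == 0:
            clip_grad_norm_(model.parameters(), 1.0)             # keep gradient norms bounded
            optimizer.step()
            optimizer.zero_grad(set_to_none=True)
```

Gradient accumulation lets limited VRAM simulate a large batch: the effective batch size is `batch_size * accum_steps`.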
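For the data pipeline, a simple approach is to tokenize the corpus once into a single long id stream and cut it into fixed-length blocks for next-token prediction. A sketch using PyTorch's `Dataset`/`DataLoader`:

```python
# Data pipeline sketch: fixed-length (input, target) pairs for next-token prediction.
import torch
from torch.utils.data import Dataset, DataLoader

class TokenDataset(Dataset):
    def __init__(self, token_ids, block_size=128):
        self.data = torch.tensor(token_ids, dtype=torch.long)
        self.block_size = block_size

    def __len__(self):
        return len(self.data) - self.block_size

    def __getitem__(self, idx):
        chunk = self.data[idx : idx + self.block_size + 1]
        return chunk[:-1], chunk[1:]          # inputs and next-token targets, shifted by one

# loader = DataLoader(TokenDataset(ids, block_size=128), batch_size=32, shuffle=True)
```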
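Checkpointing and resuming can be as simple as saving the model and optimizer state dicts plus the current step; the dictionary keys and path here are illustrative.

```python
# Checkpoint save/resume sketch (PyTorch assumed).
import torch

def save_checkpoint(path, model, optimizer, step):
    torch.save({
        "model": model.state_dict(),
        "optimizer": optimizer.state_dict(),
        "step": step,
    }, path)

def load_checkpoint(path, model, optimizer, device="cpu"):
    ckpt = torch.load(path, map_location=device)
    model.load_state_dict(ckpt["model"])
    optimizer.load_state_dict(ckpt["optimizer"])
    return ckpt["step"]                        # resume training from this step
```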
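A minimal inference engine is an autoregressive sampling loop with temperature and top-k filtering, assuming a model that maps `(B, T)` token ids to `(B, T, vocab_size)` logits:

```python
# Autoregressive sampling sketch with temperature and top-k filtering.
import torch

@torch.no_grad()
def generate(model, prompt_ids, max_new_tokens=50, temperature=1.0, top_k=40, block_size=128):
    model.eval()
    ids = prompt_ids.clone()                               # (B, T) prompt token ids
    for _ in range(max_new_tokens):
        context = ids[:, -block_size:]                     # crop to the model's context window
        logits = model(context)[:, -1, :] / temperature    # logits for the next token only
        if top_k is not None:
            topk_vals, _ = torch.topk(logits, top_k)
            logits[logits < topk_vals[:, [-1]]] = float("-inf")   # drop everything outside the top-k
        probs = torch.softmax(logits, dim=-1)
        next_id = torch.multinomial(probs, num_samples=1)
        ids = torch.cat([ids, next_id], dim=1)
    return ids
```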
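Finally, for deployment, PyTorch's post-training dynamic quantization is a low-effort starting point: it converts `nn.Linear` weights to int8 for CPU inference. This is a sketch of one option, not the only quantization route.

```python
# Post-training dynamic quantization sketch (CPU inference, PyTorch assumed).
import torch

def quantize_for_cpu(model):
    model.eval()
    # Quantize Linear layers to int8; activations are quantized dynamically at runtime.
    return torch.quantization.quantize_dynamic(model, {torch.nn.Linear}, dtype=torch.qint8)

# quantized = quantize_for_cpu(trained_model.cpu())
# torch.save(quantized.state_dict(), "model_int8.pt")
```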
- [Attention Is All You Need](https://arxiv.org/abs/1706.03762) - the original Transformer paper
- [The Illustrated Transformer](https://jalammar.github.io/illustrated-transformer/) - a visual guide to the Transformer architecture
- [minGPT](https://github.com/karpathy/minGPT) - a minimal GPT implementation to use as a reference