Star 历史趋势
数据来源: GitHub API · 生成自 Stargazers.cn
README.md

🌐 Select Language / 日本語 🇯🇵 | 中文 🇨🇳

colab-badge.svg) Build LLM on Google Colab from scratch

EveryonesLLM_demo.gif

Click-> AI YOU build in Chapter29😘


Table of Contents

WebUI.png

WebApp Released (Now only in Japanese)

EveryonesLLM

ChapterEstimated TimeNotebook
Chapter 00: Start Tutorial1-2 hoursOpen in Colab
Chapter 01: Dataloader1-2 hoursOpen in Colab
Chapter 02: TokenEmbedding0.5-1 hourOpen in Colab
Chapter 03: PositionEmbedding0.5-1 hourOpen in Colab
Chapter 04: EmbeddingModule0.5-1 hourOpen in Colab
Chapter 05: LayerNorm1-2 hoursOpen in Colab
Chapter 06: AttentionHead3-4 hoursOpen in Colab
Chapter 07: MultiHeadAttention1-2 hoursOpen in Colab
Chapter 08: FeedForward1-2 hoursOpen in Colab
Chapter 09: TransformerBlock0.5-1 hourOpen in Colab
Chapter 10: VocabularyLogits0.5-1 hourOpen in Colab
Chapter 11: nanoGPT1-2 hoursOpen in Colab
Chapter 12: Trainer1-2 hoursOpen in Colab
Chapter 13: Tokens per second(CPU)1-2 hoursOpen in Colab
Chapter 14: Tokens per second(T4 GPU)0.5-1 hourOpen in Colab
Chapter 15: Train nanoGPT with GPU0.5-1 hourOpen in Colab
Chapter 16: Make only the model size bigger0.5-1 hour (+ 1 hour model training)Open in Colab
Chapter 17: Make the dataset bigger1-2 hours (+ 1 hour model training)Open in Colab
Chapter 18: tiktoken1-2 hours (+ 1 hour model training)Open in Colab
Chapter 19: Long Train1-2 hours (+ 6 hours model training)Open in Colab
Chapter 20: Learning rate0.5-1 hourOpen in Colab
Chapter 21: Scaling Law1-2 hoursOpen in Colab
Chapter 22: TinyStories(Main)1-2 hoursOpen in Colab
Chapter 22: TinyStories(Model Training)1 hourOpen in Colab
Chapter 23: RPE(OverSimplified)2-3 hoursOpen in Colab
Chapter 24: RPE(Simplified)1-2 hours (+ 1 hour model training)Open in Colab
Chapter 25: LR schedule1 hourOpen in Colab
Chapter 26: Checkpoint1 hourOpen in Colab
Chapter 27: Pretraining0.5 hour (+ 20 hours model training)Open in Colab
Chapter 28: Instruction Tuning0.5 hour (+ 0.5 hour model training)Open in Colab
Chapter 29: Magpie (Prompt mask)1.5 hours (+ 2 hours model training)Open in Colab

2026/6/5 Vision LLM beta is now available!

Explanations and exercises are not available yet. Evaluation on major benchmarks is also not available yet.

Please use it for early preview learning. We plan to update it from time to time, so we recommend working on it after future updates.

ChapterEstimated timeNotebook
Chapter 30: Vision Pretraining (Beta)3 hours model trainingOpen in Colab
Chapter 31: Vision Instruction Tuning (Beta)2 hours model trainingOpen in Colab

EveryonesVLM_demo

Link to Web App (Vision LLM)



Tensor Map (Full Tensor Overview)

Try making the tensor map below by yourself!
Do not worry, I prepared lots of hints for you.
View the full-resolution Tensor Map of the nanoGPT model on Canva

Everyones TensorMap


About the Development Environment

To keep setup easy, please try running all the samples on Google Colab.

However, Google Colab does not save checkmarks in checkboxes.
If you want to track your progress, or if you want to work little by little, say every 30 minutes, I recommend VS Code.
In that case, fork this repository and clone it to your own PC. Just use Google Colab extension for your VS code, then you can use Colab CPU and GPU.


Answers

ChapterEstimated TimeNotebook
Chapter 00: Start Tutorial1-2 hoursOpen in Colab
Chapter 01: Dataloader1-2 hoursOpen in Colab
Chapter 02: TokenEmbedding0.5-1 hourOpen in Colab
Chapter 03: PositionEmbedding0.5-1 hourOpen in Colab
Chapter 04: EmbeddingModule0.5-1 hourOpen in Colab
Chapter 05: LayerNorm1-2 hoursOpen in Colab
Chapter 06: AttentionHead3-4 hoursOpen in Colab
Chapter 07: MultiHeadAttention1-2 hoursOpen in Colab
Chapter 08: FeedForward1-2 hoursOpen in Colab
Chapter 09: TransformerBlock0.5-1 hourOpen in Colab
Chapter 10: VocabularyLogits0.5-1 hourOpen in Colab
Chapter 11: nanoGPT1-2 hoursOpen in Colab
Chapter 12: Trainer1-2 hoursOpen in Colab
Chapter 13: Tokens per second(CPU)1-2 hoursOpen in Colab
Chapter 14: Tokens per second(T4 GPU)0.5-1 hourOpen in Colab
Chapter 15: Train nanoGPT with GPU0.5-1 hourOpen in Colab
Chapter 16: Make only the model size bigger0.5-1 hour (+ 1 hour model training)Open in Colab
Chapter 17: Make the dataset bigger1-2 hours (+ 1 hour model training)Open in Colab
Chapter 18: tiktoken1-2 hours (+ 1 hour model training)Open in Colab
Chapter 19: Long Train1-2 hours (+ 6 hours model training)Open in Colab
Chapter 20: Learning rate0.5-1 hourOpen in Colab
Chapter 21: Scaling Law1-2 hoursOpen in Colab
Chapter 22: TinyStories(Main)1-2 hoursOpen in Colab
Chapter 22: TinyStories(Model Training)1 hourOpen in Colab
Chapter 23: RPE(OverSimplified)2-3 hoursOpen in Colab
Chapter 24: RPE(Simplified)1-2 hours (+ 1 hour model training)Open in Colab
Chapter 25: LR schedule1 hourOpen in Colab
Chapter 26: Checkpoint1 hourOpen in Colab
Chapter 27: Pretraining0.5 hour (+ 20 hours model training)Open in Colab
Chapter 28: Instruction Tuning0.5 hour (+ 1 hour model training)Open in Colab
Chapter 29: Magpie (Prompt mask)1.5 hours (+ 2 hours model training)Open in Colab

Sources

This tutorial is based on Andrej Karpathy's nanoGPT and jingyaogong's Minimind. For Instruction Tuning, it refers to Sebastian Raschka's book Build a Large Language Model (From Scratch). For Vision LLM, it refers to LLaVA. I would like to take this opportunity to express my sincere gratitude.

Notice

This project is a community-based open-source educational project and is not affiliated with Google in any way.

About Project EveryonesLLM

EveryonesLLM Logo
EveryonesLLM Goal
EveryonesLLM Idea
EveryonesLLM Prerequites

关于 About

No description, website, or topics provided.

语言 Languages

Jupyter Notebook100.0%
Dockerfile0.0%

提交活跃度 Commit Activity

代码提交热力图
过去 52 周的开发活跃度
106
Total Commits
峰值: 37次/周
Less
More

核心贡献者 Contributors