This repository contains an implementation of a simple GPT (Generative Pretrained Transformer) model built from scratch using PyTorch. The model learns to generate text based on a given dataset, inspired by Karpathy's minGPT approach.
The model is trained on Shakespearean text. The dataset can be downloaded from:

```
https://github.com/karpathy/char-rnn/blob/master/data/tinyshakespeare/input.txt
```
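If you prefer to fetch the dataset from Python, here is a minimal sketch; the raw-file URL is inferred from the repository path above, and the output filename `input.txt` is an assumption:

```python
import os
import urllib.request

# Raw-file URL inferred from the repository path above (an assumption).
URL = "https://raw.githubusercontent.com/karpathy/char-rnn/master/data/tinyshakespeare/input.txt"

# Download once; skip if input.txt is already present.
if not os.path.exists("input.txt"):
    urllib.request.urlretrieve(URL, "input.txt")
```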
To run the project, install the required dependencies:

```
pip install torch
```

The core components of the model include:
- Character Tokenization: Maps characters to integer indices and back; see the first sketch after this list.
- Bigram Language Model: Uses a token-embedding lookup table to predict the next token directly from the current one.
- Self-Attention Mechanism: Implements multi-head causal self-attention so each token can gather context from earlier positions; a single head is sketched after this list.
- Transformer Blocks: Stack self-attention and feedforward layers to build progressively richer representations.
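The character tokenizer can be sketched as follows. This is a minimal illustration assuming the dataset has been saved as `input.txt`; the `stoi`/`itos` names follow the usual minGPT convention rather than this repository's exact identifiers:

```python
# Build the character vocabulary from the training text.
with open("input.txt", "r", encoding="utf-8") as f:
    text = f.read()

chars = sorted(set(text))                      # unique characters, fixed order
stoi = {ch: i for i, ch in enumerate(chars)}   # character -> integer index
itos = {i: ch for i, ch in enumerate(chars)}   # integer index -> character

encode = lambda s: [stoi[c] for c in s]              # string -> list of ints
decode = lambda ids: "".join(itos[i] for i in ids)   # list of ints -> string
```

A single head of causal self-attention can be sketched like this; it illustrates the mechanism rather than this repository's exact code, and the constructor arguments are assumptions:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Head(nn.Module):
    """One head of causal self-attention."""

    def __init__(self, n_embd, head_size, block_size):
        super().__init__()
        self.key = nn.Linear(n_embd, head_size, bias=False)
        self.query = nn.Linear(n_embd, head_size, bias=False)
        self.value = nn.Linear(n_embd, head_size, bias=False)
        # Lower-triangular mask so each position attends only to the past.
        self.register_buffer("tril", torch.tril(torch.ones(block_size, block_size)))

    def forward(self, x):
        B, T, C = x.shape
        k = self.key(x)                                       # (B, T, head_size)
        q = self.query(x)                                     # (B, T, head_size)
        wei = q @ k.transpose(-2, -1) * k.shape[-1] ** -0.5   # scaled scores (B, T, T)
        wei = wei.masked_fill(self.tril[:T, :T] == 0, float("-inf"))
        wei = F.softmax(wei, dim=-1)                          # attention weights
        return wei @ self.value(x)                            # weighted sum of values
```

Multi-head attention simply runs several such heads in parallel and concatenates their outputs.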
The model is trained using an AdamW optimizer with the following hyperparameters:

- Batch size: 32
- Block size: 8
- Learning rate: 1e-3
- Training steps: 5000
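Put together, the training step looks roughly like the sketch below; `model` and `train_data` (the encoded text as a 1-D tensor) are assumed names standing in for whatever `train.py` defines, and the forward pass is assumed to return `(logits, loss)` in the usual minGPT style:

```python
import torch

batch_size = 32
block_size = 8
learning_rate = 1e-3
max_steps = 5000

def get_batch(data):
    # Sample random (input, target) windows; targets are inputs shifted by one.
    ix = torch.randint(len(data) - block_size, (batch_size,))
    x = torch.stack([data[i : i + block_size] for i in ix])
    y = torch.stack([data[i + 1 : i + block_size + 1] for i in ix])
    return x, y

# `model` is the Transformer defined in train.py (an assumption).
optimizer = torch.optim.AdamW(model.parameters(), lr=learning_rate)

for step in range(max_steps):
    xb, yb = get_batch(train_data)         # assumed training-split tensor
    logits, loss = model(xb, yb)           # forward pass returns (logits, loss)
    optimizer.zero_grad(set_to_none=True)
    loss.backward()
    optimizer.step()
```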
To train the model, run:
```
python train.py
```

Example generated text:
```
To spits as stold's bewear I would and say mesby all
on sworn make he anough
As cousins the solle, whose be my conforeful may lie them yet
```
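Generation is autoregressive: feed the current context, sample the next character from the model's output distribution, append it, and repeat. A minimal sketch, again assuming the model's forward pass returns `(logits, loss)` with `loss=None` when no targets are given:

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def generate(model, idx, max_new_tokens, block_size):
    # idx is (B, T): the running sequence of token indices.
    for _ in range(max_new_tokens):
        idx_cond = idx[:, -block_size:]      # crop to the context window
        logits, _ = model(idx_cond)          # assumed forward signature
        logits = logits[:, -1, :]            # distribution for the last position
        probs = F.softmax(logits, dim=-1)
        idx_next = torch.multinomial(probs, num_samples=1)   # sample one token
        idx = torch.cat((idx, idx_next), dim=1)
    return idx
```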
This project is inspired by Andrej Karpathy's work on GPT and language modeling.