Runs GPT-2 text generation implemented in Magnetron. Includes KV caching and optional streaming output. Uses transformers for weights and tiktoken for tokenization.
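The KV cache mentioned above avoids recomputing attention keys and values for tokens that were already generated: each decode step appends one new row per cache instead of re-projecting the whole sequence. A minimal NumPy sketch of the idea (illustrative only, not Magnetron's actual implementation; all names here are made up):

```python
import numpy as np

def attend(q, k_cache, v_cache):
    # q: (d,) query for the newest token; caches hold one row per prior token.
    scores = k_cache @ q / np.sqrt(q.shape[0])
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    return weights @ v_cache

d = 4
k_cache = np.empty((0, d))
v_cache = np.empty((0, d))
for step in range(3):
    # Stand-ins for the real Q/K/V projections of the newest token.
    q = k_new = v_new = np.ones(d) * (step + 1)
    # Append one row instead of recomputing K/V for the whole prefix.
    k_cache = np.vstack([k_cache, k_new])
    v_cache = np.vstack([v_cache, v_new])
    out = attend(q, k_cache, v_cache)

print(k_cache.shape)  # cache grows by one row per generated token
```

Because the cache grows linearly with the generated sequence, long generations (e.g. `--max_tokens 128`) pay only one attention row per step rather than recomputing the full prefix.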
From the repo root:
```shell
uv pip install -e .[examples]
python examples/gpt2/main.py "What is the answer to life?"
```

Pick a model and generation settings:

```shell
python examples/gpt2/main.py "Write a haiku about compilers" --model gpt2-xl --max_tokens 128 --temp 0.7
```

Disable streaming:

```shell
python examples/gpt2/main.py "Hello" --no-stream
```

- First run downloads model weights from Hugging Face.