Elelems

elelems logo

Mini decoder-only causal Transformer (GPT-like SLM)

The main goal is to learn how LLMs work and to build a very simple model capable of performing function calling in video games. I may get some things wrong, but I'll fix them as I learn lol.

I generated the corpus using gpt-5-mini.

Quick Start

After cloning the repo (and assuming you're not a boomer and you use uv):

uv sync
uv run python main.py prepare
uv run python main.py train --epochs 30 --batch-size 32
uv run python main.py generate --prompt "el gato" --max-new-tokens 8 --top-k 5

Commands

  • prepare: cleans the corpus, builds vocab.json, and creates train/val splits in data/processed.
  • train: trains the mini transformer and saves data/checkpoints/mini_llm.pt.
  • generate: loads the checkpoint and generates text until <END>.
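The generate step above is plain autoregressive decoding with top-k sampling. As a rough sketch (not the actual elelems code; `model_fn`, `end_id`, and the token ids here are hypothetical), the loop looks roughly like this:

```python
import numpy as np

def top_k_sample(logits, k, rng):
    """Sample a token id from the k highest-scoring logits only."""
    idx = np.argsort(logits)[-k:]               # indices of the top-k logits
    probs = np.exp(logits[idx] - logits[idx].max())
    probs /= probs.sum()                        # softmax over just the top k
    return int(rng.choice(idx, p=probs))

def generate(model_fn, prompt_ids, end_id, max_new_tokens=8, top_k=5, seed=0):
    """Feed the growing sequence back into the model until <END> or the budget runs out."""
    rng = np.random.default_rng(seed)
    ids = list(prompt_ids)
    for _ in range(max_new_tokens):
        logits = model_fn(ids)                  # vocab-sized next-token scores
        nxt = top_k_sample(logits, top_k, rng)
        ids.append(nxt)
        if nxt == end_id:                       # stop at the <END> token
            break
    return ids
```

With `--top-k 1` this degenerates to greedy decoding (always pick the argmax); larger k trades determinism for variety.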

Structure

  • data/raws/animals.txt: base corpus.
  • elelems/core/data: cleaning, tokenization, vocabulary, and data preparation.
  • elelems/core/model: causal mini transformer.
  • elelems/core/train.py: training loop.
  • elelems/core/generate.py: autoregressive inference.
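The "causal" in the causal mini transformer means each position can only attend to itself and earlier positions. A minimal illustration of that mask (again, a sketch, not the repo's actual implementation):

```python
import numpy as np

def causal_mask(t):
    # True where attention is allowed: position i may look at j <= i
    return np.tril(np.ones((t, t), dtype=bool))

def mask_scores(scores):
    # set disallowed (future) positions to -inf so softmax zeroes them out
    t = scores.shape[-1]
    return np.where(causal_mask(t), scores, -np.inf)
```

This lower-triangular mask is what makes next-token training work: the model can't cheat by peeking at the token it's supposed to predict.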

About

A very, very, very simple SLM. I'm just learning.
