Lumen Language Model

A 128M Parameter Language Model

A 128M Parameter Language Model

A 128M Parameter Language Model

LumenBase is a 128M-parameter transformer language model built from scratch using PyTorch, featuring a custom tokenizer and GQA-based architecture. It includes a complete training and evaluation pipeline, achieving competitive scores on ARC and HellaSwag reasoning benchmarks.

Designing a future I want to see

Avatar of the website author

Hariom Jangra

Think Different, Build Different

Hit me up if you are having any Questions

Hariom.profile

Designing a future I want to see

Avatar of the website author

Hariom Jangra

Think Different, Build Different

Hit me up if you are having any Questions

Hariom.profile

Designing a future I want to see

Avatar of the website author

Hariom Jangra

Think Different, Build Different

Hit me up if you are having any Questions

Hariom.profile