ai

Improving the lower bound for the unit distance problem

May 21, 2026 · 17 min · 3599 words · nor

The modded nanogpt speedrun, but in JAX and on TPUs

Theoretical properties of optimizers on a toy problem, and some intuition

August 2, 2025 · 48 min · 10137 words · nor

Deriving RoPE the proper way

July 28, 2025 · 25 min · 5177 words · nor

Quantizing LLMs for inference