low-level-programming

The modded nanogpt speedrun, but in JAX and on TPUs

A writeup of porting the modded nanoGPT speedrun to pure JAX on TPU v6e, including hardware bottlenecks, bugs, optimizations, and open performance questions.

PSA: Increase your stack size before the Meta Hacker Cup, here's how

A short PSA on increasing stack limits before MHC, with commands, compiler flags, and platform caveats.

On using C on Codeforces (and some compiler update requests)

A discussion of using C on Codeforces.

GCC Optimization Pragmas

An explanation of GCC optimization and target pragmas, what common fake pragmas do not do, and when these flags help or hurt.