Working notebook: a commonplace blog for collecting notes & exploring ideas.
Home. Site Map. Subscribe. More at expLog.

Transformers

Having spent a lot of time over the past few years on building infrastructure for Transformer models, I’m still not crystal clear on the actual calculations that happen within them. This work log is for experimenting with and building my own transformers and looking at the values inside them.

January 2025

Building minimal transformers

As a first attempt, trying to build simple transformers: I have vague memories of doing something similar while working through the videos by Andrej Karpathy but this time around I’ll poke a little bit myself. Reading about circuits was also helpful in getting ready for this.

Things I’d generally like to work on here:

Kunal