Working Notes: a commonplace notebook for recording & exploring ideas.
Home. Site Map. Subscribe. More at expLog.

Nanogpt

NanoGPT

(using hy)

Building nanogpt

Self attention

nanogpt

chatgpt

Follow ups

Notes on my implementation

2023-10-28

To keep things really simple, I'm working with a/b data strings: something that people also use for manually working with transformers.

The data is generated with a->a, a->b and b->b transitions, with equal weights for a & b. I'll play with this as I understand more.

Kunal