Working Notes: a commonplace notebook for recording & exploring ideas.

Transformers

Transformers have the potential to change programming. I'm spending the time to understand how they work, then working backwards from that to figure out what would be valuable to build with them.

Exploring

Ideally I can use this to build simple models for personal projects (shell completion, etc.) and also to write about mechanical sympathy for models.

Terms

Term | One line definition | Detailed Notes
Attention
cuBLAS
Decoder
Encoder
Flash Attention [2]
FP8
MKL
NCCL
Pipeline Parallelism
Transformer

Resources

Kunal