CS25 I Stanford Seminar - Self Attention and Non-parametric transformers (NPTs)