One Attention Head is all you Need

Using transformer_lens to determine the algorithm a single-head transformer uses to sort a fixed-length list.

attention heat map matrix
loss and accuracy graph

Inspiration

Given that we are in the era of LLMs, we both thought it would be fun to challenge ourselves with a lower-level problem.

What it does

How we built it

A Google Colab notebook.
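
The write-up does not describe the training data, so the following is only a minimal sketch of how examples for a fixed-length sorting task might be generated. It assumes each sequence is an unsorted list, a separator token, and then the sorted list. The names `LIST_LENGTH`, `VOCAB_SIZE`, `SEP_TOKEN`, `make_example`, `make_batch`, and `target_positions` are hypothetical, as are the values chosen; none of them come from the project notebook.

```python
import random

# Hypothetical settings; the write-up does not state the real values.
LIST_LENGTH = 5         # fixed length of each list to sort
VOCAB_SIZE = 64         # list elements are integer tokens in [0, VOCAB_SIZE)
SEP_TOKEN = VOCAB_SIZE  # extra token marking where the sorted output begins


def make_example(rng):
    """Return one token sequence: unsorted list, separator, sorted list."""
    unsorted = rng.sample(range(VOCAB_SIZE), LIST_LENGTH)  # distinct values
    return unsorted + [SEP_TOKEN] + sorted(unsorted)


def make_batch(batch_size, seed=0):
    """Return a reproducible list of `batch_size` token sequences."""
    rng = random.Random(seed)
    return [make_example(rng) for _ in range(batch_size)]


def target_positions():
    """Positions whose next-token prediction should be a sorted element.

    The separator sits at index LIST_LENGTH, so predicting the token after
    positions LIST_LENGTH .. 2 * LIST_LENGTH - 1 yields the sorted list.
    """
    return list(range(LIST_LENGTH, 2 * LIST_LENGTH))
```

Under these assumptions, the loss would be computed only at the positions returned by `target_positions()`, since the unsorted half of each sequence is random and cannot be predicted.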

Challenges we ran into

Generated the

Accomplishments that we're proud of

We were able to watch and be guided by a researcher from a company that built a major LLM.

What we learned

We learned about TransformerLens.

What's next for One Attention Head is all you Need

More research

References

[200 Concrete Problems In Interpretability Spreadsheet - Google Sheets](https://docs.google.com/spreadsheets/d/1oOdrQ80jDK-aGn-EVdDt3dg65GhmzrvBWzJ6MUZB8n4/edit#gid=0)
200 Concrete Open Problems in Mechanistic Interpretability: Introduction — AI Alignment Forum
One Attention Head Is All You Need for Sorting Fixed-Length Lists by MatthewBaggins (itch.io)

Built With

torch

Updates

Campbell Hutcheson started this project — Mar 24, 2024 02:36 PM EDT
