What is a Transformer?

June 28, 2023
OpenAI, Machine Learning, Transformers, AI, Deep Learning

The paper “Attention Is All You Need” kept coming up in my circles to read and I finally found time to look at it. It talks about a revolutionary architecture that has been a game-changer in the field, particularly when it comes to natural language processing. For those in the know, the concept is called a Transformer. It’s old news at this point, but for the uninitiated, what exactly is a Transformer, and how does it work? ...