In Language Modeling, (Selective) Attention Matters

With concepts and examples in question-answering tasks

Arun Jagota
Level Up Coding
Published in
15 min readFeb 14, 2023

--

Image by David from Pixabay

Well, it’s hard to argue against paying attention to the right things. Unfortunately, language modeling algorithms were not doing so for a long time.

This all changed about half a decade back. The rest is history. As witnessed most recently with ChatGPT.

--

--

PhD, Computer Science, neural nets. 14+ years in industry: data science algos developer. 24+ patents issued. 50 academic pubs. Blogs on ML/data science topics.