Transformer Feed-Forward Layers Are Key-Value Memories

Mor Geva, Roei Schuster, Jonathan Berant, Omer Levy

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review
