Fast Inference from Transformers via Speculative Decoding

Yaniv Leviathan*, Matan Kalman, Yossi Matias

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

7 Scopus citations

Fingerprint

Dive into the research topics of 'Fast Inference from Transformers via Speculative Decoding'. Together they form a unique fingerprint.

Mathematics

Engineering & Materials Science