nlp:seq2seq

Differences

This shows you the differences between two versions of the page.

nlp:seq2seq [2025/04/01 06:26] – [Decoding Strategies] jmflanig
nlp:seq2seq [2025/05/29 07:15] (current) – [Decoding Strategies] jmflanig
Line 27:
     * [[https://arxiv.org/pdf/2211.17192|Leviathan et al 2024 - Fast Inference from Transformers via Speculative Decoding]]
     * [[https://arxiv.org/pdf/2404.11912|Sun et al 2024 - TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding]]
 +    * [[https://arxiv.org/pdf/2502.17421|Yang et al 2025 - LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification]]
 +    * [[https://arxiv.org/pdf/2505.20776|Cha et al 2025 - SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences]]
   * **Miscellaneous Decoding Techniques**
     * Contrastive Decoding
nlp/seq2seq.1743488797.txt.gz · Last modified: 2025/04/01 06:26 by jmflanig