User Tools

Site Tools


ml:state-space_models

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
ml:state-space_models [2025/08/22 17:52] – [Key Papers] jmflanigml:state-space_models [2025/08/22 17:58] (current) – [Key Papers] jmflanig
Line 25: Line 25:
     * [[https://github.com/sustcsonglin/mamba-triton/tree/master]]     * [[https://github.com/sustcsonglin/mamba-triton/tree/master]]
   * [[https://arxiv.org/pdf/2401.13660.pdf|Wang et al 2024 - MambaByte: Token-free Selective State Space Model]]   * [[https://arxiv.org/pdf/2401.13660.pdf|Wang et al 2024 - MambaByte: Token-free Selective State Space Model]]
 +  * [[https://arxiv.org/pdf/2402.19427|De et al 2024 - Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models]]
   * [[https://arxiv.org/pdf/2403.01590|Ali et al 2024 - The Hidden Attention of Mamba Models]]   * [[https://arxiv.org/pdf/2403.01590|Ali et al 2024 - The Hidden Attention of Mamba Models]]
   * [[https://arxiv.org/pdf/2403.19887|Leiber et al 2024 - Jamba: A Hybrid Transformer-Mamba Language Model]]   * [[https://arxiv.org/pdf/2403.19887|Leiber et al 2024 - Jamba: A Hybrid Transformer-Mamba Language Model]]
ml/state-space_models.1755885161.txt.gz · Last modified: 2025/08/22 17:52 by jmflanig

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki