ml:state-space_models
This is an old revision of the document!
Table of Contents
State-Space Models
Overviews
Key Papers
- Mega: Ma et al 2022 - Mega: Moving Average Equipped Gated Attention SOTA on long-range arena benchmark. Combines flash attention with state-space models. Still n-squared runtime however
- Orvieto et al 2023 - Resurrecting Recurrent Neural Networks for Long Sequences Gives a nice history
Papers
People
Related Pages
ml/state-space_models.1702435792.txt.gz · Last modified: 2023/12/13 02:49 by jmflanig