Efficient Neural Networks
Methods for making neural networks more efficient to train and run.
Overviews
- General
- For LLMs
- Reasoning LLMs
Efficient Transformers
- Pope 2022 - Efficiently Scaling Transformer Inference. Often cited for the idea of the KV cache.
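The KV cache mentioned above can be illustrated with a minimal sketch. It is not taken from the paper: it assumes a single attention head, pure-Python lists instead of tensors, and hypothetical helper names (`matvec`, `attend`, `decode_without_cache`, `decode_with_cache`). The point is only that, during autoregressive decoding, the keys and values of earlier tokens do not change, so they can be stored once and reused instead of being re-projected at every step.

```python
import math


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(q, keys, values):
    """Scaled dot-product attention for one query over the given keys/values."""
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
    weights = softmax(scores)
    dim = len(values[0])
    return [sum(w * v[j] for w, v in zip(weights, values)) for j in range(dim)]


def decode_without_cache(xs, Wq, Wk, Wv):
    """Baseline: at step t, re-project keys and values for every token 0..t."""
    outputs = []
    for t in range(len(xs)):
        keys = [matvec(Wk, x) for x in xs[: t + 1]]
        values = [matvec(Wv, x) for x in xs[: t + 1]]
        outputs.append(attend(matvec(Wq, xs[t]), keys, values))
    return outputs


def decode_with_cache(xs, Wq, Wk, Wv):
    """Cached: project each token's key and value once, then append to the cache."""
    cached_keys, cached_values = [], []
    outputs = []
    for x in xs:
        cached_keys.append(matvec(Wk, x))    # one new key per step
        cached_values.append(matvec(Wv, x))  # one new value per step
        outputs.append(attend(matvec(Wq, x), cached_keys, cached_values))
    return outputs


if __name__ == "__main__":
    # Toy inputs: three 2-dimensional token embeddings and made-up weights.
    xs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    Wq = [[1.0, 0.0], [0.0, 1.0]]
    Wk = [[0.5, 0.1], [0.2, 0.3]]
    Wv = [[1.0, 0.0], [0.0, 1.0]]
    print(decode_with_cache(xs, Wq, Wk, Wv))
```

Both functions produce the same outputs. The baseline performs a number of key/value projections that grows quadratically with sequence length, while the cached version performs one per token and pays for it with memory that grows linearly in the number of tokens generated.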
Related Pages
ml/efficient_nns.1746597311.txt.gz · Last modified: 2025/05/07 05:55 by jmflanig