ml:conditional_computation
This is an old revision of the document!
Table of Contents
Dynamic NNs and Conditional Computation
Dynamic neural networks use methods such as conditional computation, adaptive computation, dynamic model sparsification, early-exit approaches, etc to build larger models with less compute requirements.
Papers
- Fedus et al 2021 - Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity Uses conditional computation to scale-up the Transformer. Good overview of conditional computation in the introduction.
Workshops
ml/conditional_computation.1659173685.txt.gz · Last modified: 2023/06/15 07:36 (external edit)