Dynamic and Conditional Computation
Papers
- Fedus et al. 2021 - Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity. Uses conditional computation to scale up the Transformer. The introduction gives a good overview of conditional computation.
Workshops
ml/conditional_computation.1659173447.txt.gz · Last modified: 2023/06/15 07:36 (external edit)