paper:megatron-lm_training_multi-billion_parameter_language_models_using_model_parallelism

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

paper/megatron-lm_training_multi-billion_parameter_language_models_using_model_parallelism.txt · Last modified: 2023/06/15 07:36 by 127.0.0.1

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki