paper:megatron-lm_training_multi-billion_parameter_language_models_using_model_parallelism
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
paper/megatron-lm_training_multi-billion_parameter_language_models_using_model_parallelism.txt · Last modified: 2023/06/15 07:36 by 127.0.0.1