We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
2 parents 8a029b0 + 5450b97 commit 819583cCopy full SHA for 819583c
examples/megatron-qwen3/training/run.sh
@@ -63,4 +63,4 @@ PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True NPROC_PER_NODE=$BT_NUM_GPUS NNO
63
--use_precision_aware_optimizer true \
64
--use_hf 1 \
65
--wandb_project qwen3_moe_megatron \
66
- --wandb_exp_name all_training_b10f \
+ --wandb_exp_name all_training_b10f
0 commit comments