1 parent 8a029b0 commit 5450b97
examples/megatron-qwen3/training/run.sh
@@ -63,4 +63,4 @@ PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True NPROC_PER_NODE=$BT_NUM_GPUS NNO
  --use_precision_aware_optimizer true \
  --use_hf 1 \
  --wandb_project qwen3_moe_megatron \
- --wandb_exp_name all_training_b10f \
+ --wandb_exp_name all_training_b10f
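
The commit does not state its motivation. One plausible reading is that the trailing backslash on the final argument line was a stray line continuation. The sketch below illustrates the general shell behavior only; `count_args` is a hypothetical helper, not part of `run.sh`, and the `echo done` line stands in for whatever might follow the command.

```shell
# Hypothetical helper: prints the number of arguments it received.
count_args() { echo "$#"; }

# A trailing backslash joins the next line onto the command,
# so "echo done" is passed as two extra arguments instead of running.
with_backslash=$(count_args --wandb_exp_name all_training_b10f \
echo done)

# Without the backslash, the command ends at the newline.
without_backslash=$(count_args --wandb_exp_name all_training_b10f)

echo "with trailing backslash: $with_backslash arguments"
echo "without trailing backslash: $without_backslash arguments"
```

If the command is the last line of the script, the stray backslash may be harmless today, but it would silently absorb any line added after it.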