mirror of
https://github.com/hiyouga/LLaMA-Factory.git
synced 2025-12-15 03:10:35 +08:00
resolve gradient checkpointing issue.
This commit is contained in:
@@ -31,6 +31,5 @@ python ../../../src/train_bash.py \
|
||||
--use_badam \
|
||||
--switch_mode descending \
|
||||
--badam_verbose 2 \
|
||||
--switch_block_every 50 \
|
||||
--pure_bf16 \
|
||||
--switch_block_every 50
|
||||
|
||||
|
||||
Reference in New Issue
Block a user