mirror of
https://github.com/hiyouga/LLaMA-Factory.git
synced 2025-08-03 04:02:49 +08:00
Usage:
pretrain.sh
sft.sh
->reward.sh
->ppo.sh
sft.sh
->dpo.sh
->predict.sh
Usage:
pretrain.sh
sft.sh
-> reward.sh
-> ppo.sh
sft.sh
-> dpo.sh
-> predict.sh