mirror of
https://github.com/hiyouga/LLaMA-Factory.git
synced 2025-11-05 18:32:14 +08:00
Usage:
pretrain.shsft.sh->reward.sh->ppo.shsft.sh->dpo.sh->predict.sh
Usage:
pretrain.shsft.sh -> reward.sh -> ppo.shsft.sh -> dpo.sh -> predict.sh