Mirror of https://github.com/hiyouga/LLaMA-Factory.git (synced 2025-08-03 20:22:49 +08:00)
Usage (run the scripts on each line in order, from left to right):
- `pretrain.sh`
- `sft.sh` -> `reward.sh` -> `ppo.sh`
- `sft.sh` -> `dpo.sh` -> `predict.sh`
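
The chains above can be run by hand, one script after another. As a minimal sketch, assuming the scripts sit in the current directory and can be run with `bash`, a small helper (the name `run_pipeline` is hypothetical and not part of the repository) runs a chain in order and stops at the first failing stage:

```shell
#!/usr/bin/env bash

# Hypothetical helper: run each script in the order given,
# and stop as soon as one of them exits with a non-zero status.
run_pipeline() {
    local script
    for script in "$@"; do
        echo "running ${script}"
        bash "${script}" || { echo "${script} failed" >&2; return 1; }
    done
}

# Example calls (uncomment one, from the directory holding the scripts):
# run_pipeline pretrain.sh
# run_pipeline sft.sh reward.sh ppo.sh
# run_pipeline sft.sh dpo.sh predict.sh
```

Stopping on the first failure matters here because each later stage presumably depends on the output of the stage before it; whether the scripts need extra arguments or environment variables is not stated in this file.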