LLaMA-Factory/README.md at 8cf9842f7ac252a1fe7e1e3ebd1cfc27e0974cf4 - LLaMA-Factory - Gitea: Git with a cup of tea

423A35C7/LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2025-08-03 20:22:49 +08:00

hiyouga 8cf9842f7a add examples

Former-commit-id: 76f31b18eb4d3724f96ea1bad10073677daee36d

2024-03-05 03:16:35 +08:00

6 lines

101 B

Markdown

Raw Blame History

 Usage:
 - `pretrain.sh`
 - `sft.sh` -> `reward.sh` -> `ppo.sh`
 - `sft.sh` -> `dpo.sh` -> `predict.sh`