Files
LLaMA-Factory/examples/lora_single_gpu
hiyouga 8cf9842f7a add examples
Former-commit-id: 76f31b18eb
2024-03-05 03:16:35 +08:00
..
2024-02-28 23:19:25 +08:00
2024-02-28 23:19:25 +08:00
2024-02-28 23:19:25 +08:00
2024-02-28 23:19:25 +08:00
2024-03-05 03:16:35 +08:00
2024-02-28 23:19:25 +08:00
2024-02-28 23:19:25 +08:00

Usage:

  • pretrain.sh
  • sft.sh -> reward.sh -> ppo.sh
  • sft.sh -> dpo.sh -> predict.sh