11 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| hiyouga | ed584b9f52 | fix reward model loading; Former-commit-id: c52336d14435bc3bd98b6070cc1309b5e7d706c4 | 2023-11-07 17:20:51 +08:00 |
| hiyouga | b446582bfd | fix args; Former-commit-id: d92f112951a8d8b28b180c3e2f504a094a9885dd | 2023-11-07 16:36:06 +08:00 |
| hiyouga | 3d40bdb600 | upgrade peft, fix #1088 #1411; Former-commit-id: b2a60905f384ada92618bf21301fe96dac1c10bf | 2023-11-07 16:13:36 +08:00 |
| hiyouga | 5507014392 | fix bug in data loader, support dpo eval; Former-commit-id: b355f6cac99592b66890ccc04e77a9993de0447d | 2023-11-03 00:34:26 +08:00 |
| hiyouga | b6e81a0307 | fix shift short attention; Former-commit-id: ab65c3063b31b9e6a1aeb62c57224c1296ccdadd | 2023-10-09 17:07:46 +08:00 |
| hiyouga | 6a71361a54 | remove PeftTrainer; Former-commit-id: b218c271edfb07006ddc34b1aca404088de6c528 | 2023-09-10 22:23:23 +08:00 |
| hiyouga | 0d0232479f | fix import error; Former-commit-id: 2de1a7610a78e41680970b9f308741f98df489fa | 2023-08-23 20:45:03 +08:00 |
| hiyouga | 38080233a5 | fix #649; Former-commit-id: 57146c101f3e8f688b016a44c85e8ad5d1b6f938 | 2023-08-23 20:21:15 +08:00 |
| hiyouga | d05d535a58 | fix #617; Former-commit-id: 5235b15c9181f2b68f7d6caa9a6324b8570d3d0c | 2023-08-21 18:16:11 +08:00 |
| hiyouga | fceca0bb6a | update training resuming; Former-commit-id: 58f13e22da18babed0d2d4348474e07745da8fa5 | 2023-08-18 01:41:17 +08:00 |
| hiyouga | abdfa26d06 | support DPO training (2305.18290); Former-commit-id: 3ec4351cfdaf2aefcc7d13345e19d79874ed61d3 | 2023-08-11 03:02:53 +08:00 |