| Author | Commit | Message | Former-commit-id | Date |
|---|---|---|---|---|
| hiyouga | ed584b9f52 | fix reward model loading | c52336d14435bc3bd98b6070cc1309b5e7d706c4 | 2023-11-07 17:20:51 +08:00 |
| hiyouga | b446582bfd | fix args | d92f112951a8d8b28b180c3e2f504a094a9885dd | 2023-11-07 16:36:06 +08:00 |
| hiyouga | 3d40bdb600 | upgrade peft, fix #1088 #1411 | b2a60905f384ada92618bf21301fe96dac1c10bf | 2023-11-07 16:13:36 +08:00 |
| hiyouga | 5507014392 | fix bug in data loader, support dpo eval | b355f6cac99592b66890ccc04e77a9993de0447d | 2023-11-03 00:34:26 +08:00 |
| hiyouga | b6e81a0307 | fix shift short attention | ab65c3063b31b9e6a1aeb62c57224c1296ccdadd | 2023-10-09 17:07:46 +08:00 |
| hiyouga | 6a71361a54 | remove PeftTrainer | b218c271edfb07006ddc34b1aca404088de6c528 | 2023-09-10 22:23:23 +08:00 |
| hiyouga | 0d0232479f | fix import error | 2de1a7610a78e41680970b9f308741f98df489fa | 2023-08-23 20:45:03 +08:00 |
| hiyouga | 38080233a5 | fix #649 | 57146c101f3e8f688b016a44c85e8ad5d1b6f938 | 2023-08-23 20:21:15 +08:00 |
| hiyouga | d05d535a58 | fix #617 | 5235b15c9181f2b68f7d6caa9a6324b8570d3d0c | 2023-08-21 18:16:11 +08:00 |
| hiyouga | fceca0bb6a | update training resuming | 58f13e22da18babed0d2d4348474e07745da8fa5 | 2023-08-18 01:41:17 +08:00 |
| hiyouga | abdfa26d06 | support DPO training (2305.18290) | 3ec4351cfdaf2aefcc7d13345e19d79874ed61d3 | 2023-08-11 03:02:53 +08:00 |