hiyouga | 7ada4f5f6f | support DPO training (2305.18290); Former-commit-id: 6d98de148e4af63a7028dfaeb6cf86eb56a4488f | 2023-08-11 03:02:53 +08:00 |
hiyouga | 1d0ccf63f2 | update trainer; Former-commit-id: 0d39b53a5164e34d22fe0a492eaa0d7ac63102fe | 2023-08-07 13:34:35 +08:00 |
hiyouga | 2fdef55143 | update ppo trainer; Former-commit-id: c27136a83e167465d3f825e40f10c7b9fcfbf97a | 2023-08-02 18:46:41 +08:00 |
hiyouga | ab3f685330 | fix RM save model; Former-commit-id: 8104cc2425431eb1cddccf3909855296116f922b | 2023-08-01 11:56:17 +08:00 |
hiyouga | 44c31d6064 | fix inference; Former-commit-id: 55dc2bdd3eaa552c655e584fc3cbbf017c7bc3e7 | 2023-08-01 00:06:48 +08:00 |
hiyouga | 63123a9098 | support streaming data, fix #284 #274 #268; Former-commit-id: 819cc1353599e5fa45658bc56dd0dbe4b258b197 | 2023-07-31 23:33:00 +08:00 |
hiyouga | 3855557f8c | fix save function; Former-commit-id: 1d6beb0c8490a7531ffdf7a2819410597b200d12 | 2023-07-21 14:09:07 +08:00 |
hiyouga | 1f4a65afea | update web UI, support rm predict #210; Former-commit-id: 92cc6b655dc91b94d5bf9d8618c3b57d5cf94333 | 2023-07-21 13:27:27 +08:00 |
hiyouga | 75a97a3991 | fix callback; Former-commit-id: 065680cd2a410d7ceab10a4a76588df43e286117 | 2023-07-15 17:18:16 +08:00 |
hiyouga | a69b1b1c3a | modify code structure; Former-commit-id: 0682ed357210897e0b67c4a6eb31a94b3eb929f1 | 2023-07-15 16:54:28 +08:00 |