hiyouga
|
8c841d6f76
|
tiny fix
Former-commit-id: 0ee159654ac6339c162745b004e2152ba6fe3c81
|
2023-08-18 13:07:35 +08:00 |
|
hiyouga
|
a8dd39ad08
|
fix PPO trainer #551, update readme
Former-commit-id: faead74849470cebae9e37cde5fab2a71b32aa43
|
2023-08-18 11:43:10 +08:00 |
|
hiyouga
|
a089d2665a
|
fix ChatGLM2 ppo #527 #528
Former-commit-id: 60d6ad64d7c9f6445b0df8de0153c3a311974198
|
2023-08-18 00:34:59 +08:00 |
|
hiyouga
|
bb7028f7e2
|
fix generation bug #532
Former-commit-id: c071121e67374e5f09798db57cfc8668617a36ae
|
2023-08-17 22:21:34 +08:00 |
|
hiyouga
|
22dec02b5f
|
fix generation
Former-commit-id: 66a0300d312ef91c24fcf80667fa3b0bb8e1a342
|
2023-08-16 22:39:54 +08:00 |
|
hiyouga
|
0684a5adc8
|
fix ChatGLM RLHF
Former-commit-id: 4e43e887e432ceb7e9287b4e309b63af3c3ba1bf
|
2023-08-15 11:19:20 +08:00 |
|
hiyouga
|
7ada4f5f6f
|
support DPO training (2305.18290)
Former-commit-id: 6d98de148e4af63a7028dfaeb6cf86eb56a4488f
|
2023-08-11 03:02:53 +08:00 |
|
hiyouga
|
18f73169fd
|
modify code structure
Former-commit-id: 6369f9b1751e6f9bb709ba76a85f69cbe0823e5d
|
2023-08-02 23:17:36 +08:00 |
|
hiyouga
|
f5696a08c1
|
fix PPO trainer
Former-commit-id: 21982a7d4dd9b7c3a1145b481f02b9990e32dc00
|
2023-08-02 19:10:23 +08:00 |
|
hiyouga
|
2fdef55143
|
update ppo trainer
Former-commit-id: c27136a83e167465d3f825e40f10c7b9fcfbf97a
|
2023-08-02 18:46:41 +08:00 |
|
hiyouga
|
e6ba6f6d61
|
fix memory leak of PPO trainer
Former-commit-id: 38410894a5ebf0b043b55a6bd5cca3cd0a44b27d
|
2023-08-02 17:41:34 +08:00 |
|
hiyouga
|
ab3f685330
|
fix RM save model
Former-commit-id: 8104cc2425431eb1cddccf3909855296116f922b
|
2023-08-01 11:56:17 +08:00 |
|
hiyouga
|
63123a9098
|
support streaming data, fix #284 #274 #268
Former-commit-id: 819cc1353599e5fa45658bc56dd0dbe4b258b197
|
2023-07-31 23:33:00 +08:00 |
|
hiyouga
|
40cf2a3092
|
fix API
Former-commit-id: 9b10c9a12e33ab897056ecc61d977d221c19141b
|
2023-07-19 00:01:14 +08:00 |
|
hiyouga
|
ca972000b6
|
update webUI, fix #179
Former-commit-id: f9074fed5e22585679661588befcf266a79009f2
|
2023-07-18 15:35:17 +08:00 |
|
hiyouga
|
53d7d4c221
|
release v0.1.0
Former-commit-id: 63c8d3a17cb18f0d8a8e37bfa147daf5bdd28ea9
|
2023-07-18 00:18:25 +08:00 |
|
hiyouga
|
72058cc816
|
fix #175
Former-commit-id: fd557ebb5e3ef2ca330b4d97731af43f4a5a5fc5
|
2023-07-17 18:07:17 +08:00 |
|
hiyouga
|
75a97a3991
|
fix callback
Former-commit-id: 065680cd2a410d7ceab10a4a76588df43e286117
|
2023-07-15 17:18:16 +08:00 |
|
hiyouga
|
a69b1b1c3a
|
modify code structure
Former-commit-id: 0682ed357210897e0b67c4a6eb31a94b3eb929f1
|
2023-07-15 16:54:28 +08:00 |
|