hiyouga | abdfa26d06 | support DPO training (2305.18290); Former-commit-id: 3ec4351cfd | 2023-08-11 03:02:53 +08:00 |
hiyouga | 6404167ab7 | support val set in streaming mode; Former-commit-id: d86ea314a1 | 2023-08-09 23:00:26 +08:00 |
hiyouga | 4242897b78 | modify code structure; Former-commit-id: 08f180e788 | 2023-08-02 23:17:36 +08:00 |
hiyouga | 4b8e4398bc | fix PPO trainer; Former-commit-id: 1d8a1878ea | 2023-08-02 19:10:23 +08:00 |
hiyouga | 569df8ccd6 | update ppo trainer; Former-commit-id: b5ba87952a | 2023-08-02 18:46:41 +08:00 |
hiyouga | ab739e72ea | fix memory leak of PPO trainer; Former-commit-id: 286f7be346 | 2023-08-02 17:41:34 +08:00 |
hiyouga | c5ad96375e | fix RM save model; Former-commit-id: ac88ce5233 | 2023-08-01 11:56:17 +08:00 |
hiyouga | e80b75b560 | support streaming data, fix #284 #274 #268; Former-commit-id: 0411a4b3e1 | 2023-07-31 23:33:00 +08:00 |
hiyouga | 18656a6316 | fix API; Former-commit-id: 29af67b015 | 2023-07-19 00:01:14 +08:00 |
hiyouga | 0b6f769971 | update webUI, fix #179; Former-commit-id: 12d8a8633f | 2023-07-18 15:35:17 +08:00 |
hiyouga | 091805d38e | release v0.1.0; Former-commit-id: f8193e8009 | 2023-07-18 00:18:25 +08:00 |
hiyouga | 799524b37b | fix #175; Former-commit-id: 85c2210452 | 2023-07-17 18:07:17 +08:00 |
hiyouga | 70b5232f9a | fix callback; Former-commit-id: 22d9a9c2af | 2023-07-15 17:18:16 +08:00 |
hiyouga | a696148d6b | modity code structure; Former-commit-id: f751376613 | 2023-07-15 16:54:28 +08:00 |