Commit Graph

42 Commits

Author SHA1 Message Date
hiyouga
3f0a2d6adc support rope scaling, fix #475 #476 #478
Former-commit-id: fa940c17b8
2023-08-12 20:46:27 +08:00
hiyouga
7bd4c59b7e fix unusual output of 8bit models #278 #391
Former-commit-id: dd51c24203
2023-08-12 00:25:29 +08:00
hiyouga
79f4ba0d26 Release v0.1.6
Former-commit-id: a48cb0d474
2023-08-11 23:25:57 +08:00
hiyouga
abdfa26d06 support DPO training (2305.18290)
Former-commit-id: 3ec4351cfd
2023-08-11 03:02:53 +08:00
hiyouga
6404167ab7 support val set in streaming mode
Former-commit-id: d86ea314a1
2023-08-09 23:00:26 +08:00
hiyouga
b43f37ca19 fix sft trainer
Former-commit-id: df946e6949
2023-08-09 16:35:03 +08:00
hiyouga
77aa9853fb fix tokenizer #417
Former-commit-id: eecc4b2131
2023-08-08 23:59:41 +08:00
hiyouga
70b53d9503 fix bug
Former-commit-id: 4b841a6b35
2023-08-08 17:55:55 +08:00
hiyouga
c796c542c8 fix chatml template #408
Former-commit-id: a9980617f5
2023-08-08 17:44:39 +08:00
hiyouga
871f7de3d0 fix #376
Former-commit-id: 081345baca
2023-08-07 13:58:59 +08:00
hiyouga
77f6647e8f update trainer
Former-commit-id: 220175ab24
2023-08-07 13:34:35 +08:00
hiyouga
2faa1af4eb fix qwen eos token
Former-commit-id: e21ae01356
2023-08-06 13:31:17 +08:00
hiyouga
0328c0e07c fix mtloader
Former-commit-id: a0173c427d
2023-08-03 19:29:02 +08:00
hiyouga
788d1250c1 fix qwen inference
Former-commit-id: 2780792754
2023-08-03 16:31:55 +08:00
hiyouga
9c84c4ed5d support Qwen-7B, fix InternLM-7B inference
Former-commit-id: 87f8f830e2
2023-08-03 15:53:32 +08:00
hiyouga
91d178f14d fix webui
Former-commit-id: e23a3a366c
2023-08-03 12:43:12 +08:00
hiyouga
4242897b78 modify code structure
Former-commit-id: 08f180e788
2023-08-02 23:17:36 +08:00
hiyouga
4b8e4398bc fix PPO trainer
Former-commit-id: 1d8a1878ea
2023-08-02 19:10:23 +08:00
hiyouga
569df8ccd6 update ppo trainer
Former-commit-id: b5ba87952a
2023-08-02 18:46:41 +08:00
hiyouga
ab739e72ea fix memory leak of PPO trainer
Former-commit-id: 286f7be346
2023-08-02 17:41:34 +08:00
hiyouga
c5ad96375e fix RM save model
Former-commit-id: ac88ce5233
2023-08-01 11:56:17 +08:00
hiyouga
aa4335eac7 release v0.1.4
Former-commit-id: 973a638665
2023-08-01 10:08:47 +08:00
hiyouga
e34fc5fd2e fix inference
Former-commit-id: d3a0692d4d
2023-08-01 00:06:48 +08:00
hiyouga
d5d3b2a42f fix arg check
Former-commit-id: 9cb1f119a4
2023-07-31 23:48:57 +08:00
hiyouga
a437424381 update readme
Former-commit-id: 62dca5bb82
2023-07-31 23:42:32 +08:00
hiyouga
e80b75b560 support streaming data, fix #284 #274 #268
Former-commit-id: 0411a4b3e1
2023-07-31 23:33:00 +08:00
hiyouga
c52dd3e86f fix #242
Former-commit-id: 00efa8a07f
2023-07-25 17:04:02 +08:00
hiyouga
ee7aa86312 release v0.1.3
Former-commit-id: 0b6150bc31
2023-07-21 16:48:34 +08:00
hiyouga
daf81288a1 fix save function
Former-commit-id: d2f18197e3
2023-07-21 14:09:07 +08:00
hiyouga
f769c2d3fc update web UI, support rm predict #210
Former-commit-id: ed0e186a13
2023-07-21 13:27:27 +08:00
hiyouga
64b4f71673 simplify code
Former-commit-id: 67a2773074
2023-07-20 15:08:57 +08:00
hiyouga
106802390b tiny fix
Former-commit-id: d1d8e8bae1
2023-07-19 22:53:46 +08:00
hiyouga
a12785fd9b fix #199
Former-commit-id: d111e658a2
2023-07-19 22:51:29 +08:00
hiyouga
1a23cb2578 fix #196
Former-commit-id: 925a790bc9
2023-07-19 17:35:38 +08:00
hiyouga
18656a6316 fix API
Former-commit-id: 29af67b015
2023-07-19 00:01:14 +08:00
hiyouga
af37ac077c support dev set in web ui
Former-commit-id: fe2887ca13
2023-07-18 20:40:49 +08:00
hiyouga
0b6f769971 update webUI, fix #179
Former-commit-id: 12d8a8633f
2023-07-18 15:35:17 +08:00
hiyouga
091805d38e release v0.1.0
Former-commit-id: f8193e8009
2023-07-18 00:18:25 +08:00
hiyouga
799524b37b fix #175
Former-commit-id: 85c2210452
2023-07-17 18:07:17 +08:00
hiyouga
c4f1d98a1c fix saving custom code
Former-commit-id: 1e1358431d
2023-07-16 18:04:41 +08:00
hiyouga
70b5232f9a fix callback
Former-commit-id: 22d9a9c2af
2023-07-15 17:18:16 +08:00
hiyouga
a696148d6b modity code structure
Former-commit-id: f751376613
2023-07-15 16:54:28 +08:00