70 Commits

Author SHA1 Message Date
hiyouga
a4fd976048 refactor dataset_attr, add eos in pt, fix #757
Former-commit-id: a9d1fb72f7
2023-09-01 19:00:45 +08:00
codemayq
ea74e5a81b update llama2 template
Former-commit-id: 0bcc489c42
2023-08-30 16:23:56 +08:00
codemayq
4b29d9d2b0 add dataset stage and filter dataset when stage chosen in webui
Former-commit-id: c0e4d1e81b
2023-08-23 18:54:23 +08:00
hiyouga
802494e20a update template
Former-commit-id: 4318347d3f
2023-08-22 19:46:09 +08:00
hiyouga
e6f4eab4ab fix #608
Former-commit-id: 02d69b6fde
2023-08-21 17:49:36 +08:00
hiyouga
d3bef03dc6 fix baichuan template for training #597 #616
Former-commit-id: 0a3f698425
2023-08-21 17:41:51 +08:00
hiyouga
caf4a61e21 fix ChatGLM2 ppo #527 #528
Former-commit-id: 9f4c2adc9a
2023-08-18 00:34:59 +08:00
hiyouga
623a34b16f fix generation bug #532
Former-commit-id: be21fc83f9
2023-08-17 22:21:34 +08:00
hiyouga
3021a01b71 fix baichuan and intern template
Former-commit-id: 892fd39373
2023-08-17 01:27:20 +08:00
hiyouga
edc15c62fa fix system prompt
Former-commit-id: 7407d9daa1
2023-08-16 01:35:52 +08:00
hiyouga
2ceaecfb42 fix baichuan template #481
Former-commit-id: 273135f595
2023-08-15 11:38:21 +08:00
hiyouga
d15fe288df alert pad_token source
Former-commit-id: 80b4053602
2023-08-15 00:07:56 +08:00
hiyouga
02a61b08b1 update webui
Former-commit-id: 9d0f6214b6
2023-08-14 22:45:26 +08:00
codemayq
ee7da14f81 add template match and stage in webui
Former-commit-id: 79c68e5527
2023-08-14 20:42:59 +08:00
hiyouga
3f0a2d6adc support rope scaling, fix #475 #476 #478
Former-commit-id: fa940c17b8
2023-08-12 20:46:27 +08:00
codemayq
3ba1b81105 add sft script preview in webui
Former-commit-id: 6bc8e9866d
2023-08-12 13:53:55 +08:00
hiyouga
7bd4c59b7e fix unusual output of 8bit models #278 #391
Former-commit-id: dd51c24203
2023-08-12 00:25:29 +08:00
hiyouga
79f4ba0d26 Release v0.1.6
Former-commit-id: a48cb0d474
2023-08-11 23:25:57 +08:00
hiyouga
21bf79e72b add defaults
Former-commit-id: d3844e97e3
2023-08-11 13:56:26 +08:00
hiyouga
eb26bfc2ba fix stop word in baichuan template
Former-commit-id: d59f938959
2023-08-11 13:51:46 +08:00
hiyouga
f1485ab927 fix baichuan template
Former-commit-id: 9c6dd10514
2023-08-11 13:45:47 +08:00
hiyouga
abdfa26d06 support DPO training (2305.18290)
Former-commit-id: 3ec4351cfd
2023-08-11 03:02:53 +08:00
hiyouga
0dc9b41b16 fix template
Former-commit-id: eb6e571cb7
2023-08-09 23:14:27 +08:00
hiyouga
ce9ffca0d9 fix template
Former-commit-id: ac29f4d5f0
2023-08-09 23:10:20 +08:00
hiyouga
6404167ab7 support val set in streaming mode
Former-commit-id: d86ea314a1
2023-08-09 23:00:26 +08:00
hiyouga
d01c1231ed fix tokenizer
Former-commit-id: 572ea3bafb
2023-08-09 17:52:15 +08:00
hiyouga
28a807472b fix rm #420, fix template #426, fix #423
Former-commit-id: 39cd8b6989
2023-08-09 16:23:31 +08:00
hoshi-hiyouga
c3eb40b971 fix llama2 template
Former-commit-id: 2d90685358
2023-08-09 00:58:27 +08:00
hoshi-hiyouga
a37e1c11c9 fix tokenizer
Former-commit-id: 32fa5e8d70
2023-08-09 00:54:54 +08:00
hiyouga
4f714ba314 update webui
Former-commit-id: 3a720aac66
2023-08-09 00:26:11 +08:00
hiyouga
77aa9853fb fix tokenizer #417
Former-commit-id: eecc4b2131
2023-08-08 23:59:41 +08:00
hiyouga
70b53d9503 fix bug
Former-commit-id: 4b841a6b35
2023-08-08 17:55:55 +08:00
hiyouga
c796c542c8 fix chatml template #408
Former-commit-id: a9980617f5
2023-08-08 17:44:39 +08:00
hiyouga
733b395822 update readme
Former-commit-id: 20cf27976f
2023-08-07 15:02:02 +08:00
hiyouga
39955c28ff fix qwen tokenizer #361
Former-commit-id: 7f18d2a335
2023-08-05 17:06:05 +08:00
hiyouga
45af1a951f fix template for tiktoken
Former-commit-id: 1afa51c2fa
2023-08-05 13:42:42 +08:00
hiyouga
1a1caf2116 remove redundant code
Former-commit-id: 53d95725c5
2023-08-05 00:27:27 +08:00
hiyouga
ab95f569a4 fix template
Former-commit-id: c183b3551d
2023-08-05 00:25:00 +08:00
hiyouga
7a89fce4c7 fix llama2 template
Former-commit-id: e4a15f863c
2023-08-05 00:07:54 +08:00
hiyouga
65369ecf48 fix bos and eos token
Former-commit-id: d87c8fd8ab
2023-08-04 23:55:57 +08:00
hiyouga
dbb284b5a2 fix encode
Former-commit-id: 8172ad1b5e
2023-08-04 23:27:55 +08:00
hiyouga
ea045b0e5b support chatml safe encoding
Former-commit-id: b4852f9406
2023-08-04 23:14:28 +08:00
hiyouga
b32ed1d7be support interleave probs
Former-commit-id: 69744c17e8
2023-08-04 21:27:35 +08:00
hiyouga
2d96ec9c3e tiny fix
Former-commit-id: ff98f1cba8
2023-08-03 17:42:28 +08:00
hiyouga
9c84c4ed5d support Qwen-7B, fix InternLM-7B inference
Former-commit-id: 87f8f830e2
2023-08-03 15:53:32 +08:00
hiyouga
4242897b78 modify code structure
Former-commit-id: 08f180e788
2023-08-02 23:17:36 +08:00
hiyouga
534e3320b5 release v0.1.5
Former-commit-id: c689857bbb
2023-08-02 16:10:31 +08:00
YC Chen
bb2b38a31f [fix] Remove useless code
Former-commit-id: ca125da0eb
2023-08-02 14:35:35 +08:00
YC Chen
bf844e8a99 [feature] Fix template of Llama2 to match the offical template
Former-commit-id: 4323773089
2023-08-02 14:10:15 +08:00
hiyouga
5c7337d6f3 Fix #294
Former-commit-id: e6a3894b99
2023-08-01 18:13:03 +08:00