Commit Graph

575 Commits

Author SHA1 Message Date
beat4ocean
c81d1d40fa fix KeyError: 'lang' bug
Former-commit-id: 7b45de6b9f
2023-08-20 15:32:36 +08:00
hiyouga
570ccc3618 fix ppo trainer #551
Former-commit-id: 0676497104
2023-08-20 14:07:11 +08:00
hiyouga
acaac6df9e Release v0.1.7
Former-commit-id: 9c9009f49f
2023-08-18 17:21:27 +08:00
hiyouga
9f1688924d tiny fix
Former-commit-id: d75e377b0f
2023-08-18 13:07:35 +08:00
hiyouga
b88f0b396c support ppo score norm (trl 0.5.1.dev required)
Former-commit-id: 53e33418d0
2023-08-18 12:02:42 +08:00
hiyouga
03edfd07e7 fix PPO trainer #551 , update readme
Former-commit-id: 9020524418
2023-08-18 11:43:10 +08:00
hiyouga
fceca0bb6a update training resuming
Former-commit-id: 58f13e22da
2023-08-18 01:41:17 +08:00
hoshi-hiyouga
49d4ae3704 Merge branch 'main' into main
Former-commit-id: 7252903245
2023-08-18 01:37:23 +08:00
hiyouga
66771352bb support bf16 ppo #551
Former-commit-id: d125218cde
2023-08-18 00:40:32 +08:00
hiyouga
caf4a61e21 fix ChatGLM2 ppo #527 #528
Former-commit-id: 9f4c2adc9a
2023-08-18 00:34:59 +08:00
hiyouga
623a34b16f fix generation bug #532
Former-commit-id: be21fc83f9
2023-08-17 22:21:34 +08:00
hiyouga
a46f277477 fix streaming in pt stage #548 #549
Former-commit-id: b0ed0dec5e
2023-08-17 17:59:26 +08:00
hiyouga
3021a01b71 fix baichuan and intern template
Former-commit-id: 892fd39373
2023-08-17 01:27:20 +08:00
hiyouga
048f99354f fix generation
Former-commit-id: d9e62711a3
2023-08-16 22:39:54 +08:00
hiyouga
edc15c62fa fix system prompt
Former-commit-id: 7407d9daa1
2023-08-16 01:35:52 +08:00
hiyouga
2ceaecfb42 fix baichuan template #481
Former-commit-id: 273135f595
2023-08-15 11:38:21 +08:00
hiyouga
a9ab8f71d7 fix ChatGLM RLHF
Former-commit-id: af6c011fcb
2023-08-15 11:19:20 +08:00
hiyouga
d15fe288df alert pad_token source
Former-commit-id: 80b4053602
2023-08-15 00:07:56 +08:00
hiyouga
02a61b08b1 update webui
Former-commit-id: 9d0f6214b6
2023-08-14 22:45:26 +08:00
hoshi-hiyouga
d021d31a9c Merge pull request #511 from hiyouga/feature-autoTemplate
add template match and stage in webui

Former-commit-id: adb0f186e9
2023-08-14 22:44:04 +08:00
codemayq
4a4623cf2d auto match template when change model_name
Former-commit-id: 0bf892ff1a
2023-08-14 20:56:05 +08:00
codemayq
ee7da14f81 add template match and stage in webui
Former-commit-id: 79c68e5527
2023-08-14 20:42:59 +08:00
hiyouga
2aa2c363ad fix ChatGLM lm_head #494
Former-commit-id: d019956808
2023-08-14 14:14:48 +08:00
hiyouga
2f70f96463 fix bug in webui
Former-commit-id: 20a29297b1
2023-08-14 11:38:42 +08:00
hiyouga
307c814f38 fix webui cache
Former-commit-id: ca08e5efd3
2023-08-14 11:37:01 +08:00
hiyouga
6c9b035c0e web UI integrating RLHF
Former-commit-id: ec94274ca1
2023-08-14 10:48:47 +08:00
hiyouga
e75024fde3 fix #480
Former-commit-id: 2f2fd55d81
2023-08-14 00:23:56 +08:00
hiyouga
7984ae8b62 fix webui
Former-commit-id: d69b1388e6
2023-08-12 23:52:07 +08:00
hiyouga
e1b43dfc7f tiny fix
Former-commit-id: 9dc6a296e3
2023-08-12 22:02:43 +08:00
hiyouga
bd611e0090 fix rope scaling
Former-commit-id: 8545c11c45
2023-08-12 22:00:01 +08:00
hiyouga
ba65dcb15e update readme
Former-commit-id: 1836c020c5
2023-08-12 21:00:11 +08:00
hiyouga
3f0a2d6adc support rope scaling, fix #475 #476 #478
Former-commit-id: fa940c17b8
2023-08-12 20:46:27 +08:00
codemayq
3ba1b81105 add sft script preview in webui
Former-commit-id: 6bc8e9866d
2023-08-12 13:53:55 +08:00
hiyouga
7bd4c59b7e fix unusual output of 8bit models #278 #391
Former-commit-id: dd51c24203
2023-08-12 00:25:29 +08:00
hiyouga
79f4ba0d26 Release v0.1.6
Former-commit-id: a48cb0d474
2023-08-11 23:25:57 +08:00
hiyouga
21bf79e72b add defaults
Former-commit-id: d3844e97e3
2023-08-11 13:56:26 +08:00
hiyouga
eb26bfc2ba fix stop word in baichuan template
Former-commit-id: d59f938959
2023-08-11 13:51:46 +08:00
hiyouga
f1485ab927 fix baichuan template
Former-commit-id: 9c6dd10514
2023-08-11 13:45:47 +08:00
hiyouga
abdfa26d06 support DPO training (2305.18290)
Former-commit-id: 3ec4351cfd
2023-08-11 03:02:53 +08:00
hoshi-hiyouga
6c32b6922b Merge pull request #451 from jovialchen/main
huggingface login for projects must login while running

Former-commit-id: 685dae4eff
2023-08-10 17:25:38 +08:00
hiyouga
58e95776e1 fix webui val size
Former-commit-id: ad6e7c76c7
2023-08-10 15:20:44 +08:00
jiongxuc
7ffd961b8b huggingface login for projects must login while running
Former-commit-id: 3e000c2b60
2023-08-10 14:57:12 +08:00
hiyouga
0dc9b41b16 fix template
Former-commit-id: eb6e571cb7
2023-08-09 23:14:27 +08:00
hiyouga
ce9ffca0d9 fix template
Former-commit-id: ac29f4d5f0
2023-08-09 23:10:20 +08:00
hiyouga
6404167ab7 support val set in streaming mode
Former-commit-id: d86ea314a1
2023-08-09 23:00:26 +08:00
hiyouga
d01c1231ed fix tokenizer
Former-commit-id: 572ea3bafb
2023-08-09 17:52:15 +08:00
niuba
53a89f53aa add last_checkpoint support
Former-commit-id: 2ec68d3398
2023-08-09 16:39:27 +08:00
hiyouga
b43f37ca19 fix sft trainer
Former-commit-id: df946e6949
2023-08-09 16:35:03 +08:00
hiyouga
28a807472b fix rm #420, fix template #426, fix #423
Former-commit-id: 39cd8b6989
2023-08-09 16:23:31 +08:00
hoshi-hiyouga
c3eb40b971 fix llama2 template
Former-commit-id: 2d90685358
2023-08-09 00:58:27 +08:00