hiyouga | d8cf8cfdeb | fix SFT trainer | 2023-10-31 21:52:52 +08:00
hiyouga | f4e4a04529 | fix #1316 | 2023-10-31 11:32:08 +08:00
hiyouga | 7b4acf7265 | reimplement neftune | 2023-10-22 16:15:08 +08:00
anvie | 57fb40aa04 | add NEFTune optimization | 2023-10-21 13:24:10 +07:00
hiyouga | ab65c3063b | fix shift short attention | 2023-10-09 17:07:46 +08:00
hiyouga | dbaef776a1 | tiny fix | 2023-09-21 19:52:06 +08:00
hiyouga | b218c271ed | remove PeftTrainer | 2023-09-10 22:23:23 +08:00
hiyouga | d8aa1404be | support FlashAttention2 | 2023-09-10 20:43:56 +08:00
hiyouga | 815b92e698 | fix #850 | 2023-09-10 14:22:03 +08:00
hiyouga | 8ea32e4046 | change to right-padding, update reward score #803 | 2023-09-08 20:04:31 +08:00
hiyouga | 85b1f6632a | fix baichuan templates | 2023-09-07 18:54:14 +08:00
hiyouga | 2de1a7610a | fix import error | 2023-08-23 20:45:03 +08:00
hiyouga | 57146c101f | fix #649 | 2023-08-23 20:21:15 +08:00
hiyouga | 9c9009f49f | Release v0.1.7 | 2023-08-18 17:21:27 +08:00
hiyouga | 58f13e22da | update training resuming | 2023-08-18 01:41:17 +08:00
hoshi-hiyouga | 7252903245 | Merge branch 'main' into main | 2023-08-18 01:37:23 +08:00
hiyouga | 9f4c2adc9a | fix ChatGLM2 ppo #527 #528 | 2023-08-18 00:34:59 +08:00
hiyouga | be21fc83f9 | fix generation bug #532 | 2023-08-17 22:21:34 +08:00
hiyouga | b0ed0dec5e | fix streaming in pt stage #548 #549 | 2023-08-17 17:59:26 +08:00
hiyouga | d9e62711a3 | fix generation | 2023-08-16 22:39:54 +08:00
hiyouga | 3ec4351cfd | support DPO training (2305.18290) | 2023-08-11 03:02:53 +08:00
hiyouga | d86ea314a1 | support val set in streaming mode | 2023-08-09 23:00:26 +08:00
niuba | 2ec68d3398 | add last_checkpoint support | 2023-08-09 16:39:27 +08:00
hiyouga | df946e6949 | fix sft trainer | 2023-08-09 16:35:03 +08:00
hiyouga | 08f180e788 | modify code structure | 2023-08-02 23:17:36 +08:00
hiyouga | 0411a4b3e1 | support streaming data, fix #284 #274 #268 | 2023-07-31 23:33:00 +08:00
hiyouga | 67a2773074 | simplify code | 2023-07-20 15:08:57 +08:00
hiyouga | 925a790bc9 | fix #196 | 2023-07-19 17:35:38 +08:00
hiyouga | fe2887ca13 | support dev set in web ui | 2023-07-18 20:40:49 +08:00
hiyouga | 85c2210452 | fix #175 | 2023-07-17 18:07:17 +08:00
hiyouga | f751376613 | modity code structure | 2023-07-15 16:54:28 +08:00