hiyouga
|
f902b0d420
|
refactor adapter hparam
Former-commit-id: f82aece9ebd6df83a7a005cc7cbbcec07fa6e14d
|
2023-12-15 20:53:11 +08:00 |
|
hiyouga
|
27ef5b1aa7
|
add loftq
Former-commit-id: 0b900882ef19ac49604a24fbae8b3254f1bff7ad
|
2023-12-14 21:53:56 +08:00 |
|
hiyouga
|
c32303fc7e
|
fix valuehead model
Former-commit-id: 9f628debb6510f2d1c91b00f121a721ab5d648e9
|
2023-12-14 20:15:20 +08:00 |
|
hoshi-hiyouga
|
45abe361ba
|
tiny fix
Former-commit-id: 987df4c62f34026adfe2089910f4ff9ac6ebd9a6
|
2023-12-13 17:32:36 +08:00 |
|
hoshi-hiyouga
|
3ae479faae
|
revert peft version
Former-commit-id: 6440fa1a8c28fd2db58d0905a67d071837e0edd1
|
2023-12-13 10:49:45 +08:00 |
|
hoshi-hiyouga
|
5698038f49
|
update peft version
Former-commit-id: 31c01e1272bd2cd9588e5ee68c1924a3dd55c67e
|
2023-12-13 10:23:51 +08:00 |
|
hoshi-hiyouga
|
020233f725
|
tiny fix
Former-commit-id: 1478bc052417e0939188f55a0adcbf00956960f2
|
2023-12-13 10:21:29 +08:00 |
|
hoshi-hiyouga
|
6f9d55b8eb
|
fix #1819
Former-commit-id: f2e2b0354cbe9a7190ccab807f690cc8ab433a6e
|
2023-12-13 10:14:01 +08:00 |
|
hiyouga
|
2542b62d77
|
remove loftq
Former-commit-id: e175c0a1c631296117abda2403a4b87bbdd35a66
|
2023-12-13 01:53:46 +08:00 |
|
hiyouga
|
e39bbdd287
|
support loftq
Former-commit-id: e7ac2eb7f7daae17525a278ffbe2f82c0fbd8093
|
2023-12-12 22:47:06 +08:00 |
|
hiyouga
|
934d00ea1e
|
support system column #1765
Former-commit-id: f425584a511c5e42bae8b3ba090eaa898b28adad
|
2023-12-12 19:45:59 +08:00 |
|
hiyouga
|
c27675f70d
|
fix modelscope data hub
Former-commit-id: 5b63e8c22538a4788e4b6c8df50e6e6be93ceeac
|
2023-12-12 18:33:06 +08:00 |
|
hiyouga
|
9e2cc21d04
|
update readme
Former-commit-id: 42e042a4206aeb5177ddde56386e9655b0c06460
|
2023-12-12 11:44:30 +08:00 |
|
hiyouga
|
6975124a57
|
support mixtral
Former-commit-id: 75b5b8e36ab1933b2625f11b645f56cbc805fd85
|
2023-12-12 11:39:04 +08:00 |
|
hiyouga
|
9f69307db1
|
fix baichuan resize
Former-commit-id: 66956d13074a9bc74d7a737b9476f38361a7764a
|
2023-12-11 20:55:50 +08:00 |
|
hiyouga
|
c3448a045c
|
tiny fix
Former-commit-id: 1f839fc4f278c2a258df22899241fc66a2cca682
|
2023-12-11 18:09:40 +08:00 |
|
hiyouga
|
95c561983c
|
support resize embeddings #1786
Former-commit-id: 368a41bd3c6a04f869083058d9165954fbdad105
|
2023-12-11 17:50:02 +08:00 |
|
hiyouga
|
7a03c8dab5
|
use peft 0.7.0, fix #1561 #1764
Former-commit-id: 423947bd58aa50da8785b8ceca1e7e288447a9da
|
2023-12-11 17:13:40 +08:00 |
|
hiyouga
|
2e6ed731cf
|
fix #1771 and temporarily fix #1764
Former-commit-id: d0e5a5d604e16c2fe0035b0ac1d54dc3625d4da3
|
2023-12-08 16:26:20 +08:00 |
|
hiyouga
|
fb4c5f3c91
|
fix #1715
Former-commit-id: 3f9192dbbbafdc2171d2eb80282d5cae47565b7b
|
2023-12-03 22:35:47 +08:00 |
|
hiyouga
|
29545d0e5e
|
implement rm server #1543
Former-commit-id: 2e5bb6888c86079493456c2ddd525f8c52b9963e
|
2023-12-03 20:52:54 +08:00 |
|
hiyouga
|
4a14099cfd
|
fix #1707 #1710
Former-commit-id: 243a596518ad69cf1eec20a082534b9e94353ce4
|
2023-12-03 11:33:12 +08:00 |
|
hiyouga
|
5ea6a7c6d6
|
fix #1642
Former-commit-id: 11be28201f688ac21cf94135067d37e9aa7ab0a1
|
2023-12-02 00:37:53 +08:00 |
|
hiyouga
|
5f572cbd77
|
fix gptq training
Former-commit-id: bec58e3dc575aa4247e563881a456328ee5ef496
|
2023-12-02 00:27:15 +08:00 |
|
hiyouga
|
679bd3ab30
|
tiny fix
Former-commit-id: fd2782a06ba4efa76cacbb49eb76a05de8d8aca6
|
2023-12-01 23:37:10 +08:00 |
|
hiyouga
|
da3d59fada
|
fix gptq model inference
Former-commit-id: f7da9a87cb48cacb7d56322817b05d6f471f6508
|
2023-12-01 23:34:14 +08:00 |
|
hiyouga
|
72bbd5bdef
|
patch modelscope
Former-commit-id: 8888cf53f040f5a2d8c0e59cddf79b252449bf58
|
2023-12-01 22:53:15 +08:00 |
|
yuze.zyz
|
d323ccc3ec
|
add readme
Former-commit-id: 3d5ec6f12b4ae7d04520e6865516a9a6dd4f7efe
|
2023-12-01 16:11:30 +08:00 |
|
yuze.zyz
|
6933c1fed2
|
fix
Former-commit-id: e8774b4c9cbc8f894621ec72957f720d5c83d22b
|
2023-11-29 21:43:58 +08:00 |
|
yuze.zyz
|
9d125bf533
|
support ms
Former-commit-id: fdd4f94f563110ef9f96ab4a7fd954def32e9785
|
2023-11-29 20:36:55 +08:00 |
|
hiyouga
|
670ee3934f
|
fix #1659
Former-commit-id: e4123129aae59f4123d53c1f5320e3d5e09ae26d
|
2023-11-28 20:52:28 +08:00 |
|
hiyouga
|
0105cd48f2
|
support GPTQ tuning #729 #1481 #1545, fix chatglm template #1453 #1480 #1569
Former-commit-id: fdccc6cc9b68890199e9250cabdb996ff2f853b9
|
2023-11-20 22:52:11 +08:00 |
|
hiyouga
|
28258aecd2
|
update ppo trainer
Former-commit-id: caa525a5c6f228b9ad71387d1fe4f1c2ffa2479e
|
2023-11-20 21:39:15 +08:00 |
|
hiyouga
|
bcd661afa6
|
fix value head model resuming
Former-commit-id: ccf0b65d886c09c7c49977c43b0544fe1bfcc258
|
2023-11-20 19:01:37 +08:00 |
|
hiyouga
|
adf2730d1d
|
fix #1567
Former-commit-id: 8c01ffe8d277d49a413571e0669f460c8d0802bf
|
2023-11-20 18:46:36 +08:00 |
|
hiyouga
|
d2ff09a404
|
fix model card network issue
Former-commit-id: 36155cd1893bea036f15c648c06b0047c02dfb4f
|
2023-11-19 23:03:19 +08:00 |
|
hiyouga
|
6889f044fb
|
fix #1558
Former-commit-id: 263b2b24c8a649b51fa5ae768a24e67def8e0e96
|
2023-11-19 14:15:47 +08:00 |
|
hiyouga
|
3d1ee27ccd
|
fix evaluator and cached_file in 4.31.0
Former-commit-id: 970897da402f604220d45084d492de4dab809ba4
|
2023-11-18 19:39:23 +08:00 |
|
hiyouga
|
a7bf0b85d7
|
fix quantization
Former-commit-id: 8268aefe8fba268065e24ffe159a9c49f7c6f3a5
|
2023-11-17 22:21:29 +08:00 |
|
hiyouga
|
5ce5ea84a9
|
fix #1550
Former-commit-id: c12acd21a5a500892ed739c79327ccd39fddad5b
|
2023-11-17 17:23:13 +08:00 |
|
hiyouga
|
dab9385297
|
fix bug in web ui
Former-commit-id: a598f145ec903dd2b2c984d951b6c450b142ece5
|
2023-11-16 15:21:24 +08:00 |
|
hiyouga
|
f9d4e37b3c
|
fix bug in freeze tuning
Former-commit-id: f6b436a08421ca17d64abc51497f4aa43729a43b
|
2023-11-16 14:25:11 +08:00 |
|
hiyouga
|
de3a84ac59
|
fix rlhf callback
Former-commit-id: f5485452d660caef56474cb7dc37abbe4f34599e
|
2023-11-16 03:26:19 +08:00 |
|
hiyouga
|
e017266b98
|
fix bug in PPO training
Former-commit-id: 2e99f0e53ce6de0acbcab85dd50aef874e8c6336
|
2023-11-16 02:32:54 +08:00 |
|
hiyouga
|
f81a8a5e5c
|
fix import bug
Former-commit-id: 2356029cdd120d5f7bf630b80681ce8c53bff90d
|
2023-11-16 02:27:03 +08:00 |
|
hiyouga
|
7a3a0144a5
|
support full-parameter PPO
Former-commit-id: 4af967d69475e1c9fdf1a7983cd6b83bd431abff
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
b2ac8376e1
|
support multiple modules in freeze training #1514
Former-commit-id: 60abac70dfd778df2ae8b3a2e960ed8b607d7ab6
|
2023-11-15 17:08:18 +08:00 |
|
hiyouga
|
09a4474e7f
|
disentangle model from tuner and rename modules
Former-commit-id: 02cbf91e7e424f8379c1fed01b82a5f7a83b6947
|
2023-11-15 16:29:09 +08:00 |
|