A-Cepheus | 18ad259fb3 | fix: ZeRO3 does not work with MoE models | Former-commit-id: b2844c049a88ea89f8e1812e2d2e8662b4002965 | 2024-01-22 15:21:14 +08:00
hiyouga | 66e0e651b9 | format style | Former-commit-id: 53b683531b83cd1d19de97c6565f16c1eca6f5e1 | 2024-01-20 20:15:56 +08:00
hiyouga | a423274fd9 | support function calling | Former-commit-id: 66533b3f65babf2429c92c0f8fafe4eff5e0ff63 | 2024-01-18 09:54:23 +08:00
hiyouga | ccc5b324fe | fix rm server | Former-commit-id: 81bc1638682a9fd01518f9f25250a6b584d2a9e6 | 2024-01-03 15:30:46 +08:00
hiyouga | ebb32e85f8 | fix version | Former-commit-id: dd7500b65d0d548441eece101b60d51fa619cc0f | 2023-12-29 04:53:36 +08:00
hiyouga | 921f593632 | update loader | Former-commit-id: 080d8eab858217ca58bffe719d5ffde7579c5bda | 2023-12-24 19:10:23 +08:00
hiyouga | 940403720a | update patcher | Former-commit-id: d6d7b6670847ce4ea10353c5b126214542b45c2b | 2023-12-23 15:24:27 +08:00
hiyouga | 306a70c7ba | fix unsloth dtype | Former-commit-id: fd22e6546ce5f38a6a075cf894aafc3d206b2fcd | 2023-12-23 01:59:49 +08:00
hiyouga | 6faf9c35a9 | support unsloth | Former-commit-id: b857f00234b90b785d82ca7cdb29af3d948b1a7b | 2023-12-23 00:14:33 +08:00
ShaneTian | d05febe5de | Fix slow model initialization in bfloat16 dtype. | Former-commit-id: cf2e2f6f9b7f09b1e2faf6fbc413e3f62e3846c7 | 2023-12-22 16:27:28 +08:00
hiyouga | 31cbc67986 | match version | Former-commit-id: 16db52522584a8e084d4db2a7c253c8b88f27371 | 2023-12-20 22:17:35 +08:00
hiyouga | cc16ece283 | fix #1900 | Former-commit-id: 4c35214396f873588562606b084740b6581188d9 | 2023-12-19 17:21:46 +08:00
hiyouga | 790a31404a | fix #1742 | Former-commit-id: efbb32afdcf0d6aa4ca26f54c95f76dbb84f77dc | 2023-12-16 20:50:45 +08:00
hiyouga | d81ad2d4bc | support dpo-ftx | Former-commit-id: 86dfa04f9821556019fa777106787f73eb70b452 | 2023-12-16 19:21:41 +08:00
hiyouga | 296711d502 | support quantization in export model | Former-commit-id: f32500ae6edccab7d14df4c92467e15986866def | 2023-12-15 23:44:50 +08:00
hiyouga | 33521fb45e | fix bug | Former-commit-id: 95ac272907a04a64785f928536de1fd099150f92 | 2023-12-15 21:54:02 +08:00
hiyouga | e5204e60ed | fix bug | Former-commit-id: 8b80baf02cfece53527c27712f0899fa3532c414 | 2023-12-15 21:49:26 +08:00
hiyouga | 0409428d87 | add configurer | Former-commit-id: c40c9889615ffb49c7ce24c69c0d3d20d841c800 | 2023-12-15 21:46:40 +08:00
hiyouga | f902b0d420 | refactor adapter hparam | Former-commit-id: f82aece9ebd6df83a7a005cc7cbbcec07fa6e14d | 2023-12-15 20:53:11 +08:00
hiyouga | 27ef5b1aa7 | add loftq | Former-commit-id: 0b900882ef19ac49604a24fbae8b3254f1bff7ad | 2023-12-14 21:53:56 +08:00
hiyouga | c32303fc7e | fix valuehead model | Former-commit-id: 9f628debb6510f2d1c91b00f121a721ab5d648e9 | 2023-12-14 20:15:20 +08:00
hoshi-hiyouga | 3ae479faae | revert peft version | Former-commit-id: 6440fa1a8c28fd2db58d0905a67d071837e0edd1 | 2023-12-13 10:49:45 +08:00
hoshi-hiyouga | 5698038f49 | update peft version | Former-commit-id: 31c01e1272bd2cd9588e5ee68c1924a3dd55c67e | 2023-12-13 10:23:51 +08:00
hiyouga | 2542b62d77 | remove loftq | Former-commit-id: e175c0a1c631296117abda2403a4b87bbdd35a66 | 2023-12-13 01:53:46 +08:00
hiyouga | e39bbdd287 | support loftq | Former-commit-id: e7ac2eb7f7daae17525a278ffbe2f82c0fbd8093 | 2023-12-12 22:47:06 +08:00
hiyouga | 934d00ea1e | support system column #1765 | Former-commit-id: f425584a511c5e42bae8b3ba090eaa898b28adad | 2023-12-12 19:45:59 +08:00
hiyouga | c27675f70d | fix modelscope data hub | Former-commit-id: 5b63e8c22538a4788e4b6c8df50e6e6be93ceeac | 2023-12-12 18:33:06 +08:00
hiyouga | 6975124a57 | support mixtral | Former-commit-id: 75b5b8e36ab1933b2625f11b645f56cbc805fd85 | 2023-12-12 11:39:04 +08:00
hiyouga | 95c561983c | support resize embeddings #1786 | Former-commit-id: 368a41bd3c6a04f869083058d9165954fbdad105 | 2023-12-11 17:50:02 +08:00
hiyouga | 7a03c8dab5 | use peft 0.7.0, fix #1561 #1764 | Former-commit-id: 423947bd58aa50da8785b8ceca1e7e288447a9da | 2023-12-11 17:13:40 +08:00
hiyouga | 2e6ed731cf | fix #1771 and temporarily fix #1764 | Former-commit-id: d0e5a5d604e16c2fe0035b0ac1d54dc3625d4da3 | 2023-12-08 16:26:20 +08:00
hiyouga | fb4c5f3c91 | fix #1715 | Former-commit-id: 3f9192dbbbafdc2171d2eb80282d5cae47565b7b | 2023-12-03 22:35:47 +08:00
hiyouga | 29545d0e5e | implement rm server #1543 | Former-commit-id: 2e5bb6888c86079493456c2ddd525f8c52b9963e | 2023-12-03 20:52:54 +08:00
hiyouga | 4a14099cfd | fix #1707 #1710 | Former-commit-id: 243a596518ad69cf1eec20a082534b9e94353ce4 | 2023-12-03 11:33:12 +08:00
hiyouga | da3d59fada | fix gptq model inference | Former-commit-id: f7da9a87cb48cacb7d56322817b05d6f471f6508 | 2023-12-01 23:34:14 +08:00
hiyouga | 72bbd5bdef | patch modelscope | Former-commit-id: 8888cf53f040f5a2d8c0e59cddf79b252449bf58 | 2023-12-01 22:53:15 +08:00
yuze.zyz | d323ccc3ec | add readme | Former-commit-id: 3d5ec6f12b4ae7d04520e6865516a9a6dd4f7efe | 2023-12-01 16:11:30 +08:00
yuze.zyz | 6933c1fed2 | fix | Former-commit-id: e8774b4c9cbc8f894621ec72957f720d5c83d22b | 2023-11-29 21:43:58 +08:00
yuze.zyz | 9d125bf533 | support ms | Former-commit-id: fdd4f94f563110ef9f96ab4a7fd954def32e9785 | 2023-11-29 20:36:55 +08:00
hiyouga | 670ee3934f | fix #1659 | Former-commit-id: e4123129aae59f4123d53c1f5320e3d5e09ae26d | 2023-11-28 20:52:28 +08:00
hiyouga | 28258aecd2 | update ppo trainer | Former-commit-id: caa525a5c6f228b9ad71387d1fe4f1c2ffa2479e | 2023-11-20 21:39:15 +08:00
hiyouga | bcd661afa6 | fix value head model resuming | Former-commit-id: ccf0b65d886c09c7c49977c43b0544fe1bfcc258 | 2023-11-20 19:01:37 +08:00
hiyouga | adf2730d1d | fix #1567 | Former-commit-id: 8c01ffe8d277d49a413571e0669f460c8d0802bf | 2023-11-20 18:46:36 +08:00
hiyouga | a7bf0b85d7 | fix quantization | Former-commit-id: 8268aefe8fba268065e24ffe159a9c49f7c6f3a5 | 2023-11-17 22:21:29 +08:00
hiyouga | 5ce5ea84a9 | fix #1550 | Former-commit-id: c12acd21a5a500892ed739c79327ccd39fddad5b | 2023-11-17 17:23:13 +08:00
hiyouga | dab9385297 | fix bug in web ui | Former-commit-id: a598f145ec903dd2b2c984d951b6c450b142ece5 | 2023-11-16 15:21:24 +08:00
hiyouga | de3a84ac59 | fix rlhf callback | Former-commit-id: f5485452d660caef56474cb7dc37abbe4f34599e | 2023-11-16 03:26:19 +08:00
hiyouga | 7a3a0144a5 | support full-parameter PPO | Former-commit-id: 4af967d69475e1c9fdf1a7983cd6b83bd431abff | 2023-11-16 02:08:04 +08:00
hiyouga | 09a4474e7f | disentangle model from tuner and rename modules | Former-commit-id: 02cbf91e7e424f8379c1fed01b82a5f7a83b6947 | 2023-11-15 16:29:09 +08:00