hiyouga
|
296711d502
|
support quantization in export model
Former-commit-id: f32500ae6edccab7d14df4c92467e15986866def
|
2023-12-15 23:44:50 +08:00 |
|
hiyouga
|
f902b0d420
|
refactor adapter hparam
Former-commit-id: f82aece9ebd6df83a7a005cc7cbbcec07fa6e14d
|
2023-12-15 20:53:11 +08:00 |
|
hiyouga
|
2542b62d77
|
remove loftq
Former-commit-id: e175c0a1c631296117abda2403a4b87bbdd35a66
|
2023-12-13 01:53:46 +08:00 |
|
hiyouga
|
e39bbdd287
|
support loftq
Former-commit-id: e7ac2eb7f7daae17525a278ffbe2f82c0fbd8093
|
2023-12-12 22:47:06 +08:00 |
|
hiyouga
|
934d00ea1e
|
support system column #1765
Former-commit-id: f425584a511c5e42bae8b3ba090eaa898b28adad
|
2023-12-12 19:45:59 +08:00 |
|
hiyouga
|
c27675f70d
|
fix modelscope data hub
Former-commit-id: 5b63e8c22538a4788e4b6c8df50e6e6be93ceeac
|
2023-12-12 18:33:06 +08:00 |
|
hoshi-hiyouga
|
b9736c13e0
|
Merge branch 'main' into feat/support_ms
Former-commit-id: 698756dffb7d4e602b3e0cab66ef0a4befe7215c
|
2023-12-12 17:55:32 +08:00 |
|
xingjun.wang
|
ed26bb3d82
|
update args for MsDataset.load
Former-commit-id: c5f69357a167cbf99a93607177526e787419ea05
|
2023-12-12 13:02:54 +08:00 |
|
hiyouga
|
9e2cc21d04
|
update readme
Former-commit-id: 42e042a4206aeb5177ddde56386e9655b0c06460
|
2023-12-12 11:44:30 +08:00 |
|
hiyouga
|
f3ffa8310f
|
fix #1784
Former-commit-id: 4e1af5a5d39d9e2f374c1372e2d67120c63fea09
|
2023-12-09 20:53:18 +08:00 |
|
yuze.zyz
|
596f496f19
|
support ms dataset
Former-commit-id: 98638b35dc24045ac17b9b01d08d3a02372acef3
|
2023-12-08 18:00:57 +08:00 |
|
hiyouga
|
29545d0e5e
|
implement rm server #1543
Former-commit-id: 2e5bb6888c86079493456c2ddd525f8c52b9963e
|
2023-12-03 20:52:54 +08:00 |
|
hiyouga
|
72bbd5bdef
|
patch modelscope
Former-commit-id: 8888cf53f040f5a2d8c0e59cddf79b252449bf58
|
2023-12-01 22:53:15 +08:00 |
|
hoshi-hiyouga
|
a1ec668b70
|
Merge branch 'main' into feat/support_ms
Former-commit-id: b8954342611e24bc3af972747fd016cde89eee3f
|
2023-12-01 20:23:46 +08:00 |
|
hiyouga
|
f3c622b665
|
fix err hint
Former-commit-id: 935a4a01bd9204129dd72a500ed75b268714d1e8
|
2023-12-01 17:13:22 +08:00 |
|
yuze.zyz
|
9d125bf533
|
support ms
Former-commit-id: fdd4f94f563110ef9f96ab4a7fd954def32e9785
|
2023-11-29 20:36:55 +08:00 |
|
hiyouga
|
670ee3934f
|
fix #1659
Former-commit-id: e4123129aae59f4123d53c1f5320e3d5e09ae26d
|
2023-11-28 20:52:28 +08:00 |
|
hiyouga
|
569860d7ac
|
support export size setting
Former-commit-id: 1a4de54586c21cdbbc89f8a716ca5a54c87a6120
|
2023-11-26 18:34:09 +08:00 |
|
hiyouga
|
28258aecd2
|
update ppo trainer
Former-commit-id: caa525a5c6f228b9ad71387d1fe4f1c2ffa2479e
|
2023-11-20 21:39:15 +08:00 |
|
Yuchen Han
|
bcd31cf245
|
Update finetuning_args.py
Former-commit-id: 30e3430553f1f7e09cd57ef2c9843b549746c618
|
2023-11-17 00:15:51 -08:00 |
|
hiyouga
|
de3a84ac59
|
fix rlhf callback
Former-commit-id: f5485452d660caef56474cb7dc37abbe4f34599e
|
2023-11-16 03:26:19 +08:00 |
|
hiyouga
|
e017266b98
|
fix bug in PPO training
Former-commit-id: 2e99f0e53ce6de0acbcab85dd50aef874e8c6336
|
2023-11-16 02:32:54 +08:00 |
|
hiyouga
|
7a3a0144a5
|
support full-parameter PPO
Former-commit-id: 4af967d69475e1c9fdf1a7983cd6b83bd431abff
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
b2ac8376e1
|
support multiple modules in freeze training #1514
Former-commit-id: 60abac70dfd778df2ae8b3a2e960ed8b607d7ab6
|
2023-11-15 17:08:18 +08:00 |
|
hiyouga
|
c9a4551012
|
fix #1494
Former-commit-id: 07c8d734529f03e47ef638a1bda222e8824d3d38
|
2023-11-14 18:07:20 +08:00 |
|
hiyouga
|
64fc9ba678
|
refactor evaluation, upgrade trl to 074
Former-commit-id: ed09ebe2c1926ffdb0520b3866f7fd03a9aed046
|
2023-11-13 22:20:35 +08:00 |
|
hiyouga
|
f0766a2ab0
|
add todo
Former-commit-id: 0bd884feb11736d0ab24ca19885151cb47d9dcd3
|
2023-11-10 14:38:18 +08:00 |
|
hiyouga
|
68dd1ef121
|
tiny fix
Former-commit-id: 97ba2027bb1ddc01a3c824c40d5a180828810c2c
|
2023-11-09 17:20:49 +08:00 |
|
Yanqing
|
b4f1ab93d1
|
Update finetuning_args.py
更新 chatglm/falcon/bloom 的 lora_target 的名称
Former-commit-id: 06606739af035a80ae9ddba9d12c965ed289305d
|
2023-11-09 17:04:40 +08:00 |
|
hiyouga
|
f5ba2190fb
|
fix ppo train and dpo eval
Former-commit-id: ced863031836632cb5920e22ae6991f251372118
|
2023-11-07 22:48:51 +08:00 |
|
hiyouga
|
f7f0c3070e
|
delete file
Former-commit-id: 7d6355db0fd5809b99f3fa42753cf4dffd251fd1
|
2023-11-07 16:20:12 +08:00 |
|
hiyouga
|
2eb65d21ac
|
upgrade peft, fix #1088 #1411
Former-commit-id: aa7d104f8e050d12cb8f585bc8a52c850995500f
|
2023-11-07 16:13:36 +08:00 |
|
hiyouga
|
4bb643e685
|
update data readme (zh)
Former-commit-id: b32fb3a984c681732b82f6544d6c05a98c34cf4c
|
2023-11-02 23:42:49 +08:00 |
|
hiyouga
|
b77c745b1a
|
support sharegpt format, add datasets
Former-commit-id: 202daf8987ccb7523be03ca535b572b5c9e65994
|
2023-11-02 23:10:04 +08:00 |
|
hiyouga
|
f3e4b72957
|
fix #1356
Former-commit-id: d2ed436108a339d405dad1be1ca15baca3d6d3e4
|
2023-11-02 16:51:52 +08:00 |
|
hiyouga
|
8d52fb46ca
|
fix #1325
Former-commit-id: 59f2cbbd52d4646fbd1ba83032bf522ecc49a50f
|
2023-11-01 23:38:49 +08:00 |
|
hiyouga
|
c762168ed0
|
support dataset cache
Former-commit-id: f79ee62eb4a2a4a01cb4e2a6aa2d07158cf8eb59
|
2023-10-26 21:48:45 +08:00 |
|
hiyouga
|
6da51565f5
|
reimplement neftune
Former-commit-id: efe9e5a194d3a9f052701d904715238816e4c09e
|
2023-10-22 16:15:08 +08:00 |
|
anvie
|
af2d61178d
|
add NEFTune optimization
Former-commit-id: 603e0298af64116ac07130fe6661a9ba823c186c
|
2023-10-21 13:24:10 +07:00 |
|
hiyouga
|
d602f06882
|
fix #1232
Former-commit-id: 49975755d47344e362145c52548fdda8783f2c0c
|
2023-10-20 23:28:52 +08:00 |
|
hiyouga
|
47a1f73d0f
|
fix #1218
Former-commit-id: b301f35bd4a3bf368159c8f5fb4e2736f922115b
|
2023-10-19 16:17:41 +08:00 |
|
hiyouga
|
c2e84d4558
|
refactor export, fix #1190
Former-commit-id: 30e60e37023a7c4a2db033ffec0542efa3d5cdfb
|
2023-10-15 16:01:48 +08:00 |
|
hiyouga
|
27dd87c890
|
fix #1184
Former-commit-id: 5b069a967823e659dbc70b0d50361b3ad248087e
|
2023-10-14 19:20:11 +08:00 |
|
hiyouga
|
97b74d328b
|
fix ppo args
Former-commit-id: 0f12899951808f53a482082eb116bda309775930
|
2023-10-11 23:40:50 +08:00 |
|
hiyouga
|
3198a7e5f4
|
refactor model_dtype, fix PPO trainer
Former-commit-id: 3e17ee5afbcb823a7c9a2f91864b3750cd79edb4
|
2023-10-11 23:16:01 +08:00 |
|
hiyouga
|
1c150995ae
|
fix layer norm dtype
Former-commit-id: 67af21961b68d9b54d07b09e444c7140869f26da
|
2023-09-28 00:25:55 +08:00 |
|
hiyouga
|
386d85ae72
|
refactor finetuning Args
Former-commit-id: be425a70a4c8f051717cf1e4464dbd79dae4c0b5
|
2023-09-27 22:28:06 +08:00 |
|
hiyouga
|
20130b486c
|
support LongLoRA
Former-commit-id: 0832ed37e7947d699f17375648a52f80752c2b6b
|
2023-09-27 21:55:50 +08:00 |
|
hiyouga
|
dc68c313ee
|
fix #944
Former-commit-id: 032245647848aaa4167086636b6c985268c5fee3
|
2023-09-21 19:51:02 +08:00 |
|
hiyouga
|
a402161631
|
support FlashAttention2
Former-commit-id: 23e56c5554b948d4f08ad87849b261eafd2c7890
|
2023-09-10 20:43:56 +08:00 |
|