hiyouga
|
6123a4e713
|
fix export model
Former-commit-id: 7ba2f7bf8da3c559e05d8dde20e93cd1d3d4e8ef
|
2024-03-05 11:05:41 +08:00 |
|
hiyouga
|
10845a2fe7
|
fix #2649
Former-commit-id: 1c850de660c671d92f0bc63f230d338b60b7c0bd
|
2024-03-01 13:02:41 +08:00 |
|
hoshi-hiyouga
|
f2326c94e9
|
Merge pull request #2525 from stephen-nju/main
update project_kwargs for ppo config
Former-commit-id: e7a6910141cc8d8dd966c1f54388d9ef764418d0
|
2024-02-25 15:54:00 +08:00 |
|
hiyouga
|
c7875bdf27
|
fix #2532
Former-commit-id: 23a8e64f1c47cd473c627effbe271233c136369c
|
2024-02-21 21:55:14 +08:00 |
|
stephen
|
8bceef81c0
|
update project_kwargs for ppo config
Former-commit-id: 14f106962fc0a87802ae9ecffff00d52f7f5f046
|
2024-02-21 13:47:38 +08:00 |
|
hiyouga
|
562b9d0167
|
support llama pro #2338 , add rslora
Former-commit-id: 40d659b7f30dd5a004703c176ec1f22dc864e505
|
2024-02-15 02:27:36 +08:00 |
|
hiyouga
|
f14cadf12d
|
fix #2189
Former-commit-id: b3d81b229d376671e1c12229aeb487b0d84f2548
|
2024-02-04 00:47:37 +08:00 |
|
hiyouga
|
eafdae8e94
|
fix #2320
Former-commit-id: e0b0c4415aaf80e75f6dd4f3777a0616b0e60f84
|
2024-01-24 16:19:18 +08:00 |
|
hoshi-hiyouga
|
8cfcce504d
|
Update tuner.py
Former-commit-id: 691420661f7115f809e76484c1f29f74637e7cd0
|
2024-01-21 12:39:38 +08:00 |
|
yhyu13
|
6ac798eb2a
|
Remove manully set use_cache; torch_dtype is not str, save model as bfloat16 used to fail;
Former-commit-id: 75557fb5df16fd6eda7586cf041a16822dcfee8e
|
2024-01-21 11:12:15 +08:00 |
|
hiyouga
|
c0e4eebf17
|
format style
Former-commit-id: 53b683531b83cd1d19de97c6565f16c1eca6f5e1
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
01c22ad7f8
|
fix tests
Former-commit-id: 23f97bd437424ef43b2b84743d56acc5d1ca70d5
|
2024-01-20 19:58:04 +08:00 |
|
hiyouga
|
e5a751ded0
|
support longlora for main branch
Former-commit-id: f869501ad4c368df26534c41f62c6d63c6be17dd
|
2024-01-20 19:25:22 +08:00 |
|
hiyouga
|
2cc280c7b5
|
Update tuner.py
Former-commit-id: db30107385f100f88c9370abea6692ce6030a0c9
|
2024-01-18 15:06:02 +08:00 |
|
hiyouga
|
a9fc7dbfa6
|
support function calling
Former-commit-id: 66533b3f65babf2429c92c0f8fafe4eff5e0ff63
|
2024-01-18 09:54:23 +08:00 |
|
hiyouga
|
cde9c1a42a
|
support export push_to_hub #2183
Former-commit-id: fac09da7123a500d255de74810a8d057fb5c5f07
|
2024-01-16 23:59:42 +08:00 |
|
hiyouga
|
215b4b5c03
|
fix #2164
Former-commit-id: abe23bb4aca4fa571ebafc329ec9a9d457e37d41
|
2024-01-12 00:27:57 +08:00 |
|
hiyouga
|
ed2212f197
|
fix #2161
Former-commit-id: 9acd5a2b678cd07f8e3b48eca76c4cbacb559e37
|
2024-01-11 17:04:13 +08:00 |
|
hiyouga
|
71a1c0de56
|
improve model export
Former-commit-id: d1b795aac1fccbcb8a9ec2057065c33b46ce1a5a
|
2024-01-09 22:26:24 +08:00 |
|
hiyouga
|
9537bc7f3e
|
fix #1789
Former-commit-id: d86455f685fa531e651333e00b4fe54d895cf2e4
|
2024-01-09 18:31:27 +08:00 |
|
hiyouga
|
33f1141705
|
improve model export
Former-commit-id: 31255147a566a23ce1a48402662d14af8ac267ab
|
2024-01-05 18:51:49 +08:00 |
|
hiyouga
|
512a086221
|
fix args
Former-commit-id: ff18f327a3dc96d9677ef32841e8f29ab2eeb7ef
|
2023-12-28 18:47:19 +08:00 |
|
hiyouga
|
e25016cc6b
|
fix export format
Former-commit-id: 7c82bd396b9e6ff395850ad544d95cbf1b7557cd
|
2023-12-28 18:40:46 +08:00 |
|
hiyouga
|
bb4e62cbcd
|
fix ppo trainer
Former-commit-id: ca5b5823b03822ef899405d233a82396be997f44
|
2023-12-28 18:09:28 +08:00 |
|
hiyouga
|
8465803bec
|
fix dpo trainer
Former-commit-id: c160dd7cd86e296e32775ace2e4258a473449c41
|
2023-12-23 01:51:55 +08:00 |
|
hiyouga
|
9cdaa43d1c
|
support unsloth
Former-commit-id: b857f00234b90b785d82ca7cdb29af3d948b1a7b
|
2023-12-23 00:14:33 +08:00 |
|
hiyouga
|
869e54f39b
|
fix #1073 #1462 #1735 #1908
Former-commit-id: cd8e2535aa66931b24b96e76c2b56ce703a579b1
|
2023-12-20 17:15:40 +08:00 |
|
hiyouga
|
67e284b2e4
|
fix #1742
Former-commit-id: efbb32afdcf0d6aa4ca26f54c95f76dbb84f77dc
|
2023-12-16 20:50:45 +08:00 |
|
hiyouga
|
3eec0052cc
|
support dpo-ftx
Former-commit-id: 86dfa04f9821556019fa777106787f73eb70b452
|
2023-12-16 19:21:41 +08:00 |
|
hiyouga
|
eafe889e00
|
update tips
Former-commit-id: 4432cbda6b7535bcbb40ba77df069fca631b4be8
|
2023-12-15 23:52:50 +08:00 |
|
hiyouga
|
f847f50510
|
fix #1770
Former-commit-id: 8266187cec70bb4bd1b4837d51b09409ec11f93e
|
2023-12-15 23:50:15 +08:00 |
|
hiyouga
|
a08089f449
|
support quantization in export model
Former-commit-id: f32500ae6edccab7d14df4c92467e15986866def
|
2023-12-15 23:44:50 +08:00 |
|
hiyouga
|
8432e50396
|
refactor adapter hparam
Former-commit-id: f82aece9ebd6df83a7a005cc7cbbcec07fa6e14d
|
2023-12-15 20:53:11 +08:00 |
|
hiyouga
|
28ed4cb3f4
|
fix ppo trainer save logic
Former-commit-id: 5e70c41e4e12a1109570b0ff56346fe212c028ed
|
2023-12-04 19:00:19 +08:00 |
|
hiyouga
|
3c667217be
|
fix bug
Former-commit-id: 2fd7a8fc3134af66193a5e8db8fea35025f82de9
|
2023-12-03 21:40:40 +08:00 |
|
hiyouga
|
3d200af1d2
|
ppo support rm server
Former-commit-id: 20b0edf16f5b42cb2c4a795674647afb68cb3a4a
|
2023-12-03 21:38:51 +08:00 |
|
hiyouga
|
74851e7033
|
implement rm server #1543
Former-commit-id: 2e5bb6888c86079493456c2ddd525f8c52b9963e
|
2023-12-03 20:52:54 +08:00 |
|
hiyouga
|
24dd0db807
|
fix #1597
Former-commit-id: d77a3a79a0e854803a57af8ac6a7246691f69f70
|
2023-11-30 21:47:06 +08:00 |
|
hiyouga
|
ca70d8393d
|
fix #1668
Former-commit-id: bccc71259e703ca1e1d88169e385a026c4efa92e
|
2023-11-30 21:02:00 +08:00 |
|
hiyouga
|
d072f771d2
|
fix #1658
Former-commit-id: 3126687c4820c34daa6a2e9e3bf9065ad59e92dc
|
2023-11-28 20:57:24 +08:00 |
|
hiyouga
|
a073c3824a
|
fix #1659
Former-commit-id: e4123129aae59f4123d53c1f5320e3d5e09ae26d
|
2023-11-28 20:52:28 +08:00 |
|
hiyouga
|
7796b04ddb
|
support export size setting
Former-commit-id: 1a4de54586c21cdbbc89f8a716ca5a54c87a6120
|
2023-11-26 18:34:09 +08:00 |
|
hiyouga
|
da30d9ba02
|
support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569
Former-commit-id: fdccc6cc9b68890199e9250cabdb996ff2f853b9
|
2023-11-20 22:52:11 +08:00 |
|
hiyouga
|
78e6ac0156
|
update ppo trainer
Former-commit-id: caa525a5c6f228b9ad71387d1fe4f1c2ffa2479e
|
2023-11-20 21:39:15 +08:00 |
|
hoshi-hiyouga
|
05fd97c637
|
Merge pull request #1553 from hannlp/hans
Change the default argument settings for PPO training
Former-commit-id: 1b64678fa4979485f67c3bb1420dfdff6fcbc6e7
|
2023-11-20 20:32:55 +08:00 |
|
hiyouga
|
4febd99b99
|
fix #1567
Former-commit-id: 8c01ffe8d277d49a413571e0669f460c8d0802bf
|
2023-11-20 18:46:36 +08:00 |
|
hiyouga
|
2dba8ad987
|
fix #1263
Former-commit-id: faff5d32621f187ebd3124d7ade04e3fa437c53e
|
2023-11-19 16:05:18 +08:00 |
|
hiyouga
|
226156bdf1
|
fix #1558
Former-commit-id: 263b2b24c8a649b51fa5ae768a24e67def8e0e96
|
2023-11-19 14:15:47 +08:00 |
|
Yuchen Han
|
b6c80a4d43
|
Update workflow.py
Former-commit-id: f70b7ffe6442217a222e0ef797c407f259a13886
|
2023-11-17 00:16:27 -08:00 |
|
hiyouga
|
2b9ec24a5e
|
fix bug in freeze tuning
Former-commit-id: f6b436a08421ca17d64abc51497f4aa43729a43b
|
2023-11-16 14:25:11 +08:00 |
|