Author | Commit | Message | Date
hiyouga | ddd48ce8ab | Update tuner.py | 2024-01-18 15:06:02 +08:00
hiyouga | d9f1cae351 | support function calling | 2024-01-18 09:54:23 +08:00
hiyouga | 42859f0734 | support export push_to_hub #2183 | 2024-01-16 23:59:42 +08:00
hiyouga | 4b2d11ec28 | fix #2164 | 2024-01-12 00:27:57 +08:00
hiyouga | 898ec3696a | fix #2161 | 2024-01-11 17:04:13 +08:00
hiyouga | 05ed4e8028 | improve model export | 2024-01-09 22:26:24 +08:00
hiyouga | 4571068e1e | fix #1789 | 2024-01-09 18:31:27 +08:00
hiyouga | d2a676c8ba | improve model export | 2024-01-05 18:51:49 +08:00
hiyouga | 65c5b0477c | fix args | 2023-12-28 18:47:19 +08:00
hiyouga | e165354fac | fix export format | 2023-12-28 18:40:46 +08:00
hiyouga | 5431be42f9 | fix ppo trainer | 2023-12-28 18:09:28 +08:00
hiyouga | 074745b170 | fix dpo trainer | 2023-12-23 01:51:55 +08:00
hiyouga | 7aad0b889d | support unsloth | 2023-12-23 00:14:33 +08:00
hiyouga | 31165a9822 | fix #1073 #1462 #1735 #1908 | 2023-12-20 17:15:40 +08:00
hiyouga | 870426ff70 | fix #1742 | 2023-12-16 20:50:45 +08:00
hiyouga | b87c74289d | support dpo-ftx | 2023-12-16 19:21:41 +08:00
hiyouga | 3551171d49 | update tips | 2023-12-15 23:52:50 +08:00
hiyouga | 439a26c276 | fix #1770 | 2023-12-15 23:50:15 +08:00
hiyouga | 3524aa1e58 | support quantization in export model | 2023-12-15 23:44:50 +08:00
hiyouga | 0716f5e470 | refactor adapter hparam | 2023-12-15 20:53:11 +08:00
hiyouga | d3dccd0693 | fix ppo trainer save logic | 2023-12-04 19:00:19 +08:00
hiyouga | 8b681ee273 | fix bug | 2023-12-03 21:40:40 +08:00
hiyouga | 747db40172 | ppo support rm server | 2023-12-03 21:38:51 +08:00
hiyouga | 7df4f3ab20 | implement rm server #1543 | 2023-12-03 20:52:54 +08:00
hiyouga | 327d7f7efe | fix #1597 | 2023-11-30 21:47:06 +08:00
hiyouga | 1585962eb7 | fix #1668 | 2023-11-30 21:02:00 +08:00
hiyouga | 77d1b14fc2 | fix #1658 | 2023-11-28 20:57:24 +08:00
hiyouga | 475a3fa0f4 | fix #1659 | 2023-11-28 20:52:28 +08:00
hiyouga | 859a6ea942 | support export size setting | 2023-11-26 18:34:09 +08:00
hiyouga | 9ea9380145 | support GPTQ tuning #729 #1481 #1545, fix chatglm template #1453 #1480 #1569 | 2023-11-20 22:52:11 +08:00
hiyouga | 5021062493 | update ppo trainer | 2023-11-20 21:39:15 +08:00
hoshi-hiyouga | 48211e3799 | Merge pull request #1553 from hannlp/hans: Change the default argument settings for PPO training | 2023-11-20 20:32:55 +08:00
hiyouga | 99a3f06377 | fix #1567 | 2023-11-20 18:46:36 +08:00
hiyouga | 065bfaeed4 | fix #1263 | 2023-11-19 16:05:18 +08:00
hiyouga | 1740131d63 | fix #1558 | 2023-11-19 14:15:47 +08:00
Yuchen Han | eeb5249d0b | Update workflow.py | 2023-11-17 00:16:27 -08:00
hiyouga | ff52b1779c | fix bug in freeze tuning | 2023-11-16 14:25:11 +08:00
hiyouga | 1817ffc86f | fix rlhf callback | 2023-11-16 03:26:19 +08:00
hiyouga | 856522a3df | fix bug in PPO training | 2023-11-16 02:32:54 +08:00
hiyouga | 35b91ea34c | fix import bug | 2023-11-16 02:27:03 +08:00
hiyouga | ce78303600 | support full-parameter PPO | 2023-11-16 02:08:04 +08:00
hiyouga | 4736344eb1 | disentangle model from tuner and rename modules | 2023-11-15 16:29:09 +08:00