Commit Graph

92 Commits

Author SHA1 Message Date
hiyouga
ddd48ce8ab Update tuner.py 2024-01-18 15:06:02 +08:00
hiyouga
d9f1cae351 support function calling 2024-01-18 09:54:23 +08:00
hiyouga
42859f0734 support export push_to_hub #2183 2024-01-16 23:59:42 +08:00
hiyouga
4b2d11ec28 fix #2164 2024-01-12 00:27:57 +08:00
hiyouga
898ec3696a fix #2161 2024-01-11 17:04:13 +08:00
hiyouga
05ed4e8028 improve model export 2024-01-09 22:26:24 +08:00
hiyouga
4571068e1e fix #1789 2024-01-09 18:31:27 +08:00
hiyouga
d2a676c8ba improve model export 2024-01-05 18:51:49 +08:00
hiyouga
65c5b0477c fix args 2023-12-28 18:47:19 +08:00
hiyouga
e165354fac fix export format 2023-12-28 18:40:46 +08:00
hiyouga
5431be42f9 fix ppo trainer 2023-12-28 18:09:28 +08:00
hiyouga
074745b170 fix dpo trainer 2023-12-23 01:51:55 +08:00
hiyouga
7aad0b889d support unsloth 2023-12-23 00:14:33 +08:00
hiyouga
31165a9822 fix #1073 #1462 #1735 #1908 2023-12-20 17:15:40 +08:00
hiyouga
870426ff70 fix #1742 2023-12-16 20:50:45 +08:00
hiyouga
b87c74289d support dpo-ftx 2023-12-16 19:21:41 +08:00
hiyouga
3551171d49 update tips 2023-12-15 23:52:50 +08:00
hiyouga
439a26c276 fix #1770 2023-12-15 23:50:15 +08:00
hiyouga
3524aa1e58 support quantization in export model 2023-12-15 23:44:50 +08:00
hiyouga
0716f5e470 refactor adapter hparam 2023-12-15 20:53:11 +08:00
hiyouga
d3dccd0693 fix ppo trainer save logic 2023-12-04 19:00:19 +08:00
hiyouga
8b681ee273 fix bug 2023-12-03 21:40:40 +08:00
hiyouga
747db40172 ppo support rm server 2023-12-03 21:38:51 +08:00
hiyouga
7df4f3ab20 implement rm server #1543 2023-12-03 20:52:54 +08:00
hiyouga
327d7f7efe fix #1597 2023-11-30 21:47:06 +08:00
hiyouga
1585962eb7 fix #1668 2023-11-30 21:02:00 +08:00
hiyouga
77d1b14fc2 fix #1658 2023-11-28 20:57:24 +08:00
hiyouga
475a3fa0f4 fix #1659 2023-11-28 20:52:28 +08:00
hiyouga
859a6ea942 support export size setting 2023-11-26 18:34:09 +08:00
hiyouga
9ea9380145 support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569 2023-11-20 22:52:11 +08:00
hiyouga
5021062493 update ppo trainer 2023-11-20 21:39:15 +08:00
hoshi-hiyouga
48211e3799 Merge pull request #1553 from hannlp/hans
Change the default argument settings for PPO training
2023-11-20 20:32:55 +08:00
hiyouga
99a3f06377 fix #1567 2023-11-20 18:46:36 +08:00
hiyouga
065bfaeed4 fix #1263 2023-11-19 16:05:18 +08:00
hiyouga
1740131d63 fix #1558 2023-11-19 14:15:47 +08:00
Yuchen Han
eeb5249d0b Update workflow.py 2023-11-17 00:16:27 -08:00
hiyouga
ff52b1779c fix bug in freeze tuning 2023-11-16 14:25:11 +08:00
hiyouga
1817ffc86f fix rlhf callback 2023-11-16 03:26:19 +08:00
hiyouga
856522a3df fix bug in PPO training 2023-11-16 02:32:54 +08:00
hiyouga
35b91ea34c fix import bug 2023-11-16 02:27:03 +08:00
hiyouga
ce78303600 support full-parameter PPO 2023-11-16 02:08:04 +08:00
hiyouga
4736344eb1 disentangle model from tuner and rename modules 2023-11-15 16:29:09 +08:00