hiyouga
|
7cc0721028
|
fix #2189
Former-commit-id: b988ce0a0c
|
2024-02-04 00:47:37 +08:00 |
|
hiyouga
|
b8a827faeb
|
fix #2320
Former-commit-id: 2bc30763e9
|
2024-01-24 16:19:18 +08:00 |
|
hoshi-hiyouga
|
5159c9719c
|
Update tuner.py
Former-commit-id: 662b9a9dcf
|
2024-01-21 12:39:38 +08:00 |
|
yhyu13
|
f036b9c7ba
|
Remove manully set use_cache; torch_dtype is not str, save model as bfloat16 used to fail;
Former-commit-id: 9cdbd3bfc8
|
2024-01-21 11:12:15 +08:00 |
|
hiyouga
|
b27e91222c
|
format style
Former-commit-id: 638234ceee
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
2f7684a8ee
|
fix tests
Former-commit-id: f6d6e00337
|
2024-01-20 19:58:04 +08:00 |
|
hiyouga
|
69e8925249
|
support longlora for main branch
Former-commit-id: 38af076a75
|
2024-01-20 19:25:22 +08:00 |
|
hiyouga
|
6e33982849
|
Update tuner.py
Former-commit-id: ddd48ce8ab
|
2024-01-18 15:06:02 +08:00 |
|
hiyouga
|
4e3bfb799d
|
support function calling
Former-commit-id: d9f1cae351
|
2024-01-18 09:54:23 +08:00 |
|
hiyouga
|
6a954cc075
|
support export push_to_hub #2183
Former-commit-id: 42859f0734
|
2024-01-16 23:59:42 +08:00 |
|
hiyouga
|
69d966eb1f
|
fix #2164
Former-commit-id: 4b2d11ec28
|
2024-01-12 00:27:57 +08:00 |
|
hiyouga
|
6378864390
|
fix #2161
Former-commit-id: 898ec3696a
|
2024-01-11 17:04:13 +08:00 |
|
hiyouga
|
fbe945aba0
|
improve model export
Former-commit-id: 05ed4e8028
|
2024-01-09 22:26:24 +08:00 |
|
hiyouga
|
61960189b2
|
fix #1789
Former-commit-id: 4571068e1e
|
2024-01-09 18:31:27 +08:00 |
|
hiyouga
|
6bbcf5ad16
|
improve model export
Former-commit-id: d2a676c8ba
|
2024-01-05 18:51:49 +08:00 |
|
hiyouga
|
cae66bce3d
|
fix args
Former-commit-id: 65c5b0477c
|
2023-12-28 18:47:19 +08:00 |
|
hiyouga
|
16688b773a
|
fix export format
Former-commit-id: e165354fac
|
2023-12-28 18:40:46 +08:00 |
|
hiyouga
|
d0946f08db
|
fix ppo trainer
Former-commit-id: 5431be42f9
|
2023-12-28 18:09:28 +08:00 |
|
hiyouga
|
938c4cb132
|
fix dpo trainer
Former-commit-id: 074745b170
|
2023-12-23 01:51:55 +08:00 |
|
hiyouga
|
f0d405f392
|
support unsloth
Former-commit-id: 7aad0b889d
|
2023-12-23 00:14:33 +08:00 |
|
hiyouga
|
82a79e9fdf
|
fix #1073 #1462 #1735 #1908
Former-commit-id: 31165a9822
|
2023-12-20 17:15:40 +08:00 |
|
hiyouga
|
8154b4bdf6
|
fix #1742
Former-commit-id: 870426ff70
|
2023-12-16 20:50:45 +08:00 |
|
hiyouga
|
4e75ca1222
|
support dpo-ftx
Former-commit-id: b87c74289d
|
2023-12-16 19:21:41 +08:00 |
|
hiyouga
|
7db6fe4754
|
update tips
Former-commit-id: 3551171d49
|
2023-12-15 23:52:50 +08:00 |
|
hiyouga
|
9a88387b91
|
fix #1770
Former-commit-id: 439a26c276
|
2023-12-15 23:50:15 +08:00 |
|
hiyouga
|
7dbc670902
|
support quantization in export model
Former-commit-id: 3524aa1e58
|
2023-12-15 23:44:50 +08:00 |
|
hiyouga
|
bd03307bbd
|
refactor adapter hparam
Former-commit-id: 0716f5e470
|
2023-12-15 20:53:11 +08:00 |
|
hiyouga
|
027caabbb6
|
fix ppo trainer save logic
Former-commit-id: d3dccd0693
|
2023-12-04 19:00:19 +08:00 |
|
hiyouga
|
6493558c3b
|
fix bug
Former-commit-id: 8b681ee273
|
2023-12-03 21:40:40 +08:00 |
|
hiyouga
|
64eead3fb1
|
ppo support rm server
Former-commit-id: 747db40172
|
2023-12-03 21:38:51 +08:00 |
|
hiyouga
|
1cb390b9b2
|
implement rm server #1543
Former-commit-id: 7df4f3ab20
|
2023-12-03 20:52:54 +08:00 |
|
hiyouga
|
3d291a82d3
|
fix #1597
Former-commit-id: 327d7f7efe
|
2023-11-30 21:47:06 +08:00 |
|
hiyouga
|
ba6d290d0b
|
fix #1668
Former-commit-id: 1585962eb7
|
2023-11-30 21:02:00 +08:00 |
|
hiyouga
|
ecfc7d1b50
|
fix #1658
Former-commit-id: 77d1b14fc2
|
2023-11-28 20:57:24 +08:00 |
|
hiyouga
|
ae1048db6d
|
fix #1659
Former-commit-id: 475a3fa0f4
|
2023-11-28 20:52:28 +08:00 |
|
hiyouga
|
b015ac35d8
|
support export size setting
Former-commit-id: 859a6ea942
|
2023-11-26 18:34:09 +08:00 |
|
hiyouga
|
4966bd7911
|
support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569
Former-commit-id: 9ea9380145
|
2023-11-20 22:52:11 +08:00 |
|
hiyouga
|
f06c4c8f7a
|
update ppo trainer
Former-commit-id: 5021062493
|
2023-11-20 21:39:15 +08:00 |
|
hoshi-hiyouga
|
d72f123851
|
Merge pull request #1553 from hannlp/hans
Change the default argument settings for PPO training
Former-commit-id: 48211e3799
|
2023-11-20 20:32:55 +08:00 |
|
hiyouga
|
682d81caa9
|
fix #1567
Former-commit-id: 99a3f06377
|
2023-11-20 18:46:36 +08:00 |
|
hiyouga
|
a53afb27eb
|
fix #1263
Former-commit-id: 065bfaeed4
|
2023-11-19 16:05:18 +08:00 |
|
hiyouga
|
48d6d925f7
|
fix #1558
Former-commit-id: 1740131d63
|
2023-11-19 14:15:47 +08:00 |
|
Yuchen Han
|
a419122179
|
Update workflow.py
Former-commit-id: eeb5249d0b
|
2023-11-17 00:16:27 -08:00 |
|
hiyouga
|
0ed0b8f9c5
|
fix bug in freeze tuning
Former-commit-id: ff52b1779c
|
2023-11-16 14:25:11 +08:00 |
|
hiyouga
|
678052a7ef
|
fix rlhf callback
Former-commit-id: 1817ffc86f
|
2023-11-16 03:26:19 +08:00 |
|
hiyouga
|
b71da932eb
|
fix bug in PPO training
Former-commit-id: 856522a3df
|
2023-11-16 02:32:54 +08:00 |
|
hiyouga
|
eb5a852dd5
|
fix import bug
Former-commit-id: 35b91ea34c
|
2023-11-16 02:27:03 +08:00 |
|
hiyouga
|
f441932bd1
|
support full-parameter PPO
Former-commit-id: ce78303600
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
06a4820836
|
disentangle model from tuner and rename modules
Former-commit-id: 4736344eb1
|
2023-11-15 16:29:09 +08:00 |
|