Commit Graph

72 Commits

Author SHA1 Message Date
hiyouga
9bec3c98a2 fix #2777 #2895 2024-03-20 17:59:45 +08:00
hiyouga
8e04794b2d fix packages 2024-03-17 22:32:03 +08:00
hiyouga
6bc2c23b6d fix export 2024-03-15 15:06:30 +08:00
hiyouga
6ebde4f23e tiny fix 2024-03-14 21:19:06 +08:00
hiyouga
3b4a59bfb1 fix export 2024-03-14 18:17:01 +08:00
hiyouga
8172530d54 fix bug 2024-03-13 23:55:31 +08:00
hiyouga
714d936dfb fix bug 2024-03-13 23:43:42 +08:00
hiyouga
72367307df improve lora+ impl. 2024-03-13 23:32:51 +08:00
齐保元
a0965cd62c [FEATURE]: ADD LORA+ ALGORITHM 2024-03-13 19:43:27 +08:00
hiyouga
e874c00906 fix #2775 2024-03-11 00:42:54 +08:00
hiyouga
8664262cde support layerwise galore 2024-03-10 00:24:11 +08:00
hiyouga
bdb496644c allow non-packing pretraining 2024-03-09 22:21:46 +08:00
hiyouga
412c52e325 fix #2766 2024-03-09 21:35:24 +08:00
hiyouga
e8dd38b7fd fix #2756 , patch #2746 2024-03-09 02:01:26 +08:00
hiyouga
33a4c24a8a fix galore 2024-03-08 00:44:51 +08:00
hiyouga
28f7862188 support galore 2024-03-07 22:41:36 +08:00
hiyouga
0048a2021e tiny fix 2024-03-06 17:25:08 +08:00
hiyouga
e5edcf440f fix export model 2024-03-05 11:05:41 +08:00
hiyouga
4e5fae2fac fix #2649 2024-03-01 13:02:41 +08:00
hoshi-hiyouga
4aab19c7ef Merge pull request #2525 from stephen-nju/main
update project_kwargs for ppo config
2024-02-25 15:54:00 +08:00
hiyouga
3cc10a01a7 fix #2532 2024-02-21 21:55:14 +08:00
stephen
42c23798f2 update project_kwargs for ppo config 2024-02-21 13:47:38 +08:00
hiyouga
7924ffc55d support llama pro #2338 , add rslora 2024-02-15 02:27:36 +08:00
hiyouga
b988ce0a0c fix #2189 2024-02-04 00:47:37 +08:00
hiyouga
2bc30763e9 fix #2320 2024-01-24 16:19:18 +08:00
hoshi-hiyouga
662b9a9dcf Update tuner.py 2024-01-21 12:39:38 +08:00
yhyu13
9cdbd3bfc8 Remove manully set use_cache; torch_dtype is not str, save model as bfloat16 used to fail; 2024-01-21 11:12:15 +08:00
hiyouga
638234ceee format style 2024-01-20 20:15:56 +08:00
hiyouga
f6d6e00337 fix tests 2024-01-20 19:58:04 +08:00
hiyouga
38af076a75 support longlora for main branch 2024-01-20 19:25:22 +08:00
hiyouga
ddd48ce8ab Update tuner.py 2024-01-18 15:06:02 +08:00
hiyouga
d9f1cae351 support function calling 2024-01-18 09:54:23 +08:00
hiyouga
42859f0734 support export push_to_hub #2183 2024-01-16 23:59:42 +08:00
hiyouga
4b2d11ec28 fix #2164 2024-01-12 00:27:57 +08:00
hiyouga
898ec3696a fix #2161 2024-01-11 17:04:13 +08:00
hiyouga
05ed4e8028 improve model export 2024-01-09 22:26:24 +08:00
hiyouga
4571068e1e fix #1789 2024-01-09 18:31:27 +08:00
hiyouga
d2a676c8ba improve model export 2024-01-05 18:51:49 +08:00
hiyouga
65c5b0477c fix args 2023-12-28 18:47:19 +08:00
hiyouga
e165354fac fix export format 2023-12-28 18:40:46 +08:00
hiyouga
5431be42f9 fix ppo trainer 2023-12-28 18:09:28 +08:00
hiyouga
074745b170 fix dpo trainer 2023-12-23 01:51:55 +08:00
hiyouga
7aad0b889d support unsloth 2023-12-23 00:14:33 +08:00
hiyouga
31165a9822 fix #1073 #1462 #1735 #1908 2023-12-20 17:15:40 +08:00
hiyouga
870426ff70 fix #1742 2023-12-16 20:50:45 +08:00
hiyouga
b87c74289d support dpo-ftx 2023-12-16 19:21:41 +08:00
hiyouga
3551171d49 update tips 2023-12-15 23:52:50 +08:00
hiyouga
439a26c276 fix #1770 2023-12-15 23:50:15 +08:00
hiyouga
3524aa1e58 support quantization in export model 2023-12-15 23:44:50 +08:00
hiyouga
0716f5e470 refactor adapter hparam 2023-12-15 20:53:11 +08:00