Commit Graph

86 Commits

Author SHA1 Message Date
hiyouga
5f86053d75 add CodeQwen models 2024-04-17 23:27:22 +08:00
hiyouga
6543f3d449 add codegemma 2024-04-16 00:11:15 +08:00
hiyouga
e0dbac2845 support cohere commandR #3184 2024-04-15 23:26:42 +08:00
hoshi-hiyouga
268f53dddb Update constants.py 2024-04-15 22:56:55 +08:00
marko1616
ab033dac4f Typo fix 2024-04-13 17:30:21 +08:00
marko1616
d0705518ee Add c4ai-command-r-plus link 2024-04-13 07:32:40 +08:00
marko1616
6574a721d2 Add template&support(Not tested) 2024-04-13 04:31:33 +08:00
hiyouga
9a99fbc86d tiny fix 2024-04-08 21:28:39 +08:00
hoshi-hiyouga
4c6c4a0d88 Merge pull request #3161 from hiyouga/feature/add-mediatek-model
support Breeze-7B
2024-04-08 20:56:51 +08:00
codingma
7b76b4ca08 add empty line 2024-04-07 18:28:08 +08:00
codingma
5a780e9eec rename template to breeze 2024-04-07 11:39:54 +08:00
codingma
2565a32bd9 support https://github.com/hiyouga/LLaMA-Factory/issues/3152 2024-04-07 11:34:01 +08:00
sliderSun
1d117b7bb6 fix spell error 2024-04-07 10:59:15 +08:00
sliderSun
21650d467c support Qwen1.5-32B 2024-04-07 10:56:03 +08:00
sliderSun
77044d9ef4 support Qwen1.5-32B 2024-04-07 10:26:13 +08:00
hiyouga
54b7d34908 add qwen1.5 moe 2024-04-01 21:49:40 +08:00
hiyouga
17bf8a2c3a support ORPO 2024-03-31 18:29:50 +08:00
hiyouga
70a3052dd8 patch for gemma cpt 2024-03-12 21:21:54 +08:00
hiyouga
b3247d6a16 support olmo 2024-03-12 18:30:38 +08:00
hiyouga
57452a4aa1 add Yi-9B model 2024-03-07 23:11:57 +08:00
hiyouga
0048a2021e tiny fix 2024-03-06 17:25:08 +08:00
hiyouga
894d183214 update readme, add starcoder2, cosmopedia 2024-03-03 01:01:46 +08:00
hiyouga
38d8b2cef8 update chatglm3 template 2024-02-28 21:11:23 +08:00
hiyouga
cfefacaa37 support DoRA, AWQ, AQLM #2512 2024-02-28 19:53:28 +08:00
hiyouga
3ba1054593 update readme 2024-02-26 17:25:47 +08:00
Rayrtfr
6e0fba60b3 Support Atom Model 2024-02-26 10:44:10 +08:00
hiyouga
c99e19641a support gemma 2024-02-21 23:27:36 +08:00
hiyouga
88a1bc9773 lint 2024-02-07 01:10:04 +08:00
hiyouga
85622ae757 add models 2024-02-06 14:57:23 +08:00
hiyouga
ccabb5b04a support qwen1.5 2024-02-06 00:10:51 +08:00
hiyouga
6fc2d5cc03 add orion models 2024-01-22 21:26:53 +08:00
hiyouga
638234ceee format style 2024-01-20 20:15:56 +08:00
hiyouga
5608a0da8e update readme 2024-01-18 14:30:48 +08:00
hiyouga
d9f1cae351 support function calling 2024-01-18 09:54:23 +08:00
hiyouga
5a207bb723 tiny fix 2024-01-15 23:34:23 +08:00
hiyouga
bf73224f33 support solar 10.7B #1907 2024-01-14 00:30:30 +08:00
hiyouga
ca3933dc52 support deepseek moe 2024-01-14 00:14:49 +08:00
hiyouga
d1a73fe26c fix phi modules 2024-01-13 23:12:47 +08:00
hiyouga
919acc2b0b modify weight name 2024-01-09 20:22:47 +08:00
hiyouga
4571068e1e fix #1789 2024-01-09 18:31:27 +08:00
hiyouga
c7ea17d616 add yuan model 2023-12-29 13:50:24 +08:00
hiyouga
2df923540c add xverse-65B-2 model 2023-12-18 19:24:09 +08:00
hiyouga
709ac8870a add models 2023-12-18 19:09:31 +08:00
hiyouga
7ae6919b9b add xverse-65b-chat model 2023-12-16 20:21:29 +08:00
hiyouga
71389be37c support autogptq in llama board #246 2023-12-16 16:31:30 +08:00
hiyouga
3524aa1e58 support quantization in export model 2023-12-15 23:44:50 +08:00
hiyouga
3552035d7e add model urls 2023-12-13 00:09:17 +08:00
hiyouga
96380f5e18 support mixtral 2023-12-12 11:39:04 +08:00
hiyouga
e25f7bae16 add models 2023-12-06 13:33:18 +08:00
hiyouga
6e7af11b98 add xuanyuan models 2023-12-02 00:35:29 +08:00