hiyouga
|
5f86053d75
|
add CodeQwen models
|
2024-04-17 23:27:22 +08:00 |
|
hiyouga
|
6543f3d449
|
add codegemma
|
2024-04-16 00:11:15 +08:00 |
|
hiyouga
|
e0dbac2845
|
support cohere commandR #3184
|
2024-04-15 23:26:42 +08:00 |
|
hoshi-hiyouga
|
268f53dddb
|
Update constants.py
|
2024-04-15 22:56:55 +08:00 |
|
marko1616
|
ab033dac4f
|
Typo fix
|
2024-04-13 17:30:21 +08:00 |
|
marko1616
|
d0705518ee
|
Add c4ai-command-r-plus link
|
2024-04-13 07:32:40 +08:00 |
|
marko1616
|
6574a721d2
|
Add template&support(Not tested)
|
2024-04-13 04:31:33 +08:00 |
|
hiyouga
|
9a99fbc86d
|
tiny fix
|
2024-04-08 21:28:39 +08:00 |
|
hoshi-hiyouga
|
4c6c4a0d88
|
Merge pull request #3161 from hiyouga/feature/add-mediatek-model
support Breeze-7B
|
2024-04-08 20:56:51 +08:00 |
|
codingma
|
7b76b4ca08
|
add empty line
|
2024-04-07 18:28:08 +08:00 |
|
codingma
|
5a780e9eec
|
rename template to breeze
|
2024-04-07 11:39:54 +08:00 |
|
codingma
|
2565a32bd9
|
support https://github.com/hiyouga/LLaMA-Factory/issues/3152
|
2024-04-07 11:34:01 +08:00 |
|
sliderSun
|
1d117b7bb6
|
fix spell error
|
2024-04-07 10:59:15 +08:00 |
|
sliderSun
|
21650d467c
|
support Qwen1.5-32B
|
2024-04-07 10:56:03 +08:00 |
|
sliderSun
|
77044d9ef4
|
support Qwen1.5-32B
|
2024-04-07 10:26:13 +08:00 |
|
hiyouga
|
54b7d34908
|
add qwen1.5 moe
|
2024-04-01 21:49:40 +08:00 |
|
hiyouga
|
17bf8a2c3a
|
support ORPO
|
2024-03-31 18:29:50 +08:00 |
|
hiyouga
|
70a3052dd8
|
patch for gemma cpt
|
2024-03-12 21:21:54 +08:00 |
|
hiyouga
|
b3247d6a16
|
support olmo
|
2024-03-12 18:30:38 +08:00 |
|
hiyouga
|
57452a4aa1
|
add Yi-9B model
|
2024-03-07 23:11:57 +08:00 |
|
hiyouga
|
0048a2021e
|
tiny fix
|
2024-03-06 17:25:08 +08:00 |
|
hiyouga
|
894d183214
|
update readme, add starcoder2, cosmopedia
|
2024-03-03 01:01:46 +08:00 |
|
hiyouga
|
38d8b2cef8
|
update chatglm3 template
|
2024-02-28 21:11:23 +08:00 |
|
hiyouga
|
cfefacaa37
|
support DoRA, AWQ, AQLM #2512
|
2024-02-28 19:53:28 +08:00 |
|
hiyouga
|
3ba1054593
|
update readme
|
2024-02-26 17:25:47 +08:00 |
|
Rayrtfr
|
6e0fba60b3
|
Support Atom Model
|
2024-02-26 10:44:10 +08:00 |
|
hiyouga
|
c99e19641a
|
support gemma
|
2024-02-21 23:27:36 +08:00 |
|
hiyouga
|
88a1bc9773
|
lint
|
2024-02-07 01:10:04 +08:00 |
|
hiyouga
|
85622ae757
|
add models
|
2024-02-06 14:57:23 +08:00 |
|
hiyouga
|
ccabb5b04a
|
support qwen1.5
|
2024-02-06 00:10:51 +08:00 |
|
hiyouga
|
6fc2d5cc03
|
add orion models
|
2024-01-22 21:26:53 +08:00 |
|
hiyouga
|
638234ceee
|
format style
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
5608a0da8e
|
update readme
|
2024-01-18 14:30:48 +08:00 |
|
hiyouga
|
d9f1cae351
|
support function calling
|
2024-01-18 09:54:23 +08:00 |
|
hiyouga
|
5a207bb723
|
tiny fix
|
2024-01-15 23:34:23 +08:00 |
|
hiyouga
|
bf73224f33
|
support solar 10.7B #1907
|
2024-01-14 00:30:30 +08:00 |
|
hiyouga
|
ca3933dc52
|
support deepseek moe
|
2024-01-14 00:14:49 +08:00 |
|
hiyouga
|
d1a73fe26c
|
fix phi modules
|
2024-01-13 23:12:47 +08:00 |
|
hiyouga
|
919acc2b0b
|
modify weight name
|
2024-01-09 20:22:47 +08:00 |
|
hiyouga
|
4571068e1e
|
fix #1789
|
2024-01-09 18:31:27 +08:00 |
|
hiyouga
|
c7ea17d616
|
add yuan model
|
2023-12-29 13:50:24 +08:00 |
|
hiyouga
|
2df923540c
|
add xverse-65B-2 model
|
2023-12-18 19:24:09 +08:00 |
|
hiyouga
|
709ac8870a
|
add models
|
2023-12-18 19:09:31 +08:00 |
|
hiyouga
|
7ae6919b9b
|
add xverse-65b-chat model
|
2023-12-16 20:21:29 +08:00 |
|
hiyouga
|
71389be37c
|
support autogptq in llama board #246
|
2023-12-16 16:31:30 +08:00 |
|
hiyouga
|
3524aa1e58
|
support quantization in export model
|
2023-12-15 23:44:50 +08:00 |
|
hiyouga
|
3552035d7e
|
add model urls
|
2023-12-13 00:09:17 +08:00 |
|
hiyouga
|
96380f5e18
|
support mixtral
|
2023-12-12 11:39:04 +08:00 |
|
hiyouga
|
e25f7bae16
|
add models
|
2023-12-06 13:33:18 +08:00 |
|
hiyouga
|
6e7af11b98
|
add xuanyuan models
|
2023-12-02 00:35:29 +08:00 |
|