86 Commits

Author SHA1 Message Date
hiyouga
3b491b4e1a add CodeQwen models
Former-commit-id: 5f86053d7555423cd35bd2b8610adf423ba3dbd0
2024-04-17 23:27:22 +08:00
hiyouga
bd2b758b48 add codegemma
Former-commit-id: 6543f3d4496218f7f90c582cb6aa8c852d716cbf
2024-04-16 00:11:15 +08:00
hiyouga
2dc3343b1c support cohere commandR #3184
Former-commit-id: e0dbac28450a0e1e0b84e1577ef785fc762c0b46
2024-04-15 23:26:42 +08:00
hoshi-hiyouga
63f3f6b80c Update constants.py
Former-commit-id: 268f53dddbda5905859c6facfce90e90736c6f7d
2024-04-15 22:56:55 +08:00
marko1616
94d988a8e6 Typo fix
Former-commit-id: ab033dac4fde392b18955984e40473a923f745af
2024-04-13 17:30:21 +08:00
marko1616
768153d0f0 Add c4ai-command-r-plus link
Former-commit-id: d0705518ee85408364172803ca3dd68e978e829e
2024-04-13 07:32:40 +08:00
marko1616
6f1323722c Add template&support(Not tested)
Former-commit-id: 6574a721d20108174fc770caf9f17cf8fa81c4b4
2024-04-13 04:31:33 +08:00
hiyouga
3069f37021 tiny fix
Former-commit-id: 9a99fbc86d0be7bd0e6dd1a5475868c32006b08d
2024-04-08 21:28:39 +08:00
hoshi-hiyouga
8682d033eb Merge pull request #3161 from hiyouga/feature/add-mediatek-model
support Breeze-7B

Former-commit-id: 4c6c4a0d8829bc098c202e4c3df36f702d579537
2024-04-08 20:56:51 +08:00
codingma
b5f0ac4c3f add empty line
Former-commit-id: 7b76b4ca08066af0465f138fa756615cbaef32ad
2024-04-07 18:28:08 +08:00
codingma
ed14f8bae7 rename template to breeze
Former-commit-id: 5a780e9eec7e8c560d417c1a95afa1be0f878d32
2024-04-07 11:39:54 +08:00
codingma
80aa1f70b6 support https://github.com/hiyouga/LLaMA-Factory/issues/3152
Former-commit-id: 2565a32bd98ef46a3b6e1a5c334093ca46b820d2
2024-04-07 11:34:01 +08:00
sliderSun
7037dcbf38 fix spell error
Former-commit-id: 1d117b7bb6b71ff9ff98b7d5cc0bbf7b879ad385
2024-04-07 10:59:15 +08:00
sliderSun
1fbf190eda support Qwen1.5-32B
Former-commit-id: 21650d467c19c99d30f538f8a70984d4e3417604
2024-04-07 10:56:03 +08:00
sliderSun
09107affda support Qwen1.5-32B
Former-commit-id: 77044d9ef4ce39a038ad769f93569d5eb2f3bfb0
2024-04-07 10:26:13 +08:00
hiyouga
8d987b7af7 add qwen1.5 moe
Former-commit-id: 54b7d349088a828d32551fde56b3467a82df1b9b
2024-04-01 21:49:40 +08:00
hiyouga
2f878bde11 support ORPO
Former-commit-id: 17bf8a2c3a7bb5b83071c8659cfd8751e894e692
2024-03-31 18:29:50 +08:00
hiyouga
096c31bfb6 patch for gemma cpt
Former-commit-id: 70a3052dd8a2d1322fa01ab19e369e465842d416
2024-03-12 21:21:54 +08:00
hiyouga
14ed926a2d support olmo
Former-commit-id: b3247d6a1604f4cbeb0d7c163d0082ce91afb870
2024-03-12 18:30:38 +08:00
hiyouga
f373290012 add Yi-9B model
Former-commit-id: 57452a4aa1d37a047d659f002c1aaa6246f64178
2024-03-07 23:11:57 +08:00
hiyouga
31c618f1f7 tiny fix
Former-commit-id: 0048a2021e94d068f7c6054df0b9569ae4912eb1
2024-03-06 17:25:08 +08:00
hiyouga
9ae1514a75 update readme, add starcoder2, cosmopedia
Former-commit-id: 894d183214417b10af64d6add7be082d63e8b1f3
2024-03-03 01:01:46 +08:00
hiyouga
57f85add58 update chatglm3 template
Former-commit-id: 38d8b2cef8d70ce8c390de0317559df7f04b4a5d
2024-02-28 21:11:23 +08:00
hiyouga
5abbca70d3 support DoRA, AWQ, AQLM #2512
Former-commit-id: cfefacaa37453a15c55866d019887f24e886a577
2024-02-28 19:53:28 +08:00
hiyouga
3af5fea981 update readme
Former-commit-id: 3ba10545937cd9a1dea9cf65d98dd174a205337d
2024-02-26 17:25:47 +08:00
Rayrtfr
41e9dd2cf8 Support Atom Model
Former-commit-id: 6e0fba60b3857c8920edb6bd2a34a7f2d4ac46be
2024-02-26 10:44:10 +08:00
hiyouga
1845e03921 support gemma
Former-commit-id: c99e19641a9b893da0a3277bd41bd1d3996d1913
2024-02-21 23:27:36 +08:00
hiyouga
23dd337ac2 lint
Former-commit-id: 88a1bc97736bf06f292cd768fc8b61503aca1988
2024-02-07 01:10:04 +08:00
hiyouga
9debd64cef add models
Former-commit-id: 85622ae757e2ffe7f3da15f0a9123e8410d82b28
2024-02-06 14:57:23 +08:00
hiyouga
dcfb9b5cfa support qwen1.5
Former-commit-id: ccabb5b04a6ec36e102c3be680184e4f76e08b2b
2024-02-06 00:10:51 +08:00
hiyouga
9898712a24 add orion models
Former-commit-id: 6fc2d5cc0375a183ec95f003e7a27567b2a71514
2024-01-22 21:26:53 +08:00
hiyouga
b27e91222c format style
Former-commit-id: 638234ceee1b19716e45b6e5f4ea54d9122da4df
2024-01-20 20:15:56 +08:00
hiyouga
0a8f46882c update readme
Former-commit-id: 5608a0da8e24f499a04db9bfbd45e296d8011977
2024-01-18 14:30:48 +08:00
hiyouga
4e3bfb799d support function calling
Former-commit-id: d9f1cae35150cce594a7abd96dd2beb811fa33f2
2024-01-18 09:54:23 +08:00
hiyouga
7e16d27fca tiny fix
Former-commit-id: 5a207bb7230789ddefba932095de83002d01c005
2024-01-15 23:34:23 +08:00
hiyouga
21020e51ca support solar 10.7B #1907
Former-commit-id: bf73224f33c93647a6101d36bde5bf8ddfc91438
2024-01-14 00:30:30 +08:00
hiyouga
9771acfd75 support deepseek moe
Former-commit-id: ca3933dc5295bd8d9e5e37ce869ff8fb44761047
2024-01-14 00:14:49 +08:00
hiyouga
dad632091d fix phi modules
Former-commit-id: d1a73fe26ce2c12953c8eebd1d1118abc90dcf74
2024-01-13 23:12:47 +08:00
hiyouga
b29c4fb308 modify weight name
Former-commit-id: 919acc2b0be03ede08cc0018784e8874a920e300
2024-01-09 20:22:47 +08:00
hiyouga
61960189b2 fix #1789
Former-commit-id: 4571068e1e00dc234c9131185fe0924c726add84
2024-01-09 18:31:27 +08:00
hiyouga
4735cb96c1 add yuan model
Former-commit-id: c7ea17d6168d0a032960dbf51c12482f97529b1e
2023-12-29 13:50:24 +08:00
hiyouga
51c636db54 add xverse-65B-2 model
Former-commit-id: 2df923540c3cbf3b06c74801ea66d3523718b84a
2023-12-18 19:24:09 +08:00
hiyouga
1af13cb737 add models
Former-commit-id: 709ac8870a17a96e786b32c75ad8c4e573148cee
2023-12-18 19:09:31 +08:00
hiyouga
397f6bb615 add xverse-65b-chat model
Former-commit-id: 7ae6919b9bb9ecc8d821eea47a03eacd9eb997ac
2023-12-16 20:21:29 +08:00
hiyouga
f0f9d253d8 support autogptq in llama board #246
Former-commit-id: 71389be37cb0f1a65db6e501e11ca14e615c1a24
2023-12-16 16:31:30 +08:00
hiyouga
7dbc670902 support quantization in export model
Former-commit-id: 3524aa1e58da94ab00e9a2024952ea1b4119b2af
2023-12-15 23:44:50 +08:00
hiyouga
f9ab303629 add model urls
Former-commit-id: 3552035d7eecff86943f02aa26693544fe295f49
2023-12-13 00:09:17 +08:00
hiyouga
b7d99ad5f4 support mixtral
Former-commit-id: 96380f5e1887bb166be339e58ab8f65e464d4010
2023-12-12 11:39:04 +08:00
hiyouga
9b84a706af add models
Former-commit-id: e25f7bae16b7ea41a4a1fd1e8db1b961e55d0c5b
2023-12-06 13:33:18 +08:00
hiyouga
f8376b228a add xuanyuan models
Former-commit-id: 6e7af11b989e4cf97ffacbab4736e3434ff6c925
2023-12-02 00:35:29 +08:00