Commit Graph

250 Commits

Author SHA1 Message Date
hiyouga
ed8f8be752 update api and support abort eval in webui 2024-05-04 15:59:15 +08:00
hiyouga
24cc93ab15 fix eval in webui 2024-05-04 00:19:19 +08:00
hiyouga
9585838ebe fix callback log multigpu #3559 2024-05-03 21:24:27 +08:00
hiyouga
245fe47ece update webui and add CLIs 2024-05-03 02:58:23 +08:00
hiyouga
4dbbce21d5 add models to 0.7.0 2024-04-28 01:50:30 +08:00
hiyouga
168f56683a release v0.7.0 2024-04-26 23:18:00 +08:00
hiyouga
375b25131b support Qwen1.5 110B 2024-04-26 19:59:22 +08:00
hiyouga
cd3a960f81 add llava to llamaboard 2024-04-26 06:41:35 +08:00
hiyouga
44a43ee152 add olmo 1.7 2024-04-24 05:50:50 +08:00
hiyouga
69eb03a8fe add dbrx and jamba models 2024-04-24 05:39:52 +08:00
hiyouga
aa2b79eb23 refactor patcher 2024-04-24 03:02:23 +08:00
hiyouga
07737a3d2d reenable sdpa and fast tok by default 2024-04-24 02:18:44 +08:00
hiyouga
1a13f05555 support phi-3 2024-04-24 00:28:53 +08:00
hiyouga
f58425ab45 fix mod stuff 2024-04-21 18:11:10 +08:00
hiyouga
14a605a2da fix llama3 template 2024-04-19 15:46:51 +08:00
hoshi-hiyouga
2aaaede247 support llama3 2024-04-19 01:13:50 +08:00
hiyouga
3b43a3b7c5 tiny fix 2024-04-18 00:22:17 +08:00
hiyouga
cab0598fd0 add mixtral 8x22B models 2024-04-17 23:35:59 +08:00
hiyouga
5f86053d75 add CodeQwen models 2024-04-17 23:27:22 +08:00
hiyouga
6d641af703 fix #3317 2024-04-17 22:17:19 +08:00
hiyouga
5d62a51c12 update readme and gradio version 2024-04-16 18:09:16 +08:00
hiyouga
6543f3d449 add codegemma 2024-04-16 00:11:15 +08:00
hiyouga
e0dbac2845 support cohere commandR #3184 2024-04-15 23:26:42 +08:00
hoshi-hiyouga
7a8ae3f4ac Merge pull request #3254 from marko1616/feature/Add-support-for-CohereForAI/c4ai-command-r-plus
Add template&support for c4ai-command-r/plus (tested)
2024-04-15 22:59:35 +08:00
hoshi-hiyouga
268f53dddb Update constants.py 2024-04-15 22:56:55 +08:00
hiyouga
cce52351b5 update examples 2024-04-15 22:14:34 +08:00
marko1616
ab033dac4f Typo fix 2024-04-13 17:30:21 +08:00
marko1616
d0705518ee Add c4ai-command-r-plus link 2024-04-13 07:32:40 +08:00
marko1616
6574a721d2 Add template&support(Not tested) 2024-04-13 04:31:33 +08:00
hiyouga
9d4c949461 release v0.6.2 2024-04-11 20:08:51 +08:00
hiyouga
a99f5ed0b6 fix #3225 2024-04-10 23:57:59 +08:00
hiyouga
9a99fbc86d tiny fix 2024-04-08 21:28:39 +08:00
hoshi-hiyouga
4c6c4a0d88 Merge pull request #3161 from hiyouga/feature/add-mediatek-model
support Breeze-7B
2024-04-08 20:56:51 +08:00
codingma
7b76b4ca08 add empty line 2024-04-07 18:28:08 +08:00
codingma
5a780e9eec rename template to breeze 2024-04-07 11:39:54 +08:00
codingma
2565a32bd9 support https://github.com/hiyouga/LLaMA-Factory/issues/3152 2024-04-07 11:34:01 +08:00
sliderSun
1d117b7bb6 fix spell error 2024-04-07 10:59:15 +08:00
sliderSun
21650d467c support Qwen1.5-32B 2024-04-07 10:56:03 +08:00
sliderSun
77044d9ef4 support Qwen1.5-32B 2024-04-07 10:26:13 +08:00
hiyouga
4b920f24d3 back to gradio 4.21 and fix chat 2024-04-04 02:07:20 +08:00
hiyouga
5ddcecda50 fix bug in latest gradio 2024-04-04 00:55:31 +08:00
hiyouga
92dab8a90b simplify readme 2024-04-02 20:07:43 +08:00
hiyouga
b267aeb53f add moe aux loss control #3085 2024-04-02 14:26:31 +08:00
hiyouga
54b7d34908 add qwen1.5 moe 2024-04-01 21:49:40 +08:00
hiyouga
aee634cd20 fix #3077 2024-04-01 21:35:18 +08:00
hiyouga
17bf8a2c3a support ORPO 2024-03-31 18:29:50 +08:00
hiyouga
7a086ed333 support save args in webui #2807 #3046
some ideas are borrowed from @marko1616
2024-03-30 23:09:12 +08:00
hiyouga
8d603f8820 fix #2982 2024-03-28 20:22:31 +08:00
hiyouga
b19c14870d fix #3010 2024-03-28 18:31:17 +08:00
hiyouga
140ad4ad56 fix #2936 2024-03-24 00:43:21 +08:00