| hiyouga | 3b42f1abce | add models to 0.7.0; Former-commit-id: 4dbbce21d5 | 2024-04-28 01:50:30 +08:00 |
| hiyouga | eb14501a52 | release v0.7.0; Former-commit-id: 168f56683a | 2024-04-26 23:18:00 +08:00 |
| hiyouga | 7131f6ae2d | support Qwen1.5 110B; Former-commit-id: 375b25131b | 2024-04-26 19:59:22 +08:00 |
| hiyouga | eb70328eec | add llava to llamaboard; Former-commit-id: cd3a960f81 | 2024-04-26 06:41:35 +08:00 |
| hiyouga | fff1fb1232 | add olmo 1.7; Former-commit-id: 44a43ee152 | 2024-04-24 05:50:50 +08:00 |
| hiyouga | 80f0a63f73 | add dbrx and jamba models; Former-commit-id: 69eb03a8fe | 2024-04-24 05:39:52 +08:00 |
| hiyouga | 8465e54d38 | refactor patcher; Former-commit-id: aa2b79eb23 | 2024-04-24 03:02:23 +08:00 |
| hiyouga | 80c8586534 | reenable sdpa and fast tok by default; Former-commit-id: 07737a3d2d | 2024-04-24 02:18:44 +08:00 |
| hiyouga | 1bea1ed868 | support phi-3; Former-commit-id: 1a13f05555 | 2024-04-24 00:28:53 +08:00 |
| hiyouga | ec81d45d27 | fix mod stuff; Former-commit-id: f58425ab45 | 2024-04-21 18:11:10 +08:00 |
| hiyouga | ac7ec7ed10 | fix llama3 template; Former-commit-id: 14a605a2da | 2024-04-19 15:46:51 +08:00 |
| hoshi-hiyouga | 15d17a1e86 | support llama3; Former-commit-id: 2aaaede247 | 2024-04-19 01:13:50 +08:00 |
| hiyouga | e2e0bbde12 | tiny fix; Former-commit-id: 3b43a3b7c5 | 2024-04-18 00:22:17 +08:00 |
| hiyouga | 8a369cc084 | add mixtral 8x22B models; Former-commit-id: cab0598fd0 | 2024-04-17 23:35:59 +08:00 |
| hiyouga | 3b491b4e1a | add CodeQwen models; Former-commit-id: 5f86053d75 | 2024-04-17 23:27:22 +08:00 |
| hiyouga | f9c859e97b | fix #3317; Former-commit-id: 6d641af703 | 2024-04-17 22:17:19 +08:00 |
| hiyouga | 8beb7a9239 | update readme and gradio version; Former-commit-id: 5d62a51c12 | 2024-04-16 18:09:16 +08:00 |
| hiyouga | bd2b758b48 | add codegemma; Former-commit-id: 6543f3d449 | 2024-04-16 00:11:15 +08:00 |
| hiyouga | 2dc3343b1c | support cohere commandR #3184; Former-commit-id: e0dbac2845 | 2024-04-15 23:26:42 +08:00 |
| hoshi-hiyouga | 57ddf739c2 | Merge pull request #3254 from marko1616/feature/Add-support-for-CohereForAI/c4ai-command-r-plus; Add template&support for c4ai-command-r/plus (tested); Former-commit-id: 7a8ae3f4ac | 2024-04-15 22:59:35 +08:00 |
| hoshi-hiyouga | 63f3f6b80c | Update constants.py; Former-commit-id: 268f53dddb | 2024-04-15 22:56:55 +08:00 |
| hiyouga | fb385b8c26 | update examples; Former-commit-id: cce52351b5 | 2024-04-15 22:14:34 +08:00 |
| marko1616 | 94d988a8e6 | Typo fix; Former-commit-id: ab033dac4f | 2024-04-13 17:30:21 +08:00 |
| marko1616 | 768153d0f0 | Add c4ai-command-r-plus link; Former-commit-id: d0705518ee | 2024-04-13 07:32:40 +08:00 |
| marko1616 | 6f1323722c | Add template&support(Not tested); Former-commit-id: 6574a721d2 | 2024-04-13 04:31:33 +08:00 |
| hiyouga | 431e9804ee | release v0.6.2; Former-commit-id: 9d4c949461 | 2024-04-11 20:08:51 +08:00 |
| hiyouga | d5ca8763ea | fix #3225; Former-commit-id: a99f5ed0b6 | 2024-04-10 23:57:59 +08:00 |
| hiyouga | 3069f37021 | tiny fix; Former-commit-id: 9a99fbc86d | 2024-04-08 21:28:39 +08:00 |
| hoshi-hiyouga | 8682d033eb | Merge pull request #3161 from hiyouga/feature/add-mediatek-model; support Breeze-7B; Former-commit-id: 4c6c4a0d88 | 2024-04-08 20:56:51 +08:00 |
| codingma | b5f0ac4c3f | add empty line; Former-commit-id: 7b76b4ca08 | 2024-04-07 18:28:08 +08:00 |
| codingma | ed14f8bae7 | rename template to breeze; Former-commit-id: 5a780e9eec | 2024-04-07 11:39:54 +08:00 |
| codingma | 80aa1f70b6 | support https://github.com/hiyouga/LLaMA-Factory/issues/3152; Former-commit-id: 2565a32bd9 | 2024-04-07 11:34:01 +08:00 |
| sliderSun | 7037dcbf38 | fix spell error; Former-commit-id: 1d117b7bb6 | 2024-04-07 10:59:15 +08:00 |
| sliderSun | 1fbf190eda | support Qwen1.5-32B; Former-commit-id: 21650d467c | 2024-04-07 10:56:03 +08:00 |
| sliderSun | 09107affda | support Qwen1.5-32B; Former-commit-id: 77044d9ef4 | 2024-04-07 10:26:13 +08:00 |
| hiyouga | f334b89616 | back to gradio 4.21 and fix chat; Former-commit-id: 4b920f24d3 | 2024-04-04 02:07:20 +08:00 |
| hiyouga | 54a4a8217a | fix bug in latest gradio; Former-commit-id: 5ddcecda50 | 2024-04-04 00:55:31 +08:00 |
| hiyouga | bf5ffeeae0 | simplify readme; Former-commit-id: 92dab8a90b | 2024-04-02 20:07:43 +08:00 |
| hiyouga | f4be51f356 | add moe aux loss control #3085; Former-commit-id: b267aeb53f | 2024-04-02 14:26:31 +08:00 |
| hiyouga | 8d987b7af7 | add qwen1.5 moe; Former-commit-id: 54b7d34908 | 2024-04-01 21:49:40 +08:00 |
| hiyouga | 34f1de0574 | fix #3077; Former-commit-id: aee634cd20 | 2024-04-01 21:35:18 +08:00 |
| hiyouga | 2f878bde11 | support ORPO; Former-commit-id: 17bf8a2c3a | 2024-03-31 18:29:50 +08:00 |
| hiyouga | 3bf6dde3a5 | support save args in webui #2807 #3046; some ideas are borrowed from @marko1616; Former-commit-id: 7a086ed333 | 2024-03-30 23:09:12 +08:00 |
| hiyouga | e4f3d583df | fix #2982; Former-commit-id: 8d603f8820 | 2024-03-28 20:22:31 +08:00 |
| hiyouga | eac2a5b1d3 | fix #3010; Former-commit-id: b19c14870d | 2024-03-28 18:31:17 +08:00 |
| hiyouga | 84c3d509fa | fix #2936; Former-commit-id: 140ad4ad56 | 2024-03-24 00:43:21 +08:00 |
| hiyouga | 58aa576ae5 | fix #2941; Former-commit-id: a1c8c98c5f | 2024-03-24 00:28:44 +08:00 |
| hiyouga | 7999836fb6 | support fsdp + qlora; Former-commit-id: 8408225162 | 2024-03-21 00:36:06 +08:00 |
| hiyouga | 096c31bfb6 | patch for gemma cpt; Former-commit-id: 70a3052dd8 | 2024-03-12 21:21:54 +08:00 |
| hiyouga | c28818c39f | fix plot issues; Former-commit-id: 60cc17f3a8 | 2024-03-12 18:41:35 +08:00 |