| Author | Commit | Message | Former-commit-id | Date |
|---|---|---|---|---|
| hiyouga | 366c0eb1c5 | fix mod stuff | cf3988226e6398c67bb2955578e436fc505aa5c5 | 2024-04-21 18:11:10 +08:00 |
| hiyouga | cc2892e891 | fix llama3 template | 20e95250168fbe081c779b2e1ff23f5df3ce02f7 | 2024-04-19 15:46:51 +08:00 |
| hoshi-hiyouga | 39f0cc7d8b | support llama3 | c1eabb751a5fd73b710714451b146732e0ed4558 | 2024-04-19 01:13:50 +08:00 |
| hiyouga | dcc34ab729 | tiny fix | 86399ca8c06273c42c2b184664ae25d3405b3bf6 | 2024-04-18 00:22:17 +08:00 |
| hiyouga | ca2c480736 | add mixtral 8x22B models | eccbeecff0909e1fa124b5439ffbbfbc5607e1d6 | 2024-04-17 23:35:59 +08:00 |
| hiyouga | 7b0da78222 | add CodeQwen models | 9f6094241391f8f717818c8ba94e11d1791b4a5c | 2024-04-17 23:27:22 +08:00 |
| hiyouga | d3adeed72a | fix #3317 | 7dce1763be4374cf616d96db95ae964ff510a9d6 | 2024-04-17 22:17:19 +08:00 |
| hiyouga | 16e20ffa8f | update readme and gradio version | 4029b60ddcbd15b5354503c51178f0f5e7e9aedf | 2024-04-16 18:09:16 +08:00 |
| hiyouga | 2aa1d1476e | add codegemma | 9324176525c2eda22962b0ca1895009b6237e6e3 | 2024-04-16 00:11:15 +08:00 |
| hiyouga | 19874e39ee | support cohere commandR #3184 | e077c36872740f6b2ac255aee9da6c4c70f28977 | 2024-04-15 23:26:42 +08:00 |
| hoshi-hiyouga | 41783ae083 | Merge pull request #3254 from marko1616/feature/Add-support-for-CohereForAI/c4ai-command-r-plus; Add template&support for c4ai-command-r/plus (tested) | 41d39ec4889abad050820bf153133ac3a11228a3 | 2024-04-15 22:59:35 +08:00 |
| hoshi-hiyouga | 03c83838b1 | Update constants.py | 39199f712aa7b7a1c66080d9c84651fd2eb0b425 | 2024-04-15 22:56:55 +08:00 |
| hiyouga | be206df674 | update examples | 369294b31c8a03a1cafcee83eb31a817007d3c49 | 2024-04-15 22:14:34 +08:00 |
| marko1616 | bbded4412f | Typo fix | 607625497738b2c8be736be7b0bd5c6f4cbaad5e | 2024-04-13 17:30:21 +08:00 |
| marko1616 | 07d01ed16a | Add c4ai-command-r-plus link | acaf953ca46eca8fb378067f4ada133654e4f088 | 2024-04-13 07:32:40 +08:00 |
| marko1616 | ba9fc46712 | Add template&support(Not tested) | 60bb60c4dc30a9641ddb57a44ef126f0768566c4 | 2024-04-13 04:31:33 +08:00 |
| hiyouga | a97f8d1fa8 | release v0.6.2 | f92ad0a62d957b595f6a76a5403216b163eb3d17 | 2024-04-11 20:08:51 +08:00 |
| hiyouga | 8f6f06ceb5 | fix #3225 | 94110ecf27c32e263f1f2ee61842a3a301b9e089 | 2024-04-10 23:57:59 +08:00 |
| hiyouga | 4739c45e94 | tiny fix | d8f1ff51d4c920d4d0aeb9d53db29d1efb733c85 | 2024-04-08 21:28:39 +08:00 |
| hoshi-hiyouga | cdd24a2f2d | Merge pull request #3161 from hiyouga/feature/add-mediatek-model; support Breeze-7B | af92ac8b62b919a75673011a1c56832e67882ee8 | 2024-04-08 20:56:51 +08:00 |
| codingma | b7cc559649 | add empty line | 1c6c2e611d10e9fa662e3f4e1e7d23b80ae496cb | 2024-04-07 18:28:08 +08:00 |
| codingma | a61e69c0d9 | rename template to breeze | 1d894e7cfb73b8a29dababb554d051bd50e4f01d | 2024-04-07 11:39:54 +08:00 |
| codingma | 34d061f963 | support https://github.com/hiyouga/LLaMA-Factory/issues/3152 | 708f0ab4b0aa72e2c73ca36eb9ed058910e43092 | 2024-04-07 11:34:01 +08:00 |
| sliderSun | 6c9e8e89cf | fix spell error | e6d36a2e593ebc1193b1735075c4ddb5d9f54990 | 2024-04-07 10:59:15 +08:00 |
| sliderSun | 3bdbe17702 | support Qwen1.5-32B | c419adf1697b92520342f4ffa697c84bf19ca37d | 2024-04-07 10:56:03 +08:00 |
| sliderSun | 6eb424adbe | support Qwen1.5-32B | 8f2c67b95a8e177eb4096382417a70cacba38e90 | 2024-04-07 10:26:13 +08:00 |
| hiyouga | 1afadd9975 | back to gradio 4.21 and fix chat | 695734a40a702ea059d855da54080cc8d161e41a | 2024-04-04 02:07:20 +08:00 |
| hiyouga | b112959309 | fix bug in latest gradio | 44a962862b4a74e50ef5786c8d5719faaa65f63f | 2024-04-04 00:55:31 +08:00 |
| hiyouga | 75819c1220 | simplify readme | 0da6ec2d516326fe9c7583ba71cd1778eb838178 | 2024-04-02 20:07:43 +08:00 |
| hiyouga | 76ba7b51c1 | add moe aux loss control #3085 | c9187ebc944e2de454ace3304b7d28eabb1b1a81 | 2024-04-02 14:26:31 +08:00 |
| hiyouga | ff301c08d6 | add qwen1.5 moe | 3ea94f0d12cec25ac694a2c4ae8971c356990b61 | 2024-04-01 21:49:40 +08:00 |
| hiyouga | 8365522ce2 | fix #3077 | d0340391e8075cff0d84b3ef879c2101b66ca1dc | 2024-04-01 21:35:18 +08:00 |
| hiyouga | e6c7e6e667 | support ORPO | f44a4c27e2461cdaa1b16865f597a31033c0e6d9 | 2024-03-31 18:29:50 +08:00 |
| hiyouga | d7ea73957f | support save args in webui #2807 #3046; some ideas are borrowed from @marko1616 | b5a062aa2d4a37670007e8b3dae5b6f5b7ffb15c | 2024-03-30 23:09:12 +08:00 |
| hiyouga | 5c8794ad1a | fix #2982 | e5e6a0c50c7a1c0052ed6b459450b9735ff2c9a1 | 2024-03-28 20:22:31 +08:00 |
| hiyouga | 66420ea460 | fix #3010 | a5e823ae75556eaa3b52ce7a887a6e7838a1eb5f | 2024-03-28 18:31:17 +08:00 |
| hiyouga | 5382e30bb7 | fix #2936 | 9ae646fbbd809057a9c54fe41e1ae5a07a674556 | 2024-03-24 00:43:21 +08:00 |
| hiyouga | 06019b7ee3 | fix #2941 | 3775ab52017f0b610ddd8199cccfb8c001eda507 | 2024-03-24 00:28:44 +08:00 |
| hiyouga | b590e82d41 | support fsdp + qlora | b894bf8e84be689db258021f0638e9ac939abcbc | 2024-03-21 00:36:06 +08:00 |
| hiyouga | 7b72952adc | patch for gemma cpt | fc0b19c62f52a90d78b63761dda3d8970a42f2da | 2024-03-12 21:21:54 +08:00 |
| hiyouga | 0016732c21 | fix plot issues | 01ae196b4916433da9aeec9c0b5c660c6b34464c | 2024-03-12 18:41:35 +08:00 |
| hiyouga | 28acb02e80 | support olmo | 2719510e8c6baa591c74458b773e4e47215e6052 | 2024-03-12 18:30:38 +08:00 |
| hiyouga | 56565bdbd4 | allow non-packing pretraining | 3fee5cc5a3db9ce874ad90f2500ec092d904bd4e | 2024-03-09 22:21:46 +08:00 |
| hiyouga | 6e8df5733a | add Yi-9B model | bfcb0245b832242eefb84de6f70bd75544f3ceb7 | 2024-03-07 23:11:57 +08:00 |
| hiyouga | 3f56782ffa | support galore | b67a4a46a88d83bb2a3459b3317b66cda15e0171 | 2024-03-07 22:41:36 +08:00 |
| hiyouga | ddabd699ca | support vllm | 889f6e910e654d8ec3922c2185042d737ffbf1c3 | 2024-03-07 20:26:31 +08:00 |
| hiyouga | 6bec66c192 | tiny fix | c3145afa4164dd28888f17599a154f7dddbe9326 | 2024-03-06 17:25:08 +08:00 |
| hiyouga | 4aa6db78fb | fix version checking | 5780da8d640609cca388f55983d0251e5547209a | 2024-03-06 14:51:51 +08:00 |
| hiyouga | a0815ff726 | fix sub-process error in thread | 3448ad43d05301b12a19a02c1cc23d7b0ee525c3 | 2024-03-03 15:04:35 +08:00 |
| hiyouga | 28f3e60189 | update readme, add starcoder2, cosmopedia | 1ae7c183640146bb9b06c98942985a1721d2b9c9 | 2024-03-03 01:01:46 +08:00 |