| Author | Commit | Message | Date |
|---|---|---|---|
| hiyouga | 13093963b1 | fix #5048<br>Former-commit-id: 71a6861667ae68c1fd6a69acf68e1359b858cf1b | 2024-08-05 23:48:19 +08:00 |
| codingma | 4b6252151e | support gemma-2-2b<br>Former-commit-id: 7037192cf6049fd7d675aed4a6237ed929c6b170 | 2024-08-01 13:45:48 +08:00 |
| hiyouga | 5c6d88e91c | add mistral nemo model<br>Former-commit-id: 428bb49f53b32947bc0a62ca19ab10844154c07c | 2024-07-24 16:25:53 +08:00 |
| hiyouga | 0a04d9470f | add llama3.1<br>Former-commit-id: 3c433890f9b61c520572f5233aae70584da0f330 | 2024-07-24 16:20:11 +08:00 |
| hiyouga | 0d6ec70c6f | add codegeex4, internlm2.5<br>Former-commit-id: 349a5fbc934ac289cad44b4e3eb16f458b94710c | 2024-07-06 16:16:47 +08:00 |
| hiyouga | 3d219b91b9 | fix packing for eager/sdpa attn<br>Former-commit-id: 735a033ceb7f2da6da71d138ea091d8a665411a9 | 2024-07-04 01:52:43 +08:00 |
| hoshi-hiyouga | a90c6306f8 | Merge pull request #4224 from chuan298/main<br>Implement efficient packing without cross-contamination attention<br>Former-commit-id: ac382cc9fe4ec483658fd54f07f9a123788ce1b1 | 2024-07-04 01:18:54 +08:00 |
| hiyouga | 60558388ec | update packing<br>Former-commit-id: f3d9c31efa0e64317bdd5b4ed6f78653cf3b5ba4 | 2024-07-04 01:10:55 +08:00 |
| hiyouga | 1408aa078d | update arg name<br>Former-commit-id: 1509ed550b2060f946ce20e3c5a9e5c49e86e3ab | 2024-07-03 23:23:24 +08:00 |
| hiyouga | e6ba7ef3e6 | improve rlhf<br>Former-commit-id: e441780e3db256ca09a442ea9254e7ce16898a07 | 2024-07-02 22:23:08 +08:00 |
| hzhaoy | 2196448c21 | add TeleChat-1B<br>Former-commit-id: 1b81b43fc483a21e0c2985b98459ecf5137aa4c4 | 2024-07-02 17:49:04 +08:00 |
| hoshi-hiyouga | a715490c2a | Merge branch 'main' into main<br>Former-commit-id: 7be442f37d53a0c6324728fa1fa8e2c84d7f0fa5 | 2024-07-01 21:01:09 +08:00 |
| hiyouga | 42e7489713 | add Gemma2 models<br>Former-commit-id: 8fc5a248ecfd6861cb90dac6c14fe89cdeaf8921 | 2024-06-28 01:26:50 +08:00 |
| hiyouga | 4c89aca243 | update readme<br>Former-commit-id: a1477208471039d3578980f929f1ca8c2a07aa96 | 2024-06-24 18:22:12 +08:00 |
| ancv | 6c185a2c57 | move configure_packing to llamafactory.model.patcher and fix constants<br>Former-commit-id: 9c5e972c9c81957f2e9e30bf284ef1c076de9fd0 | 2024-06-21 00:45:06 +07:00 |
| hiyouga | 665df5d733 | add deepseek coder v2 #4346<br>Former-commit-id: d83d3846d8e3bf5c40d4b90c24e2c5909ec61864 | 2024-06-18 22:53:54 +08:00 |
| ancv | dd7a1dbfae | update packing with sdpa and eager attention mode<br>Former-commit-id: 285636ba3a57a1038b2f2fd4cf909a1ca07708d4 | 2024-06-16 02:25:47 +07:00 |
| hiyouga | 308abfec6c | add minicpm #4227<br>Former-commit-id: e1bb18ce60be9a1b203989def30f1b9194286325 | 2024-06-15 17:58:52 +08:00 |
| hiyouga | bb88536166 | add license<br>Former-commit-id: 69cfc98d7c81756a5ab6bf962240e393e449fef0 | 2024-06-15 17:54:33 +08:00 |
| hiyouga | 3f6b3eed98 | add resume args in webui<br>Former-commit-id: 1d86ad768b1f36e54b4c2a9f18f6ea5a7df04c90 | 2024-06-08 00:22:16 +08:00 |
| hiyouga | a4d335b42f | add qwen2 models<br>Former-commit-id: 49cb694d02c876e3740a003a8b332349f4310ad3 | 2024-06-07 00:22:57 +08:00 |
| hiyouga | 937f49ec3d | lora modules: all by default<br>Former-commit-id: 52c4ae87c7f4312704c31ef26b079b2c5b95ea5f | 2024-06-06 03:53:28 +08:00 |
| hiyouga | abc2a73a33 | add codestral 22B<br>Former-commit-id: b011c7f527a57cb1d21c4e2c9631c2fb62bb835e | 2024-06-06 03:42:50 +08:00 |
| hiyouga | 7528bc1bc0 | support glm-4<br>Former-commit-id: a10f4718fbf3f3c89dc7eb31cb8e1a46ca6adda5 | 2024-06-05 15:16:38 +08:00 |
| hiyouga | 87aa332583 | better llamaboard<br>* easily resume from checkpoint<br>* support full and freeze checkpoints<br>* faster ui<br>Former-commit-id: 84cfb2452cc86b037ccddee6e833f8eb7c129fa4 | 2024-05-29 23:55:38 +08:00 |
| hiyouga | 9a65820592 | update readme<br>Former-commit-id: 440e9de66986ef7736361ce8ec3e23ce68655a56 | 2024-05-29 18:39:11 +08:00 |
| hzhaoy | 29cb4a1327 | add TeleChat-12B/TeleChat-12B-v2 models<br>Former-commit-id: e0675385c88af03aaef8d51586c8a282829c4051 | 2024-05-29 15:00:37 +08:00 |
| hiyouga | 0206e7b9de | tiny fix<br>Former-commit-id: 4c47b3dcef9e400a1c35fce1ad53619a0a86fe81 | 2024-05-27 20:54:26 +08:00 |
| hoshi-hiyouga | a886544d3d | Merge pull request #3921 from gusye1234/main<br>Add openchat-3.6-8B support<br>Former-commit-id: 92e6bba3cab22b7835a68f787caf7992a398978e | 2024-05-27 20:52:37 +08:00 |
| Jianbai Ye | 0d9e364a90 | add openchat-3.6-8B support<br>Former-commit-id: b66f39d50d896d7597a1506e67ec210b31c9b700 | 2024-05-27 20:42:08 +08:00 |
| hiyouga | c43bc74fe6 | support Aya23<br>Former-commit-id: 071935b90006e2c79e39bb9ee0c5d48c6c910501 | 2024-05-27 20:23:24 +08:00 |
| hiyouga | 9670f5e41a | add phi-3 7b/14b, mistral v0.3 models<br>Former-commit-id: 86dab182f9710b063f518922ccb49b01aa71c576 | 2024-05-27 18:20:16 +08:00 |
| hiyouga | 97a23e1cbe | update readme<br>Former-commit-id: b8d0170fe0d094acce85dcb5f91775e4685ee055 | 2024-05-27 18:14:02 +08:00 |
| hiyouga | b0d9966663 | support SimPO #3900<br>Former-commit-id: 6b954ce60155cf8334150b795cfc4bb63ca74c8b | 2024-05-26 23:46:33 +08:00 |
| hiyouga | e0e8507108 | support paligemma<br>Former-commit-id: 11c27f9bf204d3d6a9ca5bd4f0a19a420160453f | 2024-05-21 00:01:22 +08:00 |
| hiyouga | b31d808655 | fix paligemma inference<br>Former-commit-id: 46357b7a677e8ba2e0a7c9d4ec1974abd061569c | 2024-05-20 23:36:43 +08:00 |
| hoshi-hiyouga | e4570e28a8 | Merge pull request #3785 from enji-zhou/feature/add_kto<br>add kto<br>Former-commit-id: f60faa23e23022fd855dac6b1ecbd21e095bccb5 | 2024-05-18 03:07:18 +08:00 |
| hiyouga | a32c3a50fc | add deepseek v2 lite model<br>Former-commit-id: 5e864e6b721d8b891b1cc2ca2dcac41babb9eaaf | 2024-05-17 13:25:36 +08:00 |
| enji.zhou | 66b5634ebf | add kto<br>Former-commit-id: ec51986cf70b0bdd79b8141e45916670fb97a08e | 2024-05-17 13:09:17 +08:00 |
| hiyouga | 6481321470 | add falcon 11b<br>Former-commit-id: 897acc725edc204fad393cc9616828431b4fa768 | 2024-05-17 00:08:33 +08:00 |
| hiyouga | dfa686b617 | rename package<br>Former-commit-id: a07ff0c083558cfe6f474d13027642d3052fee08 | 2024-05-16 18:39:08 +08:00 |