| Ricardo | d2bb1c2041 | _is_bf16_available judgment supports npu; Former-commit-id: 50a1e892a1005b4cdd82dca1005f71db08ed89a2 | 2024-08-16 02:58:22 +00:00 |
| Zxilly | b31a2da778 | fix: report correct device count for intel xpu; Former-commit-id: 0618f660b6511599365bd9be64499dbab41a79ba | 2024-08-15 08:30:43 +00:00 |
| hiyouga | e1352c5e11 | add qwen2 math models; Former-commit-id: 72ff43a1772c9de5ff914d5e1c8bdc8dea9ae0c8 | 2024-08-09 20:20:35 +08:00 |
| hiyouga | 019a932b2f | fix #5048; Former-commit-id: 71a6861667ae68c1fd6a69acf68e1359b858cf1b | 2024-08-05 23:48:19 +08:00 |
| codingma | af9eda3e4d | support gemma-2-2b; Former-commit-id: 7037192cf6049fd7d675aed4a6237ed929c6b170 | 2024-08-01 13:45:48 +08:00 |
| hiyouga | 15c845dc5b | add mistral nemo model; Former-commit-id: 428bb49f53b32947bc0a62ca19ab10844154c07c | 2024-07-24 16:25:53 +08:00 |
| hiyouga | aae189fada | add llama3.1; Former-commit-id: 3c433890f9b61c520572f5233aae70584da0f330 | 2024-07-24 16:20:11 +08:00 |
| hiyouga | eacf4ecfb0 | set dev version; Former-commit-id: 0b9a2275dc533b65578278f979ce053e95a644b3 | 2024-07-19 02:01:46 +08:00 |
| hiyouga | 1b48a6c4f8 | release v0.8.3; Former-commit-id: 7180a3b99c3c218dfb0dc607ad5e87219269a678 | 2024-07-19 01:21:18 +08:00 |
| hiyouga | 746e9b352e | support batch_eval_metrics, fix #4826; Former-commit-id: 3fe1df17188825f8a32fbe6a1294b4b532ce0c85 | 2024-07-17 00:33:00 +08:00 |
| hoshi-hiyouga | 2e11c6ecdc | Update packages.py; Former-commit-id: c61ee780f3aed51c31a81e912f25fbfd11dc7edd | 2024-07-07 15:48:29 +08:00 |
| Lian Junhong | 25e086e02d | chore: Update vllm_engine.py to support vllm version >= 0.5.1; Former-commit-id: b73c23a88cef237db626a16ab2a30261afd36564 | 2024-07-07 15:08:12 +08:00 |
| hiyouga | 1fe104fd2c | add codegeex4, internlm2.5; Former-commit-id: 349a5fbc934ac289cad44b4e3eb16f458b94710c | 2024-07-06 16:16:47 +08:00 |
| hiyouga | a0df8be4e8 | fix packing for eager/sdpa attn; Former-commit-id: 735a033ceb7f2da6da71d138ea091d8a665411a9 | 2024-07-04 01:52:43 +08:00 |
| hoshi-hiyouga | 9dcdaee09c | Merge pull request #4224 from chuan298/main; Implement efficient packing without cross-contamination attention; Former-commit-id: ac382cc9fe4ec483658fd54f07f9a123788ce1b1 | 2024-07-04 01:18:54 +08:00 |
| hiyouga | bd294e7cc3 | update packing; Former-commit-id: f3d9c31efa0e64317bdd5b4ed6f78653cf3b5ba4 | 2024-07-04 01:10:55 +08:00 |
| hiyouga | 7c08a4a82a | update arg name; Former-commit-id: 1509ed550b2060f946ce20e3c5a9e5c49e86e3ab | 2024-07-03 23:23:24 +08:00 |
| hiyouga | ca106d1f1b | improve rlhf; Former-commit-id: e441780e3db256ca09a442ea9254e7ce16898a07 | 2024-07-02 22:23:08 +08:00 |
| hzhaoy | 1df3f02aca | add TeleChat-1B; Former-commit-id: 1b81b43fc483a21e0c2985b98459ecf5137aa4c4 | 2024-07-02 17:49:04 +08:00 |
| hoshi-hiyouga | 9174675ba9 | Merge branch 'main' into main; Former-commit-id: 7be442f37d53a0c6324728fa1fa8e2c84d7f0fa5 | 2024-07-01 21:01:09 +08:00 |
| hiyouga | 35c65ddf8c | fix #4398 #4592; Former-commit-id: 8c92d268903c00392c8bd75a731daa1f107d6202 | 2024-06-30 21:28:51 +08:00 |
| hiyouga | 81094dc09a | add Gemma2 models; Former-commit-id: 8fc5a248ecfd6861cb90dac6c14fe89cdeaf8921 | 2024-06-28 01:26:50 +08:00 |
| hiyouga | 884a4a33ee | refactor pissa, improve llamaboard; Former-commit-id: 619556e46c19718f702c97df5d570a2a4c5fb13a | 2024-06-28 01:04:24 +08:00 |
| hiyouga | 28c2c7fba5 | support HQQ/EETQ #4113; Former-commit-id: b7cb51ddb394f04fe4646b2c297fc8d918c9979e | 2024-06-27 00:29:42 +08:00 |
| hiyouga | 83ee461b9a | update readme; Former-commit-id: a1477208471039d3578980f929f1ca8c2a07aa96 | 2024-06-24 18:22:12 +08:00 |
| ancv | 4d345f7901 | move configure_packing to llamafactory.model.patcher and fix constants; Former-commit-id: 9c5e972c9c81957f2e9e30bf284ef1c076de9fd0 | 2024-06-21 00:45:06 +07:00 |
| hiyouga | fadad08706 | set dev version; Former-commit-id: 221665345d97f839ce4ba8d54643da30c71b6083 | 2024-06-19 21:08:16 +08:00 |
| hiyouga | 741c0b7566 | release v0.8.2; Former-commit-id: 3050bbe51d46acd8473275d2713fc28932e4a3d3 | 2024-06-19 20:42:09 +08:00 |
| hiyouga | f312b7db06 | add deepseek coder v2 #4346; Former-commit-id: d83d3846d8e3bf5c40d4b90c24e2c5909ec61864 | 2024-06-18 22:53:54 +08:00 |
| ancv | 84e1f06e45 | update packing with sdpa and eager attention mode; Former-commit-id: 285636ba3a57a1038b2f2fd4cf909a1ca07708d4 | 2024-06-16 02:25:47 +07:00 |
| hiyouga | 640372cb66 | tiny fix; Former-commit-id: f7f440986b0ae3b38ea9f2da80789629d4f79ea1 | 2024-06-16 01:06:41 +08:00 |
| hiyouga | 4851ef85b7 | add tests; Former-commit-id: 484634ee9c982e82e919ff67d507e0210345182d | 2024-06-15 19:51:20 +08:00 |
| hiyouga | 61aaab22c9 | add minicpm #4227; Former-commit-id: e1bb18ce60be9a1b203989def30f1b9194286325 | 2024-06-15 17:58:52 +08:00 |
| hiyouga | acfae2e677 | add license; Former-commit-id: 69cfc98d7c81756a5ab6bf962240e393e449fef0 | 2024-06-15 17:54:33 +08:00 |
| hiyouga | 344d1192ac | clean code; Former-commit-id: f54cafd5c7f0383370d1a2f357834a61a97397ce | 2024-06-13 01:58:16 +08:00 |
| hiyouga | e540759f4f | set dev version; Former-commit-id: 16c47cc15226119e33e46ba0f2f6ccb37072257f | 2024-06-11 00:50:53 +08:00 |
| hiyouga | 41eadf5459 | release v0.8.1; Former-commit-id: 875a34f492701d1c644facbe9ede411af2931513 | 2024-06-11 00:44:26 +08:00 |
| hiyouga | a2acefea6e | fix llamafactory-cli env; Former-commit-id: b0515e5f42831b67d1f4d049999ecb68756e66db | 2024-06-08 07:15:45 +08:00 |
| hiyouga | 088292e84a | set dev version; Former-commit-id: 08b7fe1c452cc99264ff0312e310b579590c6a45 | 2024-06-08 06:46:09 +08:00 |
| hiyouga | cabe5ca7d0 | release v0.8.0; Former-commit-id: 004db680b9e3996ec511ee818df6c0c02bf13603 | 2024-06-08 05:20:54 +08:00 |
| hiyouga | 5606780ab6 | add resume args in webui; Former-commit-id: 1d86ad768b1f36e54b4c2a9f18f6ea5a7df04c90 | 2024-06-08 00:22:16 +08:00 |
| hiyouga | 8cc3bbdc62 | fix #4120; Former-commit-id: 2a44da678a5e360a9c0f9056397ac9e801329321 | 2024-06-07 04:18:05 +08:00 |
| hiyouga | 093abed7cc | add qwen2 models; Former-commit-id: 49cb694d02c876e3740a003a8b332349f4310ad3 | 2024-06-07 00:22:57 +08:00 |
| hiyouga | d3a378ffea | fix torch gc; Former-commit-id: e173799d057598e5692a407601c30d8ce1513461 | 2024-06-06 20:30:25 +08:00 |
| hiyouga | 990dd6d44c | lora modules: all by default; Former-commit-id: 52c4ae87c7f4312704c31ef26b079b2c5b95ea5f | 2024-06-06 03:53:28 +08:00 |
| hiyouga | 8d9f3022d2 | add codestral 22B; Former-commit-id: b011c7f527a57cb1d21c4e2c9631c2fb62bb835e | 2024-06-06 03:42:50 +08:00 |
| hiyouga | e9f9b1f250 | lint; Former-commit-id: 9030501eaef97ea249347198272adf0d709503ec | 2024-06-06 03:33:44 +08:00 |
| hoshi-hiyouga | fbc1168294 | Merge pull request #4066 from injet-zhou/main; add throughput entry to training log; Former-commit-id: d2816f343f405f3fab09f2a8eade774b886e8f92 | 2024-06-06 03:32:04 +08:00 |
| hiyouga | 0b671615d0 | update train hparams; Former-commit-id: 1ca9fce55b55bf209f4b76152b586731932a3f39 | 2024-06-06 01:49:20 +08:00 |
| hiyouga | 1935f4a1e0 | add llamafactory-cli env; Former-commit-id: 1df077184845ff5f394b9324d46f8c382869e590 | 2024-06-06 01:28:14 +08:00 |