hoshi-hiyouga
|
7d4cb79822
|
add modelscope models
Former-commit-id: 4de3081eea9cede78a1f2db65cf22a5731c54447
|
2024-09-26 11:22:48 +08:00 |
|
marko1616
|
b867e164fe
|
Chore: Support llama3.2.
Former-commit-id: 2741ac784c1a776bd545fa6dffc07b6346273519
|
2024-09-25 16:08:44 -04:00 |
|
hoshi-hiyouga
|
0118a2fc04
|
add qwen2.5 models
Former-commit-id: 408a7d7b2e1a2316cbeefade872b732c88191b75
|
2024-09-19 02:07:54 +08:00 |
|
hiyouga
|
f7e85cd7de
|
set dev version
Former-commit-id: 39edf597f050bcb2099a10d6f6018f96e29b7e65
|
2024-09-11 18:56:37 +08:00 |
|
hiyouga
|
588ea95732
|
update accelerate ver for schedule_free optimizers
Former-commit-id: 2de74e79049ce8e50f605f649275b1dbfb899c8c
|
2024-09-09 22:51:08 +08:00 |
|
hiyouga
|
dfff411e1a
|
release v0.9.0 (real)
Former-commit-id: 8ff781c8ae5654680f738f69a6db9d7b95d76baf
|
2024-09-09 01:00:25 +08:00 |
|
hiyouga
|
e20baa4218
|
fix constants
Former-commit-id: fce6671d2764d7a2b77c44401fc5582c7cbb77aa
|
2024-09-08 23:52:30 +08:00 |
|
hiyouga
|
d1ab9b501a
|
release v0.9.0
Former-commit-id: 594c450f648ad326ef39c0f4d70d67cda5f36159
|
2024-09-08 23:43:35 +08:00 |
|
hiyouga
|
eb5af3d90b
|
support vllm 0.6.0
Former-commit-id: e39470ec51a9c74ad901871eb816df10e851f351
|
2024-09-08 02:26:20 +08:00 |
|
hiyouga
|
7f71276ad8
|
add docstrings, refactor logger
Former-commit-id: c34e489d71f8f539028543ccf8ee92cecedd6276
|
2024-09-08 00:56:56 +08:00 |
|
hoshi-hiyouga
|
d6ce902d80
|
Merge pull request #5372 from LDLINGLINGLING/main
Added support for MiniCPM 3.0
Former-commit-id: 2e3c221d9c87bd59f48648be8878b7b50347280f
|
2024-09-05 21:35:42 +08:00 |
|
liudan
|
ce6dcf3600
|
Revised the code to follow the coding conventions
Former-commit-id: fe5351980b42e0e38175b0da2401a61b3807fa7c
|
2024-09-05 20:17:55 +08:00 |
|
hiyouga
|
72222d1598
|
support Yi-Coder models
Former-commit-id: ea3f1659e70541c4fa8b7079a0a8c94fce9a41c8
|
2024-09-05 03:12:24 +08:00 |
|
hiyouga
|
1874d579c5
|
video datasets
Former-commit-id: 33f28ce82d9e44d2615909250dc56d6a4a03cd99
|
2024-09-05 02:04:17 +08:00 |
|
liudan
|
c692339020
|
Added support for MiniCPM 3.0
Former-commit-id: 4ad3a761af2452ef3f6c61190b7e47c9ea5227b9
|
2024-09-04 23:10:05 +08:00 |
|
hiyouga
|
7e4c5d4bb3
|
fix mixed mm inputs and rlhf-v
Former-commit-id: 7c248fac20bf85d57a91132ce7a793c7f84e9218
|
2024-09-01 20:52:47 +08:00 |
|
hiyouga
|
c62a6ca59d
|
refactor mm training
Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
7c6785d3df
|
fix #5295
Former-commit-id: c76873b0eb8225f6e6bfc7223c6012387dceb8ed
|
2024-08-29 20:30:18 +08:00 |
|
hiyouga
|
ca5a759f94
|
tiny fix
Former-commit-id: d2cede7023bbe28525ef8b4ad27247445d8c22e5
|
2024-08-27 12:49:32 +08:00 |
|
hiyouga
|
d111a324bc
|
tiny fix
Former-commit-id: 23961bdf6fdbcde64e7b943f699fdeb4ac024043
|
2024-08-20 00:10:52 +08:00 |
|
hoshi-hiyouga
|
525747b472
|
Merge pull request #5188 from Zxilly/main
fix: report correct device count for intel xpu
Former-commit-id: cd3c536cb3936061d905256850b0e57df4498010
|
2024-08-19 23:51:39 +08:00 |
|
Ricardo
|
57d4c4a4f8
|
_is_bf16_available judgment supports npu
Former-commit-id: 50a1e892a1005b4cdd82dca1005f71db08ed89a2
|
2024-08-16 02:58:22 +00:00 |
|
Zxilly
|
3595d26846
|
fix: report correct device count for intel xpu
Former-commit-id: 0618f660b6511599365bd9be64499dbab41a79ba
|
2024-08-15 08:30:43 +00:00 |
|
hiyouga
|
7fb61bad04
|
add qwen2 math models
Former-commit-id: 72ff43a1772c9de5ff914d5e1c8bdc8dea9ae0c8
|
2024-08-09 20:20:35 +08:00 |
|
hiyouga
|
13093963b1
|
fix #5048
Former-commit-id: 71a6861667ae68c1fd6a69acf68e1359b858cf1b
|
2024-08-05 23:48:19 +08:00 |
|
codingma
|
4b6252151e
|
support gemma-2-2b
Former-commit-id: 7037192cf6049fd7d675aed4a6237ed929c6b170
|
2024-08-01 13:45:48 +08:00 |
|
hiyouga
|
5c6d88e91c
|
add mistral nemo model
Former-commit-id: 428bb49f53b32947bc0a62ca19ab10844154c07c
|
2024-07-24 16:25:53 +08:00 |
|
hiyouga
|
0a04d9470f
|
add llama3.1
Former-commit-id: 3c433890f9b61c520572f5233aae70584da0f330
|
2024-07-24 16:20:11 +08:00 |
|
hiyouga
|
adff3e5050
|
set dev version
Former-commit-id: 0b9a2275dc533b65578278f979ce053e95a644b3
|
2024-07-19 02:01:46 +08:00 |
|
hiyouga
|
3fff875f99
|
release v0.8.3
Former-commit-id: 7180a3b99c3c218dfb0dc607ad5e87219269a678
|
2024-07-19 01:21:18 +08:00 |
|
hiyouga
|
8c93921952
|
support batch_eval_metrics, fix #4826
Former-commit-id: 3fe1df17188825f8a32fbe6a1294b4b532ce0c85
|
2024-07-17 00:33:00 +08:00 |
|
hoshi-hiyouga
|
00b93d8b2f
|
Update packages.py
Former-commit-id: c61ee780f3aed51c31a81e912f25fbfd11dc7edd
|
2024-07-07 15:48:29 +08:00 |
|
Lian Junhong
|
281fd5bb89
|
chore: Update vllm_engine.py to support vllm version >= 0.5.1
Former-commit-id: b73c23a88cef237db626a16ab2a30261afd36564
|
2024-07-07 15:08:12 +08:00 |
|
hiyouga
|
0d6ec70c6f
|
add codegeex4, internlm2.5
Former-commit-id: 349a5fbc934ac289cad44b4e3eb16f458b94710c
|
2024-07-06 16:16:47 +08:00 |
|
hiyouga
|
3d219b91b9
|
fix packing for eager/sdpa attn
Former-commit-id: 735a033ceb7f2da6da71d138ea091d8a665411a9
|
2024-07-04 01:52:43 +08:00 |
|
hoshi-hiyouga
|
a90c6306f8
|
Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention
Former-commit-id: ac382cc9fe4ec483658fd54f07f9a123788ce1b1
|
2024-07-04 01:18:54 +08:00 |
|
hiyouga
|
60558388ec
|
update packing
Former-commit-id: f3d9c31efa0e64317bdd5b4ed6f78653cf3b5ba4
|
2024-07-04 01:10:55 +08:00 |
|
hiyouga
|
1408aa078d
|
update arg name
Former-commit-id: 1509ed550b2060f946ce20e3c5a9e5c49e86e3ab
|
2024-07-03 23:23:24 +08:00 |
|
hiyouga
|
e6ba7ef3e6
|
improve rlhf
Former-commit-id: e441780e3db256ca09a442ea9254e7ce16898a07
|
2024-07-02 22:23:08 +08:00 |
|
hzhaoy
|
2196448c21
|
add TeleChat-1B
Former-commit-id: 1b81b43fc483a21e0c2985b98459ecf5137aa4c4
|
2024-07-02 17:49:04 +08:00 |
|
hoshi-hiyouga
|
a715490c2a
|
Merge branch 'main' into main
Former-commit-id: 7be442f37d53a0c6324728fa1fa8e2c84d7f0fa5
|
2024-07-01 21:01:09 +08:00 |
|
hiyouga
|
188b4be64d
|
fix #4398 #4592
Former-commit-id: 8c92d268903c00392c8bd75a731daa1f107d6202
|
2024-06-30 21:28:51 +08:00 |
|
hiyouga
|
42e7489713
|
add Gemma2 models
Former-commit-id: 8fc5a248ecfd6861cb90dac6c14fe89cdeaf8921
|
2024-06-28 01:26:50 +08:00 |
|
hiyouga
|
46f0189e88
|
refactor pissa, improve llamaboard
Former-commit-id: 619556e46c19718f702c97df5d570a2a4c5fb13a
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
8aaf1185a5
|
support HQQ/EETQ #4113
Former-commit-id: b7cb51ddb394f04fe4646b2c297fc8d918c9979e
|
2024-06-27 00:29:42 +08:00 |
|
hiyouga
|
4c89aca243
|
update readme
Former-commit-id: a1477208471039d3578980f929f1ca8c2a07aa96
|
2024-06-24 18:22:12 +08:00 |
|
ancv
|
6c185a2c57
|
move configure_packing to llamafactory.model.patcher and fix constants
Former-commit-id: 9c5e972c9c81957f2e9e30bf284ef1c076de9fd0
|
2024-06-21 00:45:06 +07:00 |
|
hiyouga
|
a7d7f79855
|
set dev version
Former-commit-id: 221665345d97f839ce4ba8d54643da30c71b6083
|
2024-06-19 21:08:16 +08:00 |
|
hiyouga
|
b631bdc5b7
|
release v0.8.2
Former-commit-id: 3050bbe51d46acd8473275d2713fc28932e4a3d3
|
2024-06-19 20:42:09 +08:00 |
|
hiyouga
|
665df5d733
|
add deepseek coder v2 #4346
Former-commit-id: d83d3846d8e3bf5c40d4b90c24e2c5909ec61864
|
2024-06-18 22:53:54 +08:00 |
|