hoshi-hiyouga
|
93cc1f167b
|
Merge pull request #6600 from hiyouga/hiyouga/refactor_mllm_param
[model] refactor mllm param logic
Former-commit-id: 382e932228d1bcfcdee0a25ee3f1977226f1c433
|
2025-01-10 23:53:37 +08:00 |
|
hiyouga
|
c89d17ab63
|
refactor mllm param logic
Former-commit-id: f6f630a1c96514053176abb12e35a06242e62abd
|
2025-01-10 15:45:48 +00:00 |
|
fzc8578
|
9213e48fa2
|
add minicpmv2.6
Former-commit-id: e45329e7456b647d5684b1f9428641ad18af92d1
|
2025-01-10 23:45:44 +08:00 |
|
fzc8578
|
0fb50f9c88
|
add some
Former-commit-id: 771cc802941cf1953b32e5102c817c6a3090b5ce
|
2025-01-10 23:29:06 +08:00 |
|
fzc8578
|
bcbe37ff52
|
add some
Former-commit-id: ae1f528df31194fe37a123ba1e5a4cd263a61602
|
2025-01-10 21:25:32 +08:00 |
|
fzc8578
|
994049380d
|
fix some
Former-commit-id: 15bbcdf8d3265f4154d3937719da5e54a5963355
|
2025-01-10 20:55:52 +08:00 |
|
fzc8578
|
cc6a6f698f
|
fix version
Former-commit-id: d09032049c1f24336a1899908bf47a98e77b3211
|
2025-01-10 20:31:04 +08:00 |
|
fzc8578
|
7138b43873
|
fix some
Former-commit-id: 2ee8ba2f390551af1b865cfa813f5c8b7bbb41c5
|
2025-01-10 20:27:06 +08:00 |
|
fzc8578
|
aeb4f82ef2
|
tiny fix
Former-commit-id: 84026be06e34239a828a0cc8b1706084afcfa4ea
|
2025-01-10 20:15:39 +08:00 |
|
Zhangchi Feng
|
f51ac40f0a
|
Merge branch 'main' into minicpmv
Former-commit-id: fc045d7dd871985d621430b5662cba882188a59c
|
2025-01-10 20:12:07 +08:00 |
|
fzc8578
|
165fe8e219
|
add some
Former-commit-id: 096a6cb67a7dfd14a6e339d96baab78c12d36a87
|
2025-01-10 20:01:22 +08:00 |
|
hoshi-hiyouga
|
4243c618f0
|
Merge pull request #6597 from hiyouga/hiyouga/upd_wechat
[assets] update wechat
Former-commit-id: b308ddf0971606f0f8f39e26f5711852abad3e79
|
2025-01-10 18:41:47 +08:00 |
|
hiyouga
|
368d22f79a
|
update wechat
Former-commit-id: 70ed03b288c1853f262e47b06e8601eaf49ccc1b
|
2025-01-10 10:40:25 +00:00 |
|
hoshi-hiyouga
|
b3561ae552
|
Merge pull request #6588 from hiyouga/hiyouga/upd_issue_temp
[gh] update issue template
Former-commit-id: 5ffd8ad192bb3932fbe230757d4bf1c907ca3aa4
|
2025-01-10 03:03:48 +08:00 |
|
hiyouga
|
b395540826
|
update issue template
Former-commit-id: aa8d0a223b0345e1f665b6703678c0ce526ff950
|
2025-01-09 18:58:53 +00:00 |
|
hoshi-hiyouga
|
a1b5644889
|
Merge pull request #6585 from hiyouga/hiyouga/add_phi4
[model] add phi4 model
Former-commit-id: 8b209cb49d9cc6058ce61c97bf2216f6371c5f7c
|
2025-01-10 02:39:17 +08:00 |
|
hiyouga
|
b471def13d
|
improve template, add phi4 model
Former-commit-id: ae16ea755d581a5a288fb55f12481215f369b255
|
2025-01-09 18:27:54 +00:00 |
|
hoshi-hiyouga
|
b777fed171
|
Merge pull request #6564 from stephen-nju/fix_ray
Fix ray
Former-commit-id: 6b34b69fa688c4622489d3d5f33d847fb6b95528
|
2025-01-08 18:14:18 +08:00 |
|
hoshi-hiyouga
|
618ceda6e9
|
Merge pull request #6565 from hiyouga/hiyouga/improve_log
[misc] imporve log
Former-commit-id: 18431527bac8da57d9a2fc014695e5891f7a3068
|
2025-01-08 18:08:21 +08:00 |
|
zhubin
|
014a7ea042
|
fix get ray args when args not a dict
Former-commit-id: 9c4c84828b77acf48caf60726e4e7ef3e972118d
|
2025-01-08 10:06:02 +00:00 |
|
hiyouga
|
da542fad18
|
imporve log
Former-commit-id: 47e17dd689840ca9b3c5f34448e5f80265336cca
|
2025-01-08 09:56:10 +00:00 |
|
hoshi-hiyouga
|
984b202f83
|
Merge pull request #6542 from erictang000/et/ray-integration
Ray Train integration with LLaMA-Factory
Former-commit-id: d23a98825bcb569bc51e21a3c2236eccd2f6d2fd
|
2025-01-08 11:46:03 +08:00 |
|
hiyouga
|
0c1ad5f3fb
|
fix llamaboard with ray
Former-commit-id: c46675d5e56d175c27d705ef0068fb47dc89a872
|
2025-01-07 09:59:24 +00:00 |
|
hiyouga
|
b4174021d6
|
refactor ray integration, support save ckpt
Former-commit-id: d8cac6f54663e6cffeddf2c65e3da454e7b86a75
|
2025-01-07 09:39:10 +00:00 |
|
Eric Tang
|
bba52e258e
|
run style check
Former-commit-id: 1e8e7be0a535e55888f58bbe2c38bc1c382e9012
|
2025-01-07 08:55:44 +00:00 |
|
Kourosh Hakhamaneshi
|
1217240918
|
drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
Former-commit-id: 163ddb680b6f84a4424a887a3b8a5d668044e87c
|
2025-01-07 08:55:44 +00:00 |
|
hoshi-hiyouga
|
a0bcac80c0
|
Merge pull request #6547 from hiyouga/hiyouga/fix_pixtral_dpo
[trainer] fix pixtral dpo
Former-commit-id: c973f32849b979a3ebb80caa01029b43fbb620ac
|
2025-01-07 14:38:55 +08:00 |
|
hiyouga
|
8c57169eb7
|
fix #6546
Former-commit-id: 870f23d7eaff1e32a73fee4eb972163c85ba7b67
|
2025-01-07 06:30:44 +00:00 |
|
fzc8578
|
b9eeaa9706
|
add some
Former-commit-id: 785cc70ff205f5962c3ca67f453589e4a471ba8c
|
2025-01-06 19:32:39 +08:00 |
|
hoshi-hiyouga
|
621d73e87c
|
Merge pull request #6528 from hiyouga/hiyouga/upd_wechat
[assets] update wechat
Former-commit-id: b832ed9a60a5fd0bc7d9f975bb881a71e7d35245
|
2025-01-04 16:01:21 +08:00 |
|
hiyouga
|
a02a140840
|
update wechat
Former-commit-id: cd1433650653810f7934c65cb1de91052eb73dcf
|
2025-01-04 07:59:57 +00:00 |
|
Zhangchi Feng
|
a0188a430f
|
Merge branch 'hiyouga:main' into minicpmv
Former-commit-id: ab87bd6b1398b379b1a7a95f01a6539743b9db2d
|
2025-01-04 11:20:33 +08:00 |
|
fzc8578
|
b5ef5059ee
|
add some
Former-commit-id: 79c2d7090cbf364063ea3608814ab18aa27fdc87
|
2025-01-04 11:11:15 +08:00 |
|
hoshi-hiyouga
|
084d356c2c
|
Merge pull request #6524 from hiyouga/hiyouga/upd_scripts
[misc] update scripts
Former-commit-id: e6d603ac374c04df354361f9617173afa8c1edae
|
2025-01-03 23:52:26 +08:00 |
|
hiyouga
|
20a9565e36
|
update scripts
Former-commit-id: dd44c65d7f60cb6f5d0e0d8ee5f4e7643defb89b
|
2025-01-03 10:50:32 +00:00 |
|
hoshi-hiyouga
|
85317bcbaf
|
Merge pull request #6515 from hiyouga/hiyouga/misc
[misc] update model name
Former-commit-id: 51ef90ce0ace4a45f9c01ba7e674adf5e3c92baa
|
2025-01-02 20:20:02 +08:00 |
|
hiyouga
|
528fb4f799
|
update model name
Former-commit-id: 4b8add728729d8e2ce4c9a3dc6748357291d8e8b
|
2025-01-02 12:19:21 +00:00 |
|
hoshi-hiyouga
|
aa7ec44367
|
Merge pull request #6514 from hiyouga/hiyouga/add_project
[readme] add project
Former-commit-id: a766cad5d49f226eb61a550bc3d157870c1068cc
|
2025-01-02 20:16:15 +08:00 |
|
hoshi-hiyouga
|
b2ecb80729
|
Merge pull request #6513 from hiyouga/hiyouga/add_gpt2
[model] add gpt2 model
Former-commit-id: 29ddc6b77862f740570a00d3b8ea548ee1a2ce03
|
2025-01-02 20:15:55 +08:00 |
|
hiyouga
|
9a3afbd5d1
|
add project
Former-commit-id: b3e1137fbbdfa4cc081903983fea36acff7afd75
|
2025-01-02 12:15:41 +00:00 |
|
hiyouga
|
37c60c7d14
|
add gpt2 model
Former-commit-id: 67442bd497c75b0c5990d94a880e0e25474ae2fa
|
2025-01-02 12:07:38 +00:00 |
|
hoshi-hiyouga
|
b921dde749
|
Merge pull request #6512 from hiyouga/hiyouga/fix_gen_logic
[trainer] fix generate logic
Former-commit-id: 72d86ecc9e327933a0a2c893b8ffd2740c99be6b
|
2025-01-02 19:36:54 +08:00 |
|
hoshi-hiyouga
|
d195329185
|
Merge pull request #6462 from shibingli/main
Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building
Former-commit-id: 8741e5b3e87a392a3c9d50455e4916c3a938fb24
|
2025-01-02 19:34:17 +08:00 |
|
hiyouga
|
da8721a70e
|
fix #6499
Former-commit-id: 1800f8c72dfa618c71c84a3a18ecdef4d82754f7
|
2025-01-02 11:28:54 +00:00 |
|
hoshi-hiyouga
|
f318dc9464
|
Merge pull request #6493 from hiyouga/hiyouga/upd_wechat
[assets] update wechat
Former-commit-id: f8e80d566f7666b6af00360df97065698a1d3a9f
|
2024-12-30 21:55:03 +08:00 |
|
hiyouga
|
01bbe66f41
|
update wechat
Former-commit-id: a400d896a18e317acdbd3c79282c81b50cc2c54d
|
2024-12-30 13:54:22 +00:00 |
|
hoshi-hiyouga
|
bb664d2fc5
|
Merge pull request #6492 from hiyouga/hiyouga/add_deepseek3
[model] add deepseek3 model
Former-commit-id: 2382a5f0317d768ba8f4931977f5caed6057b3c0
|
2024-12-30 21:50:13 +08:00 |
|
hiyouga
|
d0e729cd33
|
add deepseek3 model
Former-commit-id: e67b9dcc3ad0c003bc3afd7601ecd2adfbf9666b
|
2024-12-30 13:39:20 +00:00 |
|
hoshi-hiyouga
|
1178cb0e33
|
Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template
Former-commit-id: 91467ed313802ac3950c2e11a7d0997a36bcbddd
|
2024-12-30 21:08:25 +08:00 |
|
hoshi-hiyouga
|
089f824cd1
|
Merge pull request #6483 from hiyouga/hiyouga/fix_paligemma_infer
[model] update vllm & fix paligemma dtype
Former-commit-id: 40805b0cc0cff478703f68067a330ba307bb5809
|
2024-12-30 16:34:32 +08:00 |
|