Commit Graph

2613 Commits

Author SHA1 Message Date
hiyouga
f6f630a1c9 refactor mllm param logic 2025-01-10 15:45:48 +00:00
fzc8578
e45329e745 add minicpmv2.6 2025-01-10 23:45:44 +08:00
fzc8578
771cc80294 add some 2025-01-10 23:29:06 +08:00
fzc8578
ae1f528df3 add some 2025-01-10 21:25:32 +08:00
fzc8578
15bbcdf8d3 fix some 2025-01-10 20:55:52 +08:00
fzc8578
d09032049c fix version 2025-01-10 20:31:04 +08:00
fzc8578
2ee8ba2f39 fix some 2025-01-10 20:27:06 +08:00
fzc8578
84026be06e tiny fix 2025-01-10 20:15:39 +08:00
Zhangchi Feng
fc045d7dd8 Merge branch 'main' into minicpmv 2025-01-10 20:12:07 +08:00
fzc8578
096a6cb67a add some 2025-01-10 20:01:22 +08:00
hoshi-hiyouga
b308ddf097 Merge pull request #6597 from hiyouga/hiyouga/upd_wechat
[assets] update wechat
2025-01-10 18:41:47 +08:00
hiyouga
70ed03b288 update wechat 2025-01-10 10:40:25 +00:00
hoshi-hiyouga
5ffd8ad192 Merge pull request #6588 from hiyouga/hiyouga/upd_issue_temp
[gh] update issue template
2025-01-10 03:03:48 +08:00
hiyouga
aa8d0a223b update issue template 2025-01-09 18:58:53 +00:00
hoshi-hiyouga
8b209cb49d Merge pull request #6585 from hiyouga/hiyouga/add_phi4
[model] add phi4 model
2025-01-10 02:39:17 +08:00
hiyouga
ae16ea755d improve template, add phi4 model 2025-01-09 18:27:54 +00:00
hoshi-hiyouga
6b34b69fa6 Merge pull request #6564 from stephen-nju/fix_ray
Fix ray
2025-01-08 18:14:18 +08:00
hoshi-hiyouga
18431527ba Merge pull request #6565 from hiyouga/hiyouga/improve_log
[misc] imporve log
2025-01-08 18:08:21 +08:00
zhubin
9c4c84828b fix –get ray args when args not a dict 2025-01-08 10:06:02 +00:00
hiyouga
47e17dd689 imporve log 2025-01-08 09:56:10 +00:00
hoshi-hiyouga
d23a98825b Merge pull request #6542 from erictang000/et/ray-integration
Ray Train integration with LLaMA-Factory
2025-01-08 11:46:03 +08:00
hiyouga
c46675d5e5 fix llamaboard with ray 2025-01-07 09:59:24 +00:00
hiyouga
d8cac6f546 refactor ray integration, support save ckpt 2025-01-07 09:39:10 +00:00
Eric Tang
1e8e7be0a5 run style check 2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
163ddb680b drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
2025-01-07 08:55:44 +00:00
hoshi-hiyouga
c973f32849 Merge pull request #6547 from hiyouga/hiyouga/fix_pixtral_dpo
[trainer] fix pixtral dpo
2025-01-07 14:38:55 +08:00
hiyouga
870f23d7ea fix #6546 2025-01-07 06:30:44 +00:00
fzc8578
785cc70ff2 add some 2025-01-06 19:32:39 +08:00
hoshi-hiyouga
b832ed9a60 Merge pull request #6528 from hiyouga/hiyouga/upd_wechat
[assets] update wechat
2025-01-04 16:01:21 +08:00
hiyouga
cd14336506 update wechat 2025-01-04 07:59:57 +00:00
Zhangchi Feng
ab87bd6b13 Merge branch 'hiyouga:main' into minicpmv 2025-01-04 11:20:33 +08:00
fzc8578
79c2d7090c add some 2025-01-04 11:11:15 +08:00
hoshi-hiyouga
e6d603ac37 Merge pull request #6524 from hiyouga/hiyouga/upd_scripts
[misc] update scripts
2025-01-03 23:52:26 +08:00
hiyouga
dd44c65d7f update scripts 2025-01-03 10:50:32 +00:00
hoshi-hiyouga
51ef90ce0a Merge pull request #6515 from hiyouga/hiyouga/misc
[misc] update model name
2025-01-02 20:20:02 +08:00
hiyouga
4b8add7287 update model name 2025-01-02 12:19:21 +00:00
hoshi-hiyouga
a766cad5d4 Merge pull request #6514 from hiyouga/hiyouga/add_project
[readme] add project
2025-01-02 20:16:15 +08:00
hoshi-hiyouga
29ddc6b778 Merge pull request #6513 from hiyouga/hiyouga/add_gpt2
[model] add gpt2 model
2025-01-02 20:15:55 +08:00
hiyouga
b3e1137fbb add project 2025-01-02 12:15:41 +00:00
hiyouga
67442bd497 add gpt2 model 2025-01-02 12:07:38 +00:00
hoshi-hiyouga
72d86ecc9e Merge pull request #6512 from hiyouga/hiyouga/fix_gen_logic
[trainer] fix generate logic
2025-01-02 19:36:54 +08:00
hoshi-hiyouga
8741e5b3e8 Merge pull request #6462 from shibingli/main
Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building
2025-01-02 19:34:17 +08:00
hiyouga
1800f8c72d fix #6499 2025-01-02 11:28:54 +00:00
hoshi-hiyouga
f8e80d566f Merge pull request #6493 from hiyouga/hiyouga/upd_wechat
[assets] update wechat
2024-12-30 21:55:03 +08:00
hiyouga
a400d896a1 update wechat 2024-12-30 13:54:22 +00:00
hoshi-hiyouga
2382a5f031 Merge pull request #6492 from hiyouga/hiyouga/add_deepseek3
[model] add deepseek3 model
2024-12-30 21:50:13 +08:00
hiyouga
e67b9dcc3a add deepseek3 model 2024-12-30 13:39:20 +00:00
hoshi-hiyouga
91467ed313 Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template
2024-12-30 21:08:25 +08:00
hoshi-hiyouga
40805b0cc0 Merge pull request #6483 from hiyouga/hiyouga/fix_paligemma_infer
[model] update vllm & fix paligemma dtype
2024-12-30 16:34:32 +08:00
hiyouga
6f5bb3b8e5 fix #6482 2024-12-30 06:03:07 +00:00