2595 Commits

Author SHA1 Message Date
zhubin
014a7ea042 fix –get ray args when args not a dict
Former-commit-id: 9c4c84828b
2025-01-08 10:06:02 +00:00
hiyouga
da542fad18 improve log
Former-commit-id: 47e17dd689
2025-01-08 09:56:10 +00:00
hoshi-hiyouga
984b202f83 Merge pull request #6542 from erictang000/et/ray-integration
Ray Train integration with LLaMA-Factory

Former-commit-id: d23a98825b
2025-01-08 11:46:03 +08:00
hiyouga
0c1ad5f3fb fix llamaboard with ray
Former-commit-id: c46675d5e5
2025-01-07 09:59:24 +00:00
hiyouga
b4174021d6 refactor ray integration, support save ckpt
Former-commit-id: d8cac6f546
2025-01-07 09:39:10 +00:00
Eric Tang
bba52e258e run style check
Former-commit-id: 1e8e7be0a5
2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
1217240918 drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

Former-commit-id: 163ddb680b
2025-01-07 08:55:44 +00:00
hoshi-hiyouga
a0bcac80c0 Merge pull request #6547 from hiyouga/hiyouga/fix_pixtral_dpo
[trainer] fix pixtral dpo

Former-commit-id: c973f32849
2025-01-07 14:38:55 +08:00
hiyouga
8c57169eb7 fix #6546
Former-commit-id: 870f23d7ea
2025-01-07 06:30:44 +00:00
fzc8578
b9eeaa9706 add some
Former-commit-id: 785cc70ff2
2025-01-06 19:32:39 +08:00
hoshi-hiyouga
621d73e87c Merge pull request #6528 from hiyouga/hiyouga/upd_wechat
[assets] update wechat

Former-commit-id: b832ed9a60
2025-01-04 16:01:21 +08:00
hiyouga
a02a140840 update wechat
Former-commit-id: cd14336506
2025-01-04 07:59:57 +00:00
Zhangchi Feng
a0188a430f Merge branch 'hiyouga:main' into minicpmv
Former-commit-id: ab87bd6b13
2025-01-04 11:20:33 +08:00
fzc8578
b5ef5059ee add some
Former-commit-id: 79c2d7090c
2025-01-04 11:11:15 +08:00
hoshi-hiyouga
084d356c2c Merge pull request #6524 from hiyouga/hiyouga/upd_scripts
[misc] update scripts

Former-commit-id: e6d603ac37
2025-01-03 23:52:26 +08:00
hiyouga
20a9565e36 update scripts
Former-commit-id: dd44c65d7f
2025-01-03 10:50:32 +00:00
hoshi-hiyouga
85317bcbaf Merge pull request #6515 from hiyouga/hiyouga/misc
[misc] update model name

Former-commit-id: 51ef90ce0a
2025-01-02 20:20:02 +08:00
hiyouga
528fb4f799 update model name
Former-commit-id: 4b8add7287
2025-01-02 12:19:21 +00:00
hoshi-hiyouga
aa7ec44367 Merge pull request #6514 from hiyouga/hiyouga/add_project
[readme] add project

Former-commit-id: a766cad5d4
2025-01-02 20:16:15 +08:00
hoshi-hiyouga
b2ecb80729 Merge pull request #6513 from hiyouga/hiyouga/add_gpt2
[model] add gpt2 model

Former-commit-id: 29ddc6b778
2025-01-02 20:15:55 +08:00
hiyouga
9a3afbd5d1 add project
Former-commit-id: b3e1137fbb
2025-01-02 12:15:41 +00:00
hiyouga
37c60c7d14 add gpt2 model
Former-commit-id: 67442bd497
2025-01-02 12:07:38 +00:00
hoshi-hiyouga
b921dde749 Merge pull request #6512 from hiyouga/hiyouga/fix_gen_logic
[trainer] fix generate logic

Former-commit-id: 72d86ecc9e
2025-01-02 19:36:54 +08:00
hoshi-hiyouga
d195329185 Merge pull request #6462 from shibingli/main
Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building

Former-commit-id: 8741e5b3e8
2025-01-02 19:34:17 +08:00
hiyouga
da8721a70e fix #6499
Former-commit-id: 1800f8c72d
2025-01-02 11:28:54 +00:00
hoshi-hiyouga
f318dc9464 Merge pull request #6493 from hiyouga/hiyouga/upd_wechat
[assets] update wechat

Former-commit-id: f8e80d566f
2024-12-30 21:55:03 +08:00
hiyouga
01bbe66f41 update wechat
Former-commit-id: a400d896a1
2024-12-30 13:54:22 +00:00
hoshi-hiyouga
bb664d2fc5 Merge pull request #6492 from hiyouga/hiyouga/add_deepseek3
[model] add deepseek3 model

Former-commit-id: 2382a5f031
2024-12-30 21:50:13 +08:00
hiyouga
d0e729cd33 add deepseek3 model
Former-commit-id: e67b9dcc3a
2024-12-30 13:39:20 +00:00
hoshi-hiyouga
1178cb0e33 Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template

Former-commit-id: 91467ed313
2024-12-30 21:08:25 +08:00
hoshi-hiyouga
089f824cd1 Merge pull request #6483 from hiyouga/hiyouga/fix_paligemma_infer
[model] update vllm & fix paligemma dtype

Former-commit-id: 40805b0cc0
2024-12-30 16:34:32 +08:00
hiyouga
813f5919a3 fix #6482
Former-commit-id: 6f5bb3b8e5
2024-12-30 06:03:07 +00:00
hoshi-hiyouga
951d845af2 Merge pull request #6465 from hiyouga/hiyouga/fix_eval_loss
[trainer] fix eval loss

Former-commit-id: b55890291b
2024-12-28 01:02:56 +08:00
hiyouga
3bcb4633ca fix #6448
Former-commit-id: 2719867982
2024-12-27 16:54:39 +00:00
shibingli@yeah.net
c76c33ddb1 Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building.
Former-commit-id: f1d76786e0
2024-12-27 18:31:14 +08:00
shibingli@yeah.net
a37ef0eaae Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building. This commit introduces an ARG parameter named HTTP_PROXY in the Dockerfile. This addition allows for the configuration of an HTTP proxy, facilitating image building in environments with network restrictions.
Former-commit-id: a3a49b1ea4
2024-12-27 18:17:17 +08:00
hoshi-hiyouga
377dfe5665 Merge pull request #6457 from youkaichao/module-run
[misc] enable module run

Former-commit-id: f68074d87b
2024-12-26 23:41:37 +08:00
youkaichao
f6d5dd6f10 Update cli.py
Former-commit-id: c39d81cd1d
2024-12-26 23:22:09 +08:00
hoshi-hiyouga
a36f9d923e Merge pull request #6443 from hiyouga/hiyouga/add_qvq
[model] add qvq

Former-commit-id: cd56f88ff2
2024-12-25 15:53:19 +08:00
hiyouga
c83b74ab9e add qvq #6439
Former-commit-id: ee0e400f41
2024-12-25 07:52:41 +00:00
hoshi-hiyouga
c5780f5eaa Merge pull request #6430 from hiyouga/hiyouga/upd_wechat
[assets] update wechat

Former-commit-id: cbd494ddaf
2024-12-24 16:13:20 +08:00
hiyouga
4cd1d05429 update wechat
Former-commit-id: 83202c9027
2024-12-24 08:12:53 +00:00
hoshi-hiyouga
459219a260 Merge pull request #6426 from hiyouga/hiyouga/update_readme
[assets] update readme

Former-commit-id: b9f73fc5ca
2024-12-23 22:17:19 +08:00
hiyouga
353259f03f update readme
Former-commit-id: 8fd38d273e
2024-12-23 14:08:59 +00:00
hoshi-hiyouga
8265d6a228 Merge pull request #5922 from Tuyohai/main
support granite3 models

Former-commit-id: c23a4d0658
2024-12-23 16:46:02 +08:00
hoshi-hiyouga
c0418062c0 Merge pull request #6418 from hiyouga/hiyouga/add_report
[trainer] add custom args to experimental logger

Former-commit-id: d58746eca2
2024-12-22 05:47:55 +08:00
hiyouga
47c2d91933 support report custom args
Former-commit-id: 5111cac6f8
2024-12-21 21:42:45 +00:00
hiyouga
f07bad7144 fix paligemma infer
Former-commit-id: 84cd1188ac
2024-12-21 20:24:32 +00:00
hoshi-hiyouga
9d437a5f4f Merge pull request #6416 from Zeyi-Lin/main
docs: use swanlab
Former-commit-id: a2ad0738a2
2024-12-22 04:08:26 +08:00
ZeYi Lin
1c1d6bea43 docs: use swanlab
Former-commit-id: 744ef8c268
2024-12-21 20:59:25 +08:00