Commit Graph

2538 Commits

Author SHA1 Message Date
hiyouga
d8cac6f546 refactor ray integration, support save ckpt 2025-01-07 09:39:10 +00:00
Eric Tang
1e8e7be0a5 run style check 2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
163ddb680b drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
2025-01-07 08:55:44 +00:00
hoshi-hiyouga
c973f32849 Merge pull request #6547 from hiyouga/hiyouga/fix_pixtral_dpo
[trainer] fix pixtral dpo
2025-01-07 14:38:55 +08:00
hiyouga
870f23d7ea fix #6546 2025-01-07 06:30:44 +00:00
hoshi-hiyouga
b832ed9a60 Merge pull request #6528 from hiyouga/hiyouga/upd_wechat
[assets] update wechat
2025-01-04 16:01:21 +08:00
hiyouga
cd14336506 update wechat 2025-01-04 07:59:57 +00:00
hoshi-hiyouga
e6d603ac37 Merge pull request #6524 from hiyouga/hiyouga/upd_scripts
[misc] update scripts
2025-01-03 23:52:26 +08:00
hiyouga
dd44c65d7f update scripts 2025-01-03 10:50:32 +00:00
hoshi-hiyouga
51ef90ce0a Merge pull request #6515 from hiyouga/hiyouga/misc
[misc] update model name
2025-01-02 20:20:02 +08:00
hiyouga
4b8add7287 update model name 2025-01-02 12:19:21 +00:00
hoshi-hiyouga
a766cad5d4 Merge pull request #6514 from hiyouga/hiyouga/add_project
[readme] add project
2025-01-02 20:16:15 +08:00
hoshi-hiyouga
29ddc6b778 Merge pull request #6513 from hiyouga/hiyouga/add_gpt2
[model] add gpt2 model
2025-01-02 20:15:55 +08:00
hiyouga
b3e1137fbb add project 2025-01-02 12:15:41 +00:00
hiyouga
67442bd497 add gpt2 model 2025-01-02 12:07:38 +00:00
hoshi-hiyouga
72d86ecc9e Merge pull request #6512 from hiyouga/hiyouga/fix_gen_logic
[trainer] fix generate logic
2025-01-02 19:36:54 +08:00
hoshi-hiyouga
8741e5b3e8 Merge pull request #6462 from shibingli/main
Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building
2025-01-02 19:34:17 +08:00
hiyouga
1800f8c72d fix #6499 2025-01-02 11:28:54 +00:00
hoshi-hiyouga
f8e80d566f Merge pull request #6493 from hiyouga/hiyouga/upd_wechat
[assets] update wechat
2024-12-30 21:55:03 +08:00
hiyouga
a400d896a1 update wechat 2024-12-30 13:54:22 +00:00
hoshi-hiyouga
2382a5f031 Merge pull request #6492 from hiyouga/hiyouga/add_deepseek3
[model] add deepseek3 model
2024-12-30 21:50:13 +08:00
hiyouga
e67b9dcc3a add deepseek3 model 2024-12-30 13:39:20 +00:00
hoshi-hiyouga
91467ed313 Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template
2024-12-30 21:08:25 +08:00
hoshi-hiyouga
40805b0cc0 Merge pull request #6483 from hiyouga/hiyouga/fix_paligemma_infer
[model] update vllm & fix paligemma dtype
2024-12-30 16:34:32 +08:00
hiyouga
6f5bb3b8e5 fix #6482 2024-12-30 06:03:07 +00:00
hoshi-hiyouga
b55890291b Merge pull request #6465 from hiyouga/hiyouga/fix_eval_loss
[trainer] fix eval loss
2024-12-28 01:02:56 +08:00
hiyouga
2719867982 fix #6448 2024-12-27 16:54:39 +00:00
shibingli@yeah.net
f1d76786e0 Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building. 2024-12-27 18:31:14 +08:00
shibingli@yeah.net
a3a49b1ea4 Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building.This commit introduces an ARG parameter named HTTP_PROXY in the Dockerfile. This addition allows for the configuration of an HTTP proxy, facilitating image building in environments with network restrictions. 2024-12-27 18:17:17 +08:00
hoshi-hiyouga
f68074d87b Merge pull request #6457 from youkaichao/module-run
[misc] enable module run
2024-12-26 23:41:37 +08:00
youkaichao
c39d81cd1d Update cli.py 2024-12-26 23:22:09 +08:00
hoshi-hiyouga
cd56f88ff2 Merge pull request #6443 from hiyouga/hiyouga/add_qvq
[modle] add qvq
2024-12-25 15:53:19 +08:00
hiyouga
ee0e400f41 add qvq #6439 2024-12-25 07:52:41 +00:00
hoshi-hiyouga
cbd494ddaf Merge pull request #6430 from hiyouga/hiyouga/upd_wechat
[assets] update wechat
2024-12-24 16:13:20 +08:00
hiyouga
83202c9027 update wechat 2024-12-24 08:12:53 +00:00
hoshi-hiyouga
b9f73fc5ca Merge pull request #6426 from hiyouga/hiyouga/update_readme
[assets] update readme
2024-12-23 22:17:19 +08:00
hiyouga
8fd38d273e update readme 2024-12-23 14:08:59 +00:00
hoshi-hiyouga
c23a4d0658 Merge pull request #5922 from Tuyohai/main
support granite3 models
2024-12-23 16:46:02 +08:00
hoshi-hiyouga
d58746eca2 Merge pull request #6418 from hiyouga/hiyouga/add_report
[trainer] add custom args to experimental logger
2024-12-22 05:47:55 +08:00
hiyouga
5111cac6f8 support report custom args 2024-12-21 21:42:45 +00:00
hiyouga
84cd1188ac fix paligemma infer 2024-12-21 20:24:32 +00:00
hoshi-hiyouga
a2ad0738a2 Merge pull request #6416 from Zeyi-Lin/main
docs: use swanlab
2024-12-22 04:08:26 +08:00
ZeYi Lin
744ef8c268 docs: use swanlab 2024-12-21 20:59:25 +08:00
hoshi-hiyouga
947e22a4a3 Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
2024-12-21 14:09:33 +08:00
ZeYi Lin
82e5d75014 fix: project blank 2024-12-20 18:26:02 +08:00
ZeYi Lin
3a7ea2048a fix: by hiyouga suggestion 2024-12-20 16:43:03 +08:00
ZeYi Lin
5f6dafd70e feat: ui improve 2024-12-20 11:03:02 +08:00
ZeYi Lin
0a52962db3 fix: text 2024-12-19 21:26:02 +08:00
ZeYi Lin
d0eb64d5e3 fix: bugs 2024-12-19 21:08:16 +08:00
hoshi-hiyouga
c6e3c14a93 Merge pull request #6395 from hiyouga/hiyouga/fix_genkwargs
[generate] fix generate kwargs
2024-12-19 20:24:17 +08:00