hiyouga
|
d8cac6f546
|
refactor ray integration, support save ckpt
|
2025-01-07 09:39:10 +00:00 |
|
Eric Tang
|
1e8e7be0a5
|
run style check
|
2025-01-07 08:55:44 +00:00 |
|
Kourosh Hakhamaneshi
|
163ddb680b
|
drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
|
2025-01-07 08:55:44 +00:00 |
|
hoshi-hiyouga
|
c973f32849
|
Merge pull request #6547 from hiyouga/hiyouga/fix_pixtral_dpo
[trainer] fix pixtral dpo
|
2025-01-07 14:38:55 +08:00 |
|
hiyouga
|
870f23d7ea
|
fix #6546
|
2025-01-07 06:30:44 +00:00 |
|
hoshi-hiyouga
|
b832ed9a60
|
Merge pull request #6528 from hiyouga/hiyouga/upd_wechat
[assets] update wechat
|
2025-01-04 16:01:21 +08:00 |
|
hiyouga
|
cd14336506
|
update wechat
|
2025-01-04 07:59:57 +00:00 |
|
hoshi-hiyouga
|
e6d603ac37
|
Merge pull request #6524 from hiyouga/hiyouga/upd_scripts
[misc] update scripts
|
2025-01-03 23:52:26 +08:00 |
|
hiyouga
|
dd44c65d7f
|
update scripts
|
2025-01-03 10:50:32 +00:00 |
|
hoshi-hiyouga
|
51ef90ce0a
|
Merge pull request #6515 from hiyouga/hiyouga/misc
[misc] update model name
|
2025-01-02 20:20:02 +08:00 |
|
hiyouga
|
4b8add7287
|
update model name
|
2025-01-02 12:19:21 +00:00 |
|
hoshi-hiyouga
|
a766cad5d4
|
Merge pull request #6514 from hiyouga/hiyouga/add_project
[readme] add project
|
2025-01-02 20:16:15 +08:00 |
|
hoshi-hiyouga
|
29ddc6b778
|
Merge pull request #6513 from hiyouga/hiyouga/add_gpt2
[model] add gpt2 model
|
2025-01-02 20:15:55 +08:00 |
|
hiyouga
|
b3e1137fbb
|
add project
|
2025-01-02 12:15:41 +00:00 |
|
hiyouga
|
67442bd497
|
add gpt2 model
|
2025-01-02 12:07:38 +00:00 |
|
hoshi-hiyouga
|
72d86ecc9e
|
Merge pull request #6512 from hiyouga/hiyouga/fix_gen_logic
[trainer] fix generate logic
|
2025-01-02 19:36:54 +08:00 |
|
hoshi-hiyouga
|
8741e5b3e8
|
Merge pull request #6462 from shibingli/main
Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building
|
2025-01-02 19:34:17 +08:00 |
|
hiyouga
|
1800f8c72d
|
fix #6499
|
2025-01-02 11:28:54 +00:00 |
|
hoshi-hiyouga
|
f8e80d566f
|
Merge pull request #6493 from hiyouga/hiyouga/upd_wechat
[assets] update wechat
|
2024-12-30 21:55:03 +08:00 |
|
hiyouga
|
a400d896a1
|
update wechat
|
2024-12-30 13:54:22 +00:00 |
|
hoshi-hiyouga
|
2382a5f031
|
Merge pull request #6492 from hiyouga/hiyouga/add_deepseek3
[model] add deepseek3 model
|
2024-12-30 21:50:13 +08:00 |
|
hiyouga
|
e67b9dcc3a
|
add deepseek3 model
|
2024-12-30 13:39:20 +00:00 |
|
hoshi-hiyouga
|
91467ed313
|
Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template
|
2024-12-30 21:08:25 +08:00 |
|
hoshi-hiyouga
|
40805b0cc0
|
Merge pull request #6483 from hiyouga/hiyouga/fix_paligemma_infer
[model] update vllm & fix paligemma dtype
|
2024-12-30 16:34:32 +08:00 |
|
hiyouga
|
6f5bb3b8e5
|
fix #6482
|
2024-12-30 06:03:07 +00:00 |
|
hoshi-hiyouga
|
b55890291b
|
Merge pull request #6465 from hiyouga/hiyouga/fix_eval_loss
[trainer] fix eval loss
|
2024-12-28 01:02:56 +08:00 |
|
hiyouga
|
2719867982
|
fix #6448
|
2024-12-27 16:54:39 +00:00 |
|
shibingli@yeah.net
|
f1d76786e0
|
Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building.
|
2024-12-27 18:31:14 +08:00 |
|
shibingli@yeah.net
|
a3a49b1ea4
|
Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building. This commit introduces an ARG parameter named HTTP_PROXY in the Dockerfile. This addition allows for the configuration of an HTTP proxy, facilitating image building in environments with network restrictions.
|
2024-12-27 18:17:17 +08:00 |
|
hoshi-hiyouga
|
f68074d87b
|
Merge pull request #6457 from youkaichao/module-run
[misc] enable module run
|
2024-12-26 23:41:37 +08:00 |
|
youkaichao
|
c39d81cd1d
|
Update cli.py
|
2024-12-26 23:22:09 +08:00 |
|
hoshi-hiyouga
|
cd56f88ff2
|
Merge pull request #6443 from hiyouga/hiyouga/add_qvq
[model] add qvq
|
2024-12-25 15:53:19 +08:00 |
|
hiyouga
|
ee0e400f41
|
add qvq #6439
|
2024-12-25 07:52:41 +00:00 |
|
hoshi-hiyouga
|
cbd494ddaf
|
Merge pull request #6430 from hiyouga/hiyouga/upd_wechat
[assets] update wechat
|
2024-12-24 16:13:20 +08:00 |
|
hiyouga
|
83202c9027
|
update wechat
|
2024-12-24 08:12:53 +00:00 |
|
hoshi-hiyouga
|
b9f73fc5ca
|
Merge pull request #6426 from hiyouga/hiyouga/update_readme
[assets] update readme
|
2024-12-23 22:17:19 +08:00 |
|
hiyouga
|
8fd38d273e
|
update readme
|
2024-12-23 14:08:59 +00:00 |
|
hoshi-hiyouga
|
c23a4d0658
|
Merge pull request #5922 from Tuyohai/main
support granite3 models
|
2024-12-23 16:46:02 +08:00 |
|
hoshi-hiyouga
|
d58746eca2
|
Merge pull request #6418 from hiyouga/hiyouga/add_report
[trainer] add custom args to experimental logger
|
2024-12-22 05:47:55 +08:00 |
|
hiyouga
|
5111cac6f8
|
support report custom args
|
2024-12-21 21:42:45 +00:00 |
|
hiyouga
|
84cd1188ac
|
fix paligemma infer
|
2024-12-21 20:24:32 +00:00 |
|
hoshi-hiyouga
|
a2ad0738a2
|
Merge pull request #6416 from Zeyi-Lin/main
docs: use swanlab
|
2024-12-22 04:08:26 +08:00 |
|
ZeYi Lin
|
744ef8c268
|
docs: use swanlab
|
2024-12-21 20:59:25 +08:00 |
|
hoshi-hiyouga
|
947e22a4a3
|
Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
|
2024-12-21 14:09:33 +08:00 |
|
ZeYi Lin
|
82e5d75014
|
fix: project blank
|
2024-12-20 18:26:02 +08:00 |
|
ZeYi Lin
|
3a7ea2048a
|
fix: by hiyouga suggestion
|
2024-12-20 16:43:03 +08:00 |
|
ZeYi Lin
|
5f6dafd70e
|
feat: ui improve
|
2024-12-20 11:03:02 +08:00 |
|
ZeYi Lin
|
0a52962db3
|
fix: text
|
2024-12-19 21:26:02 +08:00 |
|
ZeYi Lin
|
d0eb64d5e3
|
fix: bugs
|
2024-12-19 21:08:16 +08:00 |
|
hoshi-hiyouga
|
c6e3c14a93
|
Merge pull request #6395 from hiyouga/hiyouga/fix_genkwargs
[generate] fix generate kwargs
|
2024-12-19 20:24:17 +08:00 |
|