Eric Tang
|
d8a5571be7
|
[3rdparty] fix redundant process group destroy for ray (#7395)
* fix redundant process group destroy for ray
* Update tuner.py
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
|
2025-03-21 10:56:47 +08:00 |
|
hoshi-hiyouga
|
555b71a1cb
|
[version] fix minicpmo (#7378)
|
2025-03-20 16:59:31 +08:00 |
|
hoshi-hiyouga
|
4a5d0f0ba7
|
[assets] update wechat (#7361)
|
2025-03-18 21:31:09 +08:00 |
|
hoshi-hiyouga
|
c518146e62
|
[misc] set dev version (#7351)
|
2025-03-18 00:10:53 +08:00 |
|
hoshi-hiyouga
|
1d2131e5cb
|
[data] fix template (#7349)
|
2025-03-17 23:45:20 +08:00 |
|
hoshi-hiyouga
|
48a6584fb1
|
[assets] update videos (#7340)
* Update README.md
* Update README_zh.md
|
2025-03-17 15:48:02 +08:00 |
|
Hertz
|
a71e685021
|
[model] support hunyuan 7b (#7317)
* [Model] supported tencent-hunyuan model
* [Model] supported tencent-hunyuan model (fix)
* [Model] supported tencent-hunyuan model (fix)
|
2025-03-15 20:55:24 +08:00 |
|
Qiaolin Yu
|
30038d9ce7
|
[inference] support sglang backend (#7278)
* Mimic SGLang offline Engine
* Add more tests and args
* Pass all current tests
* Clean Code
* fix sample_params
* clean code
* Fix Stream Chat
* change sglang from engine mode to server mode
* fix
* Fix Review Issues
* Use SGLang Built-In Utilities
* Fix test SGLang
* Some Doc Issue
* fix sglang engine
* add readme
---------
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
|
2025-03-15 04:37:58 +08:00 |
|
hoshi-hiyouga
|
ef5f1c1def
|
[data] gemma3 plugin pan and scan (#7294)
* gemma3 pan and scan
* add test case
* fix test
|
2025-03-13 23:29:23 +08:00 |
|
Victor Nogueira
|
3dff4ecca8
|
[dataset] fix ultrachat_200k dataset (#7259)
The `HuggingFaceH4/ultrachat_200k` dataset doesn't contain the default "train" split. The correct split is "train_sft".
|
2025-03-13 20:20:18 +08:00 |
|
hoshi-hiyouga
|
0dbce72fb8
|
[assets] update wechat (#7288)
|
2025-03-13 18:48:59 +08:00 |
|
hoshi-hiyouga
|
e9b427d535
|
[assets] update video (#7287)
|
2025-03-13 18:45:47 +08:00 |
|
Ritesh Goru
|
d7d79f7e06
|
[data] efficient 4d_attention_mask creation in neat_packing (#7272)
|
2025-03-13 03:31:12 +08:00 |
|
hoshi-hiyouga
|
9ccfb97a2c
|
[misc] update format (#7277)
|
2025-03-13 02:53:08 +08:00 |
|
hoshi-hiyouga
|
165d3ed084
|
[model] support gemma3 (#7273)
|
2025-03-13 01:35:23 +08:00 |
|
hoshi-hiyouga
|
142fd7e755
|
[misc] upgrade deps (#7257)
|
2025-03-12 00:33:47 +08:00 |
|
hoshi-hiyouga
|
7c1640ed5f
|
[misc] upgrade format to py39 (#7256)
|
2025-03-12 00:08:41 +08:00 |
|
hoshi-hiyouga
|
cdafa8a15e
|
[ci] update workflow (#7255)
|
2025-03-11 22:57:49 +08:00 |
|
hoshi-hiyouga
|
b256ca86f0
|
[core] release v0.9.2 (#7254)
|
2025-03-11 22:42:23 +08:00 |
|
hoshi-hiyouga
|
7a7071e504
|
Merge pull request #7242 from hiyouga/hiyouga/release
[release] release v0.9.2
Former-commit-id: 6b25268990bf225d84e29d4067595cf720fa12d8
|
2025-03-11 15:28:45 +08:00 |
|
hoshi-hiyouga
|
847ae972d0
|
Merge pull request #7247 from hiyouga/hiyouga/commit
[misc] support print commit info
Former-commit-id: 0f7ec4f8529a5d7ea2153b881335821038307bb7
|
2025-03-11 15:28:04 +08:00 |
|
hoshi-hiyouga
|
1c634d9c53
|
Merge pull request #7244 from hiyouga/hiyouga/token
[data] avoid exit after saving preprocessed data
Former-commit-id: dcbf01b0035062fa14187e5bdbb925080d349501
|
2025-03-11 15:17:15 +08:00 |
|
hiyouga
|
99b71768a0
|
support commit info
Former-commit-id: af752b1c27
|
2025-03-11 15:13:59 +08:00 |
|
hiyouga
|
37b844d929
|
remove exit in preprocess
Former-commit-id: 1a800f9993
|
2025-03-11 15:08:25 +08:00 |
|
hiyouga
|
f5810a6e47
|
release v0.9.2
Former-commit-id: aaad963593
|
2025-03-11 14:49:13 +08:00 |
|
hoshi-hiyouga
|
317d0855d2
|
[infer] fix vllm args (#7235)
Former-commit-id: ef7af457fc
|
2025-03-11 01:15:35 +08:00 |
|
Ze-Yi LIN
|
0a43bc1960
|
[tracking] add swanlab_logdir param (#7219)
* feat: add swanlab_logdir param
* fix
Former-commit-id: a1e76af3d9
|
2025-03-11 00:53:07 +08:00 |
|
hoshi-hiyouga
|
5a29f49fb1
|
[config] update args (#7231)
Former-commit-id: ed8b12e3cb
|
2025-03-10 23:04:43 +08:00 |
|
hoshi-hiyouga
|
4e68828e46
|
[config] fix export max len (#7230)
Former-commit-id: 728c2f6819
|
2025-03-10 16:46:08 +08:00 |
|
hoshi-hiyouga
|
9a0044ef5e
|
[assets] update wechat (#7229)
Former-commit-id: ae4cbe8fbc
|
2025-03-10 15:39:06 +08:00 |
|
hoshi-hiyouga
|
d412301d08
|
[data] update mm demo data (#7211)
Former-commit-id: 1774882f5a
|
2025-03-07 20:07:15 +08:00 |
|
hoshi-hiyouga
|
5a0fd22c05
|
[assets] update readme (#7209)
Former-commit-id: cdf8fc6478
|
2025-03-07 17:27:49 +08:00 |
|
hoshi-hiyouga
|
df63f05b47
|
[data] fix loader (#7207)
* fix dataloader
* add test case
* fix type
* fix ci
* fix ci
* fix ci
* disable overwrite cache in ci
Former-commit-id: 8c3f9f6747
|
2025-03-07 17:20:46 +08:00 |
|
hoshi-hiyouga
|
98ea0e8109
|
[misc] fix ds config (#7205)
Former-commit-id: db113f690e
|
2025-03-07 15:21:28 +08:00 |
|
ZhangChuanhui
|
33b4c33279
|
[data] fix function formatter (#7201)
Co-authored-by: zhangchuanhui <zhangchal@digitalchina.com>
Former-commit-id: 194e3bddb2
|
2025-03-07 15:17:23 +08:00 |
|
hoshi-hiyouga
|
113cc3d920
|
[misc] fix cli (#7204)
Former-commit-id: bd17223559
|
2025-03-07 15:01:18 +08:00 |
|
hoshi-hiyouga
|
b6c0e8608e
|
[script] fix vllm version (#7193)
Former-commit-id: 313355759d
|
2025-03-06 17:14:17 +08:00 |
|
hoshi-hiyouga
|
eba31ae313
|
[webui] support escape html (#7190)
Former-commit-id: abb23f7673
|
2025-03-06 16:52:21 +08:00 |
|
hoshi-hiyouga
|
e7556b591e
|
[deps] upgrade vllm (#7183)
Former-commit-id: d739fddb10
|
2025-03-06 15:25:08 +08:00 |
|
hoshi-hiyouga
|
2b21c749c1
|
[data] fix mm template (#7181)
Former-commit-id: be66df1f02
|
2025-03-06 15:18:32 +08:00 |
|
hoshi-hiyouga
|
002f58ef8e
|
[model] add QwQ 32b (#7179)
Former-commit-id: 64a6fb9b50
|
2025-03-06 11:58:36 +08:00 |
|
Ze-Yi LIN
|
c67d2b9327
|
[trainer] fix swanlab callback (#7176)
Former-commit-id: 8ad03258e1
|
2025-03-06 00:33:37 +08:00 |
|
hoshi-hiyouga
|
6e58115f98
|
[trainer] update config (#7174)
Former-commit-id: b4b89b4ff3
|
2025-03-05 23:32:54 +08:00 |
|
sirui.li
|
8dddffa340
|
[data] fix qwen2audio plugin (#7166)
* Update pairwise.py
[data] Repair multimodal model DPO training
* Update pairwise.py
[data] Repair multimodal model DPO training using deepcopy
* Update pairwise.py
* Update mm_plugin.py
Former-commit-id: dff4130969
|
2025-03-05 18:03:36 +08:00 |
|
hoshi-hiyouga
|
e1d574a784
|
[assets] update wechat (#7161)
Former-commit-id: 0c403ea15b
|
2025-03-05 14:11:10 +08:00 |
|
hoshi-hiyouga
|
caef0a8937
|
[data] use bicubic resampler (#7143)
Former-commit-id: bc298c60b7
|
2025-03-04 00:17:06 +08:00 |
|
hoshi-hiyouga
|
392533e139
|
[webui] fix webui (#7142)
Former-commit-id: 17ba2d5082
|
2025-03-04 00:01:49 +08:00 |
|
rabbit
|
299cd03785
|
[data] bailing template (#7117)
* add bailing template
* add bailing template
* add bailing template
---------
Co-authored-by: chengshiwen.csw@antgroup.com <chengshiwen.csw@antgroup.com>
Former-commit-id: 049ddf48af
|
2025-03-03 15:33:22 +08:00 |
|
hoshi-hiyouga
|
ee1b580328
|
[inference] fix hf_engine (#7120)
Former-commit-id: 1036311826
|
2025-03-01 05:22:49 +08:00 |
|
hoshi-hiyouga
|
54a090079c
|
[assets] update wechat (#7106)
Former-commit-id: d1863bbbaa
|
2025-02-28 12:01:04 +08:00 |
|