hiyouga
aaad963593
release v0.9.2
2025-03-11 14:49:13 +08:00
hoshi-hiyouga
ef7af457fc
[infer] fix vllm args ( #7235 )
2025-03-11 01:15:35 +08:00
Ze-Yi LIN
a1e76af3d9
[tracking] add swanlab_logdir param ( #7219 )
...
* feat: add swanlab_logdir param
* fix
2025-03-11 00:53:07 +08:00
hoshi-hiyouga
ed8b12e3cb
[config] update args ( #7231 )
2025-03-10 23:04:43 +08:00
hoshi-hiyouga
728c2f6819
[config] fix export max len ( #7230 )
2025-03-10 16:46:08 +08:00
hoshi-hiyouga
ae4cbe8fbc
[assets] update wechat ( #7229 )
2025-03-10 15:39:06 +08:00
hoshi-hiyouga
1774882f5a
[data] update mm demo data ( #7211 )
2025-03-07 20:07:15 +08:00
hoshi-hiyouga
cdf8fc6478
[assets] update readme ( #7209 )
2025-03-07 17:27:49 +08:00
hoshi-hiyouga
8c3f9f6747
[data] fix loader ( #7207 )
...
* fix dataloader
* add test case
* fix type
* fix ci
* fix ci
* fix ci
* disable overwrite cache in ci
2025-03-07 17:20:46 +08:00
hoshi-hiyouga
db113f690e
[misc] fix ds config ( #7205 )
2025-03-07 15:21:28 +08:00
ZhangChuanhui
194e3bddb2
[data] fix function formatter ( #7201 )
...
Co-authored-by: zhangchuanhui <zhangchal@digitalchina.com >
2025-03-07 15:17:23 +08:00
hoshi-hiyouga
bd17223559
[misc] fix cli ( #7204 )
2025-03-07 15:01:18 +08:00
hoshi-hiyouga
313355759d
[script] fix vllm version ( #7193 )
2025-03-06 17:14:17 +08:00
hoshi-hiyouga
abb23f7673
[webui] support escape html ( #7190 )
2025-03-06 16:52:21 +08:00
hoshi-hiyouga
d739fddb10
[deps] upgrade vllm ( #7183 )
2025-03-06 15:25:08 +08:00
hoshi-hiyouga
be66df1f02
[data] fix mm template ( #7181 )
2025-03-06 15:18:32 +08:00
hoshi-hiyouga
64a6fb9b50
[model] add QwQ 32b ( #7179 )
2025-03-06 11:58:36 +08:00
Ze-Yi LIN
8ad03258e1
[trainer] fix swanlab callback ( #7176 )
2025-03-06 00:33:37 +08:00
hoshi-hiyouga
b4b89b4ff3
[trainer] update config ( #7174 )
2025-03-05 23:32:54 +08:00
sirui.li
dff4130969
[data] fix qwen2audio plugin ( #7166 )
...
* Update pairwise.py
[data]Repair multimodal model dpo training
* Update pairwise.py
[data]repair multimodal model dpo training using deepcopy
* Update pairwise.py
* Update mm_plugin.py
2025-03-05 18:03:36 +08:00
hoshi-hiyouga
0c403ea15b
[assets] update wechat ( #7161 )
2025-03-05 14:11:10 +08:00
hoshi-hiyouga
bc298c60b7
[data] use bicubic resampler ( #7143 )
2025-03-04 00:17:06 +08:00
hoshi-hiyouga
17ba2d5082
[webui] fix webui ( #7142 )
2025-03-04 00:01:49 +08:00
rabbit
049ddf48af
[data] bailing template ( #7117 )
...
* add bailing template
* add bailing template
* add bailing template
---------
Co-authored-by: chengshiwen.csw@antgroup.com <chengshiwen.csw@antgroup.com >
2025-03-03 15:33:22 +08:00
hoshi-hiyouga
1036311826
[inference] fix hf_engine ( #7120 )
2025-03-01 05:22:49 +08:00
hoshi-hiyouga
d1863bbbaa
[assets] update wechat ( #7106 )
2025-02-28 12:01:04 +08:00
Ze-Yi LIN
891c487503
[webui] display swanlab exp link ( #7089 )
...
* webui add swanlab link
* change callback name
* update
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
2025-02-27 19:40:54 +08:00
leo-pony
acc52e0fe7
[npu] update cann base image and torch 2.4 ( #7061 )
...
* Update base npu container image version:The Python version required for Hugging Face Transformers is >= python3.10
* Fix the bug: arg type of INSTALL_DEEPSPEED shoud been string now.
* Update Ascend CANN, CANN-Kernel and corresponding torch and torch-npu version
* Upgrade torch-npu needs packages' version: torch==2.1.0 and torch-npu==2.4.0.post2
2025-02-25 23:32:01 +08:00
hoshi-hiyouga
96fd510e6a
[misc] fix project toml ( #7067 )
2025-02-25 23:22:48 +08:00
JieShen
e8266fe563
[script] add seed args ( #7058 )
...
* add seed args
* add seed args
* update seed
2025-02-25 19:44:57 +08:00
Kingsley
19861d5170
[model] add paligemma2-mix series ( #7060 )
2025-02-25 18:51:16 +08:00
hoshi-hiyouga
76314e6ad1
[data] fix mllama ( #7053 )
...
* fix mllama
* fix test
2025-02-24 22:05:38 +08:00
hoshi-hiyouga
ec1a1bc118
[model] add models ( #7054 )
...
* add qwen25vl awq models
* add moonlight
2025-02-24 22:05:13 +08:00
hoshi-hiyouga
fe6dd92c84
[assets] update readme ( #7051 )
2025-02-24 20:45:06 +08:00
hoshi-hiyouga
1481af5dc9
[assets] update wechat ( #7019 )
2025-02-20 20:32:33 +08:00
Zhangchi Feng
cde479e47a
[data] fix MiniCPMV plugin ( #6998 )
...
* fix template
* fix bug in messages processing
2025-02-19 19:36:04 +08:00
hoshi-hiyouga
302ecb00fe
[webui] update css ( #6985 )
2025-02-18 18:27:57 +08:00
hoshi-hiyouga
2591a3fa8b
[data] add r1 distill dataset ( #6983 )
2025-02-18 17:25:09 +08:00
hoshi-hiyouga
b00b290c07
[version] support transformers 449 ( #6982 )
...
* support transformers 449
* fix mm plugin
2025-02-18 17:05:40 +08:00
hoshi-hiyouga
cc8c7e762b
[misc] fix script ( #6977 )
2025-02-18 17:00:46 +08:00
hoshi-hiyouga
3da2cc2710
[data] update vlm args ( #6976 )
2025-02-18 02:12:51 +08:00
hoshi-hiyouga
7faecc0301
[data] add min resolution option ( #6975 )
2025-02-18 01:40:46 +08:00
hoshi-hiyouga
bdb581c4a8
[data] fix predict dataset ( #6972 )
2025-02-17 20:29:40 +08:00
hoshi-hiyouga
ad0c6c8916
[assets] update wechat ( #6963 )
2025-02-17 15:23:17 +08:00
Zhangchi Feng
2faf8aeff8
[data] fix minicpmo template ( #6946 )
2025-02-15 00:37:41 +08:00
Eric Tang
6edd4992d7
[ray] specify ray storage path ( #6920 )
2025-02-14 21:55:41 +08:00
hoshi-hiyouga
1ada3ae5a3
[misc] fix lora regex ( #6944 )
...
* fix lora regex
* fix
2025-02-14 21:38:43 +08:00
hoshi-hiyouga
c31c63b411
[misc] fix grad ckpt ( #6931 )
2025-02-13 23:27:51 +08:00
hoshi-hiyouga
797043d29c
[model] add liger kernel to qwen2_5 vl ( #6930 )
...
* add liger kernel to qwen2_5 vl
* fix patch
* fix patch
2025-02-13 23:05:54 +08:00
Billy Cao
11eac71c13
[trainer] fix gen_kwarg to eval during training ( #5451 )
...
* Correctly pass gen_kwarg to eval during model runs
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
2025-02-13 02:35:06 +08:00