Yaowei Zheng
|
4b0ec83928
|
[deps] bump transformers to 4.49.0 (#8564)
|
2025-07-07 20:31:50 +08:00 |
|
Ze-Yi LIN
|
16f13d304b
|
[tracking] fix swanlab hparams (#8532)
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2025-07-02 22:08:44 +08:00 |
|
Kingsley
|
bede213da7
|
[assets] update readme (#8519)
|
2025-07-02 15:38:38 +08:00 |
|
Injae Ryou
|
a5a93597b1
|
[parser] update config loading to use OmegaConf #7793 (#8505)
|
2025-07-01 21:05:13 +08:00 |
|
Yaowei Zheng
|
c6c764388c
|
[assets] update readme (#8396)
|
2025-06-17 16:15:20 +08:00 |
|
Yaowei Zheng
|
3a3bae1cfe
|
[data] fix qwen2vl pos ids (#8387)
|
2025-06-17 00:48:54 +08:00 |
|
Yaowei Zheng
|
9a2d1dec62
|
[assets] update wechat (#8385)
|
2025-06-16 18:23:22 +08:00 |
|
Aman Gupta
|
8e4ac78607
|
[trainer] Add LD-DPO objective (#8362)
|
2025-06-12 16:10:38 +08:00 |
|
hoshi-hiyouga
|
2bf8e993ab
|
[data] fix shared file system (#8179)
|
2025-05-27 18:36:03 +08:00 |
|
hoshi-hiyouga
|
9ae17cd173
|
[deps] update to transformers 4.52 (#8125)
|
2025-05-21 05:16:18 +08:00 |
|
hoshi-hiyouga
|
9b5baa97f0
|
[data] qwen3 fixes (#8109)
|
2025-05-20 02:00:30 +08:00 |
|
hoshi-hiyouga
|
45030ff803
|
[model] switch to gptqmodel (#8108)
|
2025-05-19 22:25:40 +08:00 |
|
Saiya
|
ab41f7956c
|
[infer] support lora adapter for SGLang backend (#8067)
|
2025-05-16 23:33:47 +08:00 |
|
hoshi-hiyouga
|
13b05e74f1
|
[hparam] add enable think argument (#7928)
|
2025-04-30 17:21:30 +08:00 |
|
hoshi-hiyouga
|
73198a6645
|
[misc] fix uv (#7913)
|
2025-04-30 07:45:03 +08:00 |
|
hoshi-hiyouga
|
d4ee44bdef
|
[data] add eval_on_each_dataset arg (#7912)
|
2025-04-30 06:56:43 +08:00 |
|
Eric Tang
|
ef03832cd4
|
[ray] add storage filesystem to ray config (#7854)
|
2025-04-27 22:12:40 +08:00 |
|
Kingsley
|
fa0eb91f1f
|
[data] fix internvl plugin (#7817)
|
2025-04-23 00:58:22 +08:00 |
|
hoshi-hiyouga
|
fddcd43c88
|
[trainer] support early stop (#7797)
|
2025-04-22 01:59:33 +08:00 |
|
hoshi-hiyouga
|
b07628dea5
|
[example] add bash usage (#7794)
|
2025-04-22 00:25:51 +08:00 |
|
Juanxi Tian
|
12ada72ed4
|
[trainer] Add Muon Optimizer (#7749)
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
|
2025-04-21 23:38:37 +08:00 |
|
hoshi-hiyouga
|
416853dd25
|
[parser] support omegaconf (#7793)
|
2025-04-21 23:30:30 +08:00 |
|
flashJd
|
0ac641326b
|
[misc] fix new tokens adding (#7253)
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
|
2025-04-21 23:19:02 +08:00 |
|
hoshi-hiyouga
|
d222f63cb7
|
[infer] set env for vllm ascend (#7745)
|
2025-04-17 01:08:55 +08:00 |
|
hoshi-hiyouga
|
3df021d4d7
|
[deps] upgrade vllm (#7728)
|
2025-04-15 14:57:40 +08:00 |
|
hoshi-hiyouga
|
7c61b35106
|
[misc] upgrade cli (#7714)
|
2025-04-14 15:41:22 +08:00 |
|
Eric Tang
|
bb8d79bae2
|
[ray] allow for specifying ray.init kwargs (i.e. runtime_env) (#7647)
* ray init kwargs
* Update trainer_utils.py
* fix ray args
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
|
2025-04-10 11:31:05 +08:00 |
|
hoshi-hiyouga
|
c3c0efbaa0
|
[misc] fix packing and eval plot (#7623)
|
2025-04-07 18:20:57 +08:00 |
|
hoshi-hiyouga
|
5e22597ff1
|
[infer] vllm video/audio inference (#7566)
|
2025-04-02 02:27:04 +08:00 |
|
hoshi-hiyouga
|
2bfcad2394
|
[model] fix kv cache (#7564)
|
2025-04-01 23:07:46 +08:00 |
|
Billy Cao
|
00409ff28a
|
[data] shard the dataset to allow multiprocessing when streaming is enabled (#7530)
* Shard the dataset when streaming to allow multiprocessing
* Allow user to not set dataset_shards to ensure backward compatibility
|
2025-04-01 15:36:23 +08:00 |
|
Kingsley
|
7eed496336
|
[model] add Qwen2.5-Omni model (#7537)
* preserve image_sizes
* preserve image_sizes
* init plugin
* support audio-text2text lora
* nit
* support image/video-text2text, audio-text2text
* remove args
* remove lines
* add docs && nit
* remove some comments
* fix && add merge part script
* add license
|
2025-03-31 20:39:35 +08:00 |
|
Xu-pixel
|
b578a7d5b6
|
[3rdparty] support swanlab lark notification (#7481)
|
2025-03-27 01:52:01 +08:00 |
|
hoshi-hiyouga
|
919415dba9
|
[deps] upgrade vllm to 0.8 (#7436)
|
2025-03-23 14:32:22 +08:00 |
|
Qiaolin Yu
|
a44a53ebec
|
[inference] support sglang backend (#7278)
* Mimic SGLang offline Engine
* Add more tests and args
* Pass all current tests
* Clean Code
* fix sample_params
* clean code
* Fix Stream Chat
* change sglang from engine mode to server mode
* fix
* Fix Review Issues
* Use SGLang Built-In Utilities
* Fix test SGLang
* Some Doc Issue
* fix sglang engine
* add readme
---------
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
|
2025-03-15 04:37:58 +08:00 |
|
hoshi-hiyouga
|
93e6184cbe
|
[data] gemma3 plugin pan and scan (#7294)
* gemma3 pan and scan
* add test case
* fix test
|
2025-03-13 23:29:23 +08:00 |
|
hoshi-hiyouga
|
650a9a9057
|
[misc] update format (#7277)
|
2025-03-13 02:53:08 +08:00 |
|
hoshi-hiyouga
|
264538cb26
|
[misc] upgrade format to py39 (#7256)
|
2025-03-12 00:08:41 +08:00 |
|
Ze-Yi LIN
|
18968405d0
|
[tracking] add swanlab_logdir param (#7219)
* feat: add swanlab_logdir param
* fix
Former-commit-id: 9215ad488b6ac6cd57fe8fa4acdacceb63f68ca5
|
2025-03-11 00:53:07 +08:00 |
|
hoshi-hiyouga
|
71a1c1321a
|
[config] update args (#7231)
Former-commit-id: f71a901840811bf560df671ec63a146ff99140c6
|
2025-03-10 23:04:43 +08:00 |
|
hoshi-hiyouga
|
cf58a6d860
|
[config] fix export max len (#7230)
Former-commit-id: 211c0b3e8f3340acd2fae1762d9152a09f19ba34
|
2025-03-10 16:46:08 +08:00 |
|
hoshi-hiyouga
|
a255c3a476
|
[misc] fix cli (#7204)
Former-commit-id: 999f57133ca163c7108d2d5ee8194eca9b2109b4
|
2025-03-07 15:01:18 +08:00 |
|
hoshi-hiyouga
|
d2f845d70d
|
[deps] upgrade vllm (#7183)
Former-commit-id: 37678a3d64668c3b4a4bfefc054e3b9b40427c1a
|
2025-03-06 15:25:08 +08:00 |
|
hoshi-hiyouga
|
f5cd17881e
|
[data] update vlm args (#6976)
Former-commit-id: c28e710636a0286d4b8a1d494529b25168a8f3ab
|
2025-02-18 02:12:51 +08:00 |
|
hoshi-hiyouga
|
c09b648934
|
[data] add min resolution option (#6975)
Former-commit-id: 76bd9a98a2fb00f1a1d881e6e1364c02fd36d327
|
2025-02-18 01:40:46 +08:00 |
|
Eric Tang
|
76f9bd1820
|
[ray] specify ray storage path (#6920)
Former-commit-id: 4be6b66b1eaa79955e936ce2b747a8837ecd1e49
|
2025-02-14 21:55:41 +08:00 |
|
hoshi-hiyouga
|
3a3f4072e5
|
[misc] fix grad ckpt func (#6916)
Former-commit-id: 35e069a52b3d7cfd9b0107574b09265eb2290f0b
|
2025-02-13 00:17:18 +08:00 |
|
hoshi-hiyouga
|
2581cc844b
|
[data] feat: auto template (#6905)
* support auto template
* add unittest
Former-commit-id: 0c6c9150db6414a5a05527ea486dce6633dff4b3
|
2025-02-12 00:22:53 +08:00 |
|
hoshi-hiyouga
|
88eafd865b
|
[misc] support export ollama modelfile (#6899)
* support export ollama modelfile
* update config
* add system and num ctx
Former-commit-id: 8c2af7466f4015f300b51841db11bcd2505ebf20
|
2025-02-11 19:52:25 +08:00 |
|
hoshi-hiyouga
|
4d1791e905
|
[deps] upgrade vllm (#6857)
Former-commit-id: 4bd50f65a3d62528768561019fda2723d045c7fd
|
2025-02-08 15:02:28 +08:00 |
|