hoshi-hiyouga
4831552856
[infer] set env for vllm ascend ( #7745 )
2025-04-17 01:08:55 +08:00
hoshi-hiyouga
0fe5631f9b
[deps] upgrade vllm ( #7728 )
2025-04-15 14:57:40 +08:00
hoshi-hiyouga
3ef36d0057
[misc] upgrade cli ( #7714 )
2025-04-14 15:41:22 +08:00
Eric Tang
39c1e29ed7
[ray] allow for specifying ray.init kwargs (i.e. runtime_env) ( #7647 )
...
* ray init kwargs
* Update trainer_utils.py
* fix ray args
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-04-10 11:31:05 +08:00
hoshi-hiyouga
5817cda37e
[misc] fix packing and eval plot ( #7623 )
2025-04-07 18:20:57 +08:00
hoshi-hiyouga
903db09822
[infer] vllm video/audio inference ( #7566 )
2025-04-02 02:27:04 +08:00
hoshi-hiyouga
aaf2e6ba2a
[model] fix kv cache ( #7564 )
2025-04-01 23:07:46 +08:00
Billy Cao
5d1cc863a4
[data] shard the dataset to allow multiprocessing when streaming is enabled ( #7530 )
...
* Shard the dataset when streaming to allow multiprocessing
* Allow user to not set dataset_shards to ensure backward compatibility
2025-04-01 15:36:23 +08:00
Kingsley
185c76f6ad
[model] add Qwen2.5-Omni model ( #7537 )
...
* preserve image_sizes
* preserve image_sizes
* init plugin
* support audio-text2text lora
* nit
* support image/video-text2text, audio-text2text
* remove args
* remove lines
* add docs && nit
* remove some comments
* fix && add merge part script
* add license
2025-03-31 20:39:35 +08:00
Xu-pixel
f547334604
[3rdparty] support swanlab lark notification ( #7481 )
2025-03-27 01:52:01 +08:00
hoshi-hiyouga
dfbe1391e9
[deps] upgrade vllm to 0.8 ( #7436 )
2025-03-23 14:32:22 +08:00
Qiaolin Yu
30038d9ce7
[inference] support sglang backend ( #7278 )
...
* Mimic SGLang offline Engine
* Add more tests and args
* Pass all current tests
* Clean Code
* fix sample_params
* clean code
* Fix Stream Chat
* change sglang from engine mode to server mode
* fix
* Fix Review Issues
* Use SGLang Built-In Utilities
* Fix test SGLang
* Some Doc Issue
* fix sglang engine
* add readme
---------
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-03-15 04:37:58 +08:00
hoshi-hiyouga
ef5f1c1def
[data] gemma3 plugin pan and scan ( #7294 )
...
* gemma3 pan and scan
* add test case
* fix test
2025-03-13 23:29:23 +08:00
hoshi-hiyouga
9ccfb97a2c
[misc] update format ( #7277 )
2025-03-13 02:53:08 +08:00
hoshi-hiyouga
7c1640ed5f
[misc] upgrade format to py39 ( #7256 )
2025-03-12 00:08:41 +08:00
Ze-Yi LIN
0a43bc1960
[tracking] add swanlab_logdir param ( #7219 )
...
* feat: add swanlab_logdir param
* fix
Former-commit-id: a1e76af3d9
2025-03-11 00:53:07 +08:00
hoshi-hiyouga
5a29f49fb1
[config] update args ( #7231 )
...
Former-commit-id: ed8b12e3cb
2025-03-10 23:04:43 +08:00
hoshi-hiyouga
4e68828e46
[config] fix export max len ( #7230 )
...
Former-commit-id: 728c2f6819
2025-03-10 16:46:08 +08:00
hoshi-hiyouga
113cc3d920
[misc] fix cli ( #7204 )
...
Former-commit-id: bd17223559
2025-03-07 15:01:18 +08:00
hoshi-hiyouga
e7556b591e
[deps] upgrade vllm ( #7183 )
...
Former-commit-id: d739fddb10
2025-03-06 15:25:08 +08:00
hoshi-hiyouga
1f4a0b11ba
[data] update vlm args ( #6976 )
...
Former-commit-id: 3da2cc2710
2025-02-18 02:12:51 +08:00
hoshi-hiyouga
b1d31ff0f9
[data] add min resolution option ( #6975 )
...
Former-commit-id: 7faecc0301
2025-02-18 01:40:46 +08:00
Eric Tang
e55ec42d3c
[ray] specify ray storage path ( #6920 )
...
Former-commit-id: 6edd4992d7
2025-02-14 21:55:41 +08:00
hoshi-hiyouga
036fb0d561
[misc] fix grad ckpt func ( #6916 )
...
Former-commit-id: e34c3c06da
2025-02-13 00:17:18 +08:00
hoshi-hiyouga
2e2f6bea07
[data] feat: auto template ( #6905 )
...
* support auto template
* add unittest
Former-commit-id: 2f8b6847f5
2025-02-12 00:22:53 +08:00
hoshi-hiyouga
c6be9e242c
[misc] support export ollama modelfile ( #6899 )
...
* support export ollama modelfile
* update config
* add system and num ctx
Former-commit-id: 9184a6e0ed
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
ff6658ad27
[deps] upgrade vllm ( #6857 )
...
Former-commit-id: 5f38bcaba9
2025-02-08 15:02:28 +08:00
hoshi-hiyouga
f70208e1c0
[misc] allow extra args ( #6831 )
...
Former-commit-id: 74ade3a176
2025-02-06 12:38:08 +08:00
Zhangchi Feng
01915eaf40
[model] support audio ( #6701 )
...
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 24c7842948
2025-02-05 04:59:09 +08:00
hoshi-hiyouga
1fee69f874
[misc] update license year & fix llama pro ( #6814 )
...
* fix llamapro script
* change year
Former-commit-id: e2dc5b952a
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
e8c1979b79
[model] add qwen2.5 vl models ( #6779 )
...
Former-commit-id: 999c7c8fe0
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
1efe525df7
[model] support yarn ( #6693 )
...
Former-commit-id: 1f47b6186c
2025-01-18 13:56:09 +08:00
hoshi-hiyouga
bbf334f823
disable valset by default ( #6690 )
...
Former-commit-id: 77bbf65905
2025-01-17 21:09:30 +08:00
steveepreston
8895cf1152
Update val_size english description ( #6653 )
...
* Update `val_size` Description in locales.py
* Update `val_size` Description in data_args.py
* Remove extra space in data_args.py
Former-commit-id: 76675b654e
2025-01-15 16:00:20 +08:00
hoshi-hiyouga
9ef85f8fc4
[optim] clean apollo ( #6645 )
...
* clean apollo code
* update readme
Former-commit-id: 7a04021d04
2025-01-15 01:42:50 +08:00
zhuHQ
763f9b9df0
[optim] add support to APOLLO ( #6617 )
...
Former-commit-id: d9189f9f0b
2025-01-15 00:24:56 +08:00
hoshi-hiyouga
5e699458e5
pin vllm version to 0.6.5 ( #6629 )
...
Former-commit-id: 1c7663d304
2025-01-14 02:44:02 +08:00
hiyouga
c89d17ab63
refactor mllm param logic
...
Former-commit-id: f6f630a1c9
2025-01-10 15:45:48 +00:00
hoshi-hiyouga
b777fed171
Merge pull request #6564 from stephen-nju/fix_ray
...
Fix ray
Former-commit-id: 6b34b69fa6
2025-01-08 18:14:18 +08:00
zhubin
014a7ea042
fix get ray args when args not a dict
...
Former-commit-id: 9c4c84828b
2025-01-08 10:06:02 +00:00
hiyouga
da542fad18
improve log
...
Former-commit-id: 47e17dd689
2025-01-08 09:56:10 +00:00
hiyouga
b4174021d6
refactor ray integration, support save ckpt
...
Former-commit-id: d8cac6f546
2025-01-07 09:39:10 +00:00
Eric Tang
bba52e258e
run style check
...
Former-commit-id: 1e8e7be0a5
2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
1217240918
drafting ray integration
...
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
Former-commit-id: 163ddb680b
2025-01-07 08:55:44 +00:00
hiyouga
813f5919a3
fix #6482
...
Former-commit-id: 6f5bb3b8e5
2024-12-30 06:03:07 +00:00
hiyouga
47c2d91933
support report custom args
...
Former-commit-id: 5111cac6f8
2024-12-21 21:42:45 +00:00
hoshi-hiyouga
547f76e56e
Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
...
feat: add swanlab for experiment tracking and visualization.
Former-commit-id: 947e22a4a3
2024-12-21 14:09:33 +08:00
ZeYi Lin
67d4757c35
fix: project blank
...
Former-commit-id: 82e5d75014
2024-12-20 18:26:02 +08:00
ZeYi Lin
cc703b58f5
fix: by hiyouga suggestion
...
Former-commit-id: 3a7ea2048a
2024-12-20 16:43:03 +08:00
ZeYi Lin
8f786ee938
feat: ui improve
...
Former-commit-id: 5f6dafd70e
2024-12-20 11:03:02 +08:00