142 Commits

Author SHA1 Message Date
Butui Hu
83688b0b4d [launcher] Add elastic and fault-tolerant training support (#8286)
Signed-off-by: Butui Hu <hot123tea123@gmail.com>
2025-06-05 16:40:03 +08:00
hoshi-hiyouga
c6bcca4c83 [example] update examples (#7964) 2025-05-06 17:24:25 +02:00
hoshi-hiyouga
a8430f4244 [misc] fix uv (#7913) 2025-04-30 07:45:03 +08:00
hoshi-hiyouga
a62cba3d05 [example] add bash usage (#7794) 2025-04-22 00:25:51 +08:00
Juanxi Tian
d128382d3c [trainer] Add Muon Optimizer (#7749)
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-04-21 23:38:37 +08:00
hoshi-hiyouga
278df4308d [parser] support omegaconf (#7793) 2025-04-21 23:30:30 +08:00
hoshi-hiyouga
4831552856 [infer] set env for vllm ascend (#7745) 2025-04-17 01:08:55 +08:00
leo-pony
e1fdd6e2f8 [infer] support vllm-ascend (#7739) 2025-04-16 20:06:47 +08:00
Eric Tang
39c1e29ed7 [ray] allow for specifying ray.init kwargs (i.e. runtime_env) (#7647)
* ray init kwargs

* Update trainer_utils.py

* fix ray args

---------

Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-04-10 11:31:05 +08:00
hoshi-hiyouga
5817cda37e [misc] fix packing and eval plot (#7623) 2025-04-07 18:20:57 +08:00
hoshi-hiyouga
7e0cdb1a76 [assets] update readme (#7612) 2025-04-06 13:58:49 +08:00
hoshi-hiyouga
aaf2e6ba2a [model] fix kv cache (#7564) 2025-04-01 23:07:46 +08:00
Qiaolin Yu
30038d9ce7 [inference] support sglang backend (#7278)
* Mimic SGLang offline Engine

* Add more tests and args

* Pass all current tests

* Clean Code

* fix sample_params

* clean code

* Fix Stream Chat

* change sglang from engine mode to server mode

* fix

* Fix Review Issues

* Use SGLang Built-In Utilities

* Fix test SGLang

* Some Doc Issue

* fix sglang engine

* add readme

---------

Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-03-15 04:37:58 +08:00
hoshi-hiyouga
5a29f49fb1 [config] update args (#7231)
Former-commit-id: ed8b12e3cb
2025-03-10 23:04:43 +08:00
hoshi-hiyouga
98ea0e8109 [misc] fix ds config (#7205)
Former-commit-id: db113f690e
2025-03-07 15:21:28 +08:00
hoshi-hiyouga
6e58115f98 [trainer] update config (#7174)
Former-commit-id: b4b89b4ff3
2025-03-05 23:32:54 +08:00
hoshi-hiyouga
ca78ba964d [model] add models (#7054)
* add qwen25vl awq models

* add moonlight

Former-commit-id: ec1a1bc118
2025-02-24 22:05:13 +08:00
hoshi-hiyouga
1f4a0b11ba [data] update vlm args (#6976)
Former-commit-id: 3da2cc2710
2025-02-18 02:12:51 +08:00
hoshi-hiyouga
b1d31ff0f9 [data] add min resolution option (#6975)
Former-commit-id: 7faecc0301
2025-02-18 01:40:46 +08:00
hoshi-hiyouga
07aa7b71a3 [misc] update readme (#6917)
Former-commit-id: 499ea45d1f
2025-02-13 00:58:10 +08:00
Eric Tang
e515fe62de [example] fix path to ray example (#6906)
Former-commit-id: ee5fe216dc
2025-02-13 00:29:32 +08:00
hoshi-hiyouga
197aa3baf4 [data] fix ollama template (#6902)
* fix ollama template

* add meta info

* use half precision

Former-commit-id: e1a7c1242c
2025-02-11 22:43:09 +08:00
hoshi-hiyouga
c6be9e242c [misc] support export ollama modelfile (#6899)
* support export ollama modelfile

* update config

* add system and num ctx

Former-commit-id: 9184a6e0ed
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
bbf334f823 disable valset by default (#6690)
Former-commit-id: 77bbf65905
2025-01-17 21:09:30 +08:00
hoshi-hiyouga
9ef85f8fc4 [optim] clean apollo (#6645)
* clean apollo code

* update readme

Former-commit-id: 7a04021d04
2025-01-15 01:42:50 +08:00
zhuHQ
763f9b9df0 [optim] add support to APOLLO (#6617)
Former-commit-id: d9189f9f0b
2025-01-15 00:24:56 +08:00
hoshi-hiyouga
d8cba9464f [inference] fix stop token for object detection (#6624)
* fix stop token

* update minicpm data pipeline

* fix npu qlora examples

Former-commit-id: e3e2c8c689
2025-01-13 21:34:20 +08:00
codingma
089c7d5e51 add nf4 qlora support on Ascend NPU (#6601)
* add nf4 qlora support on Ascend NPU

* add transformers version check

* add python>=3.10 requirement description for npu

* tiny fix

---------

Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 03de5ac912
2025-01-13 19:43:36 +08:00
hiyouga
c89d17ab63 refactor mllm param logic
Former-commit-id: f6f630a1c9
2025-01-10 15:45:48 +00:00
hiyouga
b4174021d6 refactor ray integration, support save ckpt
Former-commit-id: d8cac6f546
2025-01-07 09:39:10 +00:00
Eric Tang
bba52e258e run style check
Former-commit-id: 1e8e7be0a5
2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
1217240918 drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

Former-commit-id: 163ddb680b
2025-01-07 08:55:44 +00:00
Yaser Afshar
76ebd62ac1 Add missing key to init_kwargs
Former-commit-id: 1c8ad22a5f
2024-12-17 12:34:05 +00:00
Yaser Afshar
fe4546a7bb Add trust_remote_code parameter and remove True
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
  to enhance security


Former-commit-id: 0943776326
2024-12-17 12:25:12 +00:00
hiyouga
ba901bc000 update assets
Former-commit-id: 7059055e89
2024-12-14 17:36:03 +00:00
hiyouga
6f1e450739 fix mrope
Former-commit-id: 2811814fc4
2024-12-12 15:08:17 +00:00
hiyouga
cf8cad8e7e support qwen2vl train proj only
Former-commit-id: 99c62660c6
2024-12-05 10:37:42 +00:00
hiyouga
90fb5605c1 update examples
Former-commit-id: e5584dc7ba
2024-12-05 08:48:25 +00:00
hiyouga
235cdcacee support batch infer in vllm
Former-commit-id: 1324d158f9
2024-12-04 13:50:00 +00:00
hiyouga
0d18cca0db add vllm config
Former-commit-id: 58ab4579dc
2024-11-10 21:28:18 +08:00
hiyouga
3f7c874594 update tests
Former-commit-id: 93d3b8f43f
2024-11-02 12:41:44 +08:00
hiyouga
0d8aa6e6ef use pre-commit
Former-commit-id: 21db8ed2f4
2024-10-29 09:07:46 +00:00
hiyouga
3aa6a3e45b add e2e tests
Former-commit-id: 94d5b1bd8f
2024-09-05 21:52:28 +08:00
hiyouga
bfdcc6bacf add rlhf-v dataset
Former-commit-id: 8e49940746
2024-09-01 22:57:41 +08:00
hiyouga
f31e7e0dfc remove visual_inputs, fix qlora
Former-commit-id: a025c3df61
2024-08-31 00:24:51 +08:00
hiyouga
c883542583 add examples
Former-commit-id: e08045a946
2024-08-30 21:43:19 +08:00
hiyouga
a83756b5e9 refactor mm training
Former-commit-id: 3382317e32
2024-08-30 02:14:31 +08:00
simonJJJ
8a09b1e732 initial-commit
Former-commit-id: aeb85f200b
2024-08-28 16:51:35 +08:00
hiyouga
f8c11bd540 update examples
Former-commit-id: 0a690ada6f
2024-08-09 20:13:46 +08:00
hiyouga
5eacd17090 add adam_mini to readme
Former-commit-id: e2a28f51c6
2024-08-09 20:02:03 +08:00