hoshi-hiyouga
|
e34c3c06da
|
[misc] fix grad ckpt func (#6916)
|
2025-02-13 00:17:18 +08:00 |
|
hoshi-hiyouga
|
2f8b6847f5
|
[data] feat: auto template (#6905)
* support auto template
* add unittest
|
2025-02-12 00:22:53 +08:00 |
|
hoshi-hiyouga
|
9184a6e0ed
|
[misc] support export ollama modelfile (#6899)
* support export ollama modelfile
* update config
* add system and num ctx
|
2025-02-11 19:52:25 +08:00 |
|
hoshi-hiyouga
|
5f38bcaba9
|
[deps] upgrade vllm (#6857)
|
2025-02-08 15:02:28 +08:00 |
|
hoshi-hiyouga
|
74ade3a176
|
[misc] allow extra args (#6831)
|
2025-02-06 12:38:08 +08:00 |
|
Zhangchi Feng
|
24c7842948
|
[model] support audio (#6701)
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
|
2025-02-05 04:59:09 +08:00 |
|
hoshi-hiyouga
|
e2dc5b952a
|
[misc] update license year & fix llama pro (#6814)
* fix llamapro script
* change year
|
2025-02-05 01:53:33 +08:00 |
|
hoshi-hiyouga
|
999c7c8fe0
|
[model] add qwen2.5 vl models (#6779)
|
2025-01-31 03:00:29 +08:00 |
|
hoshi-hiyouga
|
1f47b6186c
|
[model] support yarn (#6693)
|
2025-01-18 13:56:09 +08:00 |
|
hoshi-hiyouga
|
77bbf65905
|
disable valset by default (#6690)
|
2025-01-17 21:09:30 +08:00 |
|
steveepreston
|
76675b654e
|
Update val_size english description (#6653)
* Update `val_size` Description in locales.py
* Update `val_size` Description in data_args.py
* Remove extra space in data_args.py
|
2025-01-15 16:00:20 +08:00 |
|
hoshi-hiyouga
|
7a04021d04
|
[optim] clean apollo (#6645)
* clean apollo code
* update readme
|
2025-01-15 01:42:50 +08:00 |
|
zhuHQ
|
d9189f9f0b
|
[optim] add support to APOLLO (#6617)
|
2025-01-15 00:24:56 +08:00 |
|
hoshi-hiyouga
|
1c7663d304
|
pin vllm version to 0.6.5 (#6629)
|
2025-01-14 02:44:02 +08:00 |
|
hiyouga
|
f6f630a1c9
|
refactor mllm param logic
|
2025-01-10 15:45:48 +00:00 |
|
hoshi-hiyouga
|
6b34b69fa6
|
Merge pull request #6564 from stephen-nju/fix_ray
Fix ray
|
2025-01-08 18:14:18 +08:00 |
|
zhubin
|
9c4c84828b
|
fix get ray args when args not a dict
|
2025-01-08 10:06:02 +00:00 |
|
hiyouga
|
47e17dd689
|
improve log
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
d8cac6f546
|
refactor ray integration, support save ckpt
|
2025-01-07 09:39:10 +00:00 |
|
Eric Tang
|
1e8e7be0a5
|
run style check
|
2025-01-07 08:55:44 +00:00 |
|
Kourosh Hakhamaneshi
|
163ddb680b
|
drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
|
2025-01-07 08:55:44 +00:00 |
|
hiyouga
|
6f5bb3b8e5
|
fix #6482
|
2024-12-30 06:03:07 +00:00 |
|
hiyouga
|
5111cac6f8
|
support report custom args
|
2024-12-21 21:42:45 +00:00 |
|
hoshi-hiyouga
|
947e22a4a3
|
Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
|
2024-12-21 14:09:33 +08:00 |
|
ZeYi Lin
|
82e5d75014
|
fix: project blank
|
2024-12-20 18:26:02 +08:00 |
|
ZeYi Lin
|
3a7ea2048a
|
fix: by hiyouga suggestion
|
2024-12-20 16:43:03 +08:00 |
|
ZeYi Lin
|
5f6dafd70e
|
feat: ui improve
|
2024-12-20 11:03:02 +08:00 |
|
ZeYi Lin
|
d0eb64d5e3
|
fix: bugs
|
2024-12-19 21:08:16 +08:00 |
|
ZeYi Lin
|
7eb49e5ffa
|
docs: config framework
|
2024-12-19 20:22:36 +08:00 |
|
ZeYi Lin
|
3306919629
|
fix: string
|
2024-12-19 20:18:59 +08:00 |
|
hiyouga
|
d4c1fda1ad
|
fix #6391
|
2024-12-19 12:16:38 +00:00 |
|
ZeYi Lin
|
d5cf87990e
|
feat: swanlab params
|
2024-12-19 18:47:27 +08:00 |
|
hiyouga
|
c7cedc7569
|
support disable shuffling
|
2024-12-19 08:53:21 +00:00 |
|
hiyouga
|
96f8f103e5
|
add swanlab
|
2024-12-19 07:12:31 +00:00 |
|
Yaser Afshar
|
1c8ad22a5f
|
Add missing key to init_kwargs
|
2024-12-17 12:34:05 +00:00 |
|
Yaser Afshar
|
0943776326
|
Add trust_remote_code parameter and remove True
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
to enhance security
|
2024-12-17 12:25:12 +00:00 |
|
hoshi-hiyouga
|
a665ad6178
|
Merge pull request #6364 from hiyouga/hiyouga/control_reenterent_gc
[model] support non-reenterent-gc
|
2024-12-17 19:58:36 +08:00 |
|
hiyouga
|
f319da6937
|
support non-reenterent-gc & fix #6358
|
2024-12-17 11:41:59 +00:00 |
|
hiyouga
|
eda76de32b
|
support control eos, fix #6345
|
2024-12-17 10:42:05 +00:00 |
|
hiyouga
|
1324d158f9
|
support batch infer in vllm
|
2024-12-04 13:50:00 +00:00 |
|
hiyouga
|
446441fdb0
|
fix inputs
|
2024-11-23 18:26:02 +00:00 |
|
Ting
|
40627c601e
|
code refactor
|
2024-11-19 20:33:18 +08:00 |
|
hiyouga
|
58ab4579dc
|
add vllm config
|
2024-11-10 21:28:18 +08:00 |
|
hiyouga
|
c38aa29336
|
support rank0 logger
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
24da9f59b0
|
fix #5883
|
2024-11-02 13:06:34 +08:00 |
|
hiyouga
|
21db8ed2f4
|
use pre-commit
|
2024-10-29 09:07:46 +00:00 |
|
hiyouga
|
3af57795dd
|
tiny fix
|
2024-10-11 23:51:54 +08:00 |
|
hoshi-hiyouga
|
228dd1739e
|
Merge pull request #5665 from johnnynunez/main
vllm 0.6.3
|
2024-10-11 23:45:58 +08:00 |
|
Johnny
|
e5849cdcce
|
Update parser.py
|
2024-10-11 12:29:33 +02:00 |
|
huniu20
|
7b91be33c9
|
add om_hub_token argument
|
2024-10-10 17:16:46 +08:00 |
|