Copilot
eceec8ab69
[deps] goodbye python 3.9 (#9677)
...
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: hiyouga <16256802+hiyouga@users.noreply.github.com>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-12-27 02:50:44 +08:00
浮梦
2b6f16f261
[model] temporarily support npu fused options on v0, powered by v1 kernels (#9520)
...
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-27 02:08:36 +08:00
Yaowei Zheng
eaf963f67f
[model] update kt code (#9406)
2025-11-05 15:27:22 +08:00
Peilin Li
934b3084ee
[train] KTransformers SFT as backend engine for LLaMA-Factory (#9400)
...
Co-authored-by: jimmy128 <jimmy128@noreply.gitcode.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
2025-11-04 15:54:12 +08:00
Yaowei Zheng
d9d67ba62d
[misc] fix import error (#9299)
2025-10-17 17:46:27 +08:00
Ximing Xing
c867e28093
[model] adds semantic initialization support for special tokens (#9267)
...
Co-authored-by: ximingxing <ximingxing@tencent.com>
2025-10-14 17:00:48 +08:00
Ben Feuer
1c44b60e3e
[feat] fp8 training (#8960)
...
Co-authored-by: Benjamin Feuer <penfever@gmail.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
2025-10-01 14:32:53 +08:00
hoshi-hiyouga
9ae17cd173
[deps] update to transformers 4.52 (#8125)
2025-05-21 05:16:18 +08:00
Saiya
ab41f7956c
[infer] support lora adapter for SGLang backend (#8067)
2025-05-16 23:33:47 +08:00
Kingsley
fa0eb91f1f
[data] fix internvl plugin (#7817)
2025-04-23 00:58:22 +08:00
hoshi-hiyouga
b07628dea5
[example] add bash usage (#7794)
2025-04-22 00:25:51 +08:00
flashJd
0ac641326b
[misc] fix new tokens adding (#7253)
...
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-04-21 23:19:02 +08:00
hoshi-hiyouga
c3c0efbaa0
[misc] fix packing and eval plot (#7623)
2025-04-07 18:20:57 +08:00
hoshi-hiyouga
5e22597ff1
[infer] vllm video/audio inference (#7566)
2025-04-02 02:27:04 +08:00
Kingsley
7eed496336
[model] add Qwen2.5-Omni model (#7537)
...
* preserve image_sizes
* preserve image_sizes
* init plugin
* support audio-text2text lora
* nit
* support image/video-text2text, audio-text2text
* remove args
* remove lines
* add docs && nit
* remove some comments
* fix && add merge part script
* add license
2025-03-31 20:39:35 +08:00
Qiaolin Yu
a44a53ebec
[inference] support sglang backend (#7278)
...
* Mimic SGLang offline Engine
* Add more tests and args
* Pass all current tests
* Clean Code
* fix sample_params
* clean code
* Fix Stream Chat
* change sglang from engine mode to server mode
* fix
* Fix Review Issues
* Use SGLang Built-In Utilities
* Fix test SGLang
* Some Doc Issue
* fix sglang engine
* add readme
---------
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-03-15 04:37:58 +08:00
hoshi-hiyouga
93e6184cbe
[data] gemma3 plugin pan and scan (#7294)
...
* gemma3 pan and scan
* add test case
* fix test
2025-03-13 23:29:23 +08:00
hoshi-hiyouga
650a9a9057
[misc] update format (#7277)
2025-03-13 02:53:08 +08:00
hoshi-hiyouga
264538cb26
[misc] upgrade format to py39 (#7256)
2025-03-12 00:08:41 +08:00
hoshi-hiyouga
71a1c1321a
[config] update args (#7231)
...
Former-commit-id: f71a901840811bf560df671ec63a146ff99140c6
2025-03-10 23:04:43 +08:00
hoshi-hiyouga
a255c3a476
[misc] fix cli (#7204)
...
Former-commit-id: 999f57133ca163c7108d2d5ee8194eca9b2109b4
2025-03-07 15:01:18 +08:00
hoshi-hiyouga
f5cd17881e
[data] update vlm args (#6976)
...
Former-commit-id: c28e710636a0286d4b8a1d494529b25168a8f3ab
2025-02-18 02:12:51 +08:00
hoshi-hiyouga
c09b648934
[data] add min resolution option (#6975)
...
Former-commit-id: 76bd9a98a2fb00f1a1d881e6e1364c02fd36d327
2025-02-18 01:40:46 +08:00
hoshi-hiyouga
88eafd865b
[misc] support export ollama modelfile (#6899)
...
* support export ollama modelfile
* update config
* add system and num ctx
Former-commit-id: 8c2af7466f4015f300b51841db11bcd2505ebf20
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
800de98dc8
[model] add qwen2.5 vl models (#6779)
...
Former-commit-id: ed46fb4f6194c30060b908092464dded12e5787c
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
87d685b59f
[model] support yarn (#6693)
...
Former-commit-id: 8c412abc44a4c61b683465e36c6288580d980250
2025-01-18 13:56:09 +08:00
hiyouga
a897d46049
support report custom args
...
Former-commit-id: d41254c40a1c5cacf9377096adb27efa9bdb79ea
2024-12-21 21:42:45 +00:00
Yaser Afshar
6f1c8dacea
Add missing key to init_kwargs
...
Former-commit-id: 03fc4621dad132164596a58d3e8693787b7e1aca
2024-12-17 12:34:05 +00:00
Yaser Afshar
8881237475
Add trust_remote_code parameter and remove True
...
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False` to enhance security
Former-commit-id: 4bf23f406cf5235c16f9f8139850c53354901814
2024-12-17 12:25:12 +00:00
hiyouga
4196d5b4d6
support non-reentrant-gc & fix #6358
...
Former-commit-id: 20446141e408885eb36d512bfb2dfb62bbc0c20d
2024-12-17 11:41:59 +00:00
hiyouga
5003820a6a
fix inputs
...
Former-commit-id: 7d535bb8cdf7e81edda81152e63c8cfe6c9dcc9f
2024-11-23 18:26:02 +00:00
hiyouga
1e6f96508a
add vllm config
...
Former-commit-id: 95365f0ce4f362bde7de8b679b54b548d7055bfb
2024-11-10 21:28:18 +08:00
huniu20
5b15ca0b0b
add om_hub_token argument
...
Former-commit-id: b3214e69d32067a1c22dbd60c2cde1545ba75b19
2024-10-10 17:16:46 +08:00
hiyouga
20ee1d2e19
fix #5542
...
Former-commit-id: cf28e7418c2eb07e86923a53ef832ef218e45af1
2024-09-30 23:28:55 +08:00
hiyouga
294a103ead
support activation offloading via unsloth gc
...
Former-commit-id: d3d0dd0feba3ca6f0ae970d5856bec989d26ef67
2024-09-08 01:22:19 +08:00
hiyouga
9bdba2f6a8
add e2e tests
...
Former-commit-id: 0156a37450604641c4f5f9756ad84324698fc88c
2024-09-05 21:52:28 +08:00
hiyouga
1874d579c5
video datasets
...
Former-commit-id: 33f28ce82d9e44d2615909250dc56d6a4a03cd99
2024-09-05 02:04:17 +08:00
hiyouga
fed7ae5661
fix #5334
...
Former-commit-id: a5ea0f83f00c81d128a1f50ce244866ce38ee15f
2024-09-03 19:09:42 +08:00
hiyouga
60cf12727b
add rlhf-v dataset
...
Former-commit-id: 3fd18fc34a0c994a738504746abfd5548e002437
2024-09-01 22:57:41 +08:00
hiyouga
2f6fc27c8b
remove visual_inputs, fix qlora
...
Former-commit-id: be30c01c4f1482520ece770bd54c6a4837c26f0a
2024-08-31 00:24:51 +08:00
hiyouga
c62a6ca59d
refactor mm training
...
Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a
2024-08-30 02:14:31 +08:00
hiyouga
206a8364d4
support liger kernel
...
Former-commit-id: 0f4e54abf6c5feb2329855a4047597ad5147720a
2024-08-27 11:20:14 +08:00
hiyouga
5acaa476d6
update hparams
...
Former-commit-id: 1c4feac44192b1f540208837f5a530b0d3f5fb37
2024-07-03 23:18:58 +08:00
ancv
20fdf177e8
move efficient_packing from data_args to model_args
...
Former-commit-id: 7b61659c707480bcf8c802c73e10d12ad5b9b965
2024-07-02 18:37:55 +07:00
hiyouga
8aaf1185a5
support HQQ/EETQ #4113
...
Former-commit-id: b7cb51ddb394f04fe4646b2c297fc8d918c9979e
2024-06-27 00:29:42 +08:00
hiyouga
a79e93f335
fix #4410
...
Former-commit-id: f49adc4ab5eade21d7a9e029212f17688ee9b0cf
2024-06-24 22:34:31 +08:00
stceum
16e950454e
Bug fix: `off` is parsed as `False` in YAML files; changed to `disabled` to avoid this.
...
Former-commit-id: 171289d8e4c111fdca2b100282b64c74a04a4726
2024-06-24 20:39:31 +08:00
hiyouga
32f45c9e91
support pissa
...
Former-commit-id: ef8e45f2eaf466c54e9a671512a2974575677b08
2024-06-16 01:08:12 +08:00
hiyouga
14f7bfc545
use fixture
...
Former-commit-id: 10761985691b9f934f7689c1f82aa6dd68febcca
2024-06-15 20:06:17 +08:00
hiyouga
bb88536166
add license
...
Former-commit-id: 69cfc98d7c81756a5ab6bf962240e393e449fef0
2024-06-15 17:54:33 +08:00