LLaMA-Factory/model at 7af909522a951e3ad9f022ea6f88b6755257eaa5 - LLaMA-Factory - Gitea: Git with a cup of tea

423A35C7/LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-06-17 12:48:55 +08:00

Files

History

jiaqiw09 7d719182c9 [model] fix non-packing batch (bsz>1) for Qwen3.5 with flash attention (#10529 )

2026-05-30 21:41:41 +08:00

..

[model] support MiniCPM-V-4.6 (#10472 )

2026-05-08 18:14:34 +08:00

__init__.py

[misc] upgrade format to py39 (#7256 )

2025-03-12 00:08:41 +08:00

adapter.py

[data] Optimize QwenVL video dataset preprocessing (#10404 )

2026-05-03 18:36:56 +08:00

loader.py

[refactor] Add KTransformers AMX MoE SFT support via Accelerate (#10430 )

2026-05-01 01:47:58 +08:00

patcher.py

[model] fix non-packing batch (bsz>1) for Qwen3.5 with flash attention (#10529 )

2026-05-30 21:41:41 +08:00