423A35C7
  • Joined on 2024-05-11
423A35C7 synced and deleted reference refs/tags/hiyouga/misc at 423A35C7/LLaMA-Factory from mirror 2025-10-02 01:42:53 +08:00
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-01 17:32:51 +08:00
c4cf97d84d [docker] update Dockerfile to set no_proxy and fix pydantic version (#8651)
05271756d2 [feat] fp8 training (#8960)
Compare 2 commits »
423A35C7 synced new reference hiyouga/misc to 423A35C7/LLaMA-Factory from mirror 2025-10-01 17:32:51 +08:00
423A35C7 synced commits to hiyouga/misc at 423A35C7/LLaMA-Factory from mirror 2025-10-01 17:32:51 +08:00
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-01 01:12:50 +08:00
4f2f058d42 [data] fix reasoning template (#9219)
d55091ea87 [npu] Redirect SDPA to torch_npu.npu_fusion_attention (opt-in, ZeRO-3 safe, no impact off NPU) (#8972)
44131fdb2a [cli] support lazy import (#9217)
Compare 3 commits »
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-09-28 07:52:49 +08:00
a3c2b6139c [data] fix qwen omni plugin (#9204)
423A35C7 synced and deleted reference refs/tags/hiyouga/fix_qwen_omni at 423A35C7/LLaMA-Factory from mirror 2025-09-28 07:52:49 +08:00
423A35C7 synced commits to hiyouga/fix_qwen_omni at 423A35C7/LLaMA-Factory from mirror 2025-09-27 23:42:49 +08:00
66ec2b6cda fix test
Compare 3 commits »
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-09-27 07:22:49 +08:00
8becd392df [model] add qwen3-vl/qwen3-omni (#9196)
423A35C7 synced new reference hiyouga/fix_qwen_omni to 423A35C7/LLaMA-Factory from mirror 2025-09-27 07:22:49 +08:00
423A35C7 synced commits to hiyouga/fix_qwen_omni at 423A35C7/LLaMA-Factory from mirror 2025-09-27 07:22:49 +08:00
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-09-24 22:12:50 +08:00
cb3f56e6c9 [docs] update ling-v2 to the readme (#9188)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-09-22 13:02:49 +08:00
953e9788d5 [model] supported ERNIE4.5 Text Models (#9165)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-09-22 04:52:49 +08:00
d891bfedad [model] add dots ocr (#9176)
a4cbf10d3d [assets] update wechat (#9177)
Compare 2 commits »
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-09-16 18:12:48 +08:00
2b27283ba0 [assets] update readme (#9143)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-09-16 01:52:48 +08:00
2df33d399e [assets] update readme (#9137)
423A35C7 synced commits to main at 423A35C7/pytorch3d from mirror 2025-09-16 01:22:48 +08:00
7711bf34a8 fix device error
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-09-14 09:02:48 +08:00
28a625bf5b [model] add qwen3 next (#9130)
5d89af9e58 [assets] update wechat (#9129)
cf48406d07 [deps] upgrade transformers to 4.56.1 (#9128)
b95c11d8ea [data] Fix qwen_2vl with valuehead (#9078)
aff6923fd1 [data] bailing template v2 & openai data converter (#9112)
Compare 5 commits »
423A35C7 synced commits to main at 423A35C7/pytorch3d from mirror 2025-09-05 04:12:49 +08:00
d098beb7a7 allow python 3.12
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-09-03 19:52:52 +08:00
59f2bf1ea3 [misc] update readme (#9071)