423A35C7
  • Joined on 2024-05-11
423A35C7 synced and deleted reference refs/tags/hiyouga/data at 423A35C7/LLaMA-Factory from mirror 2025-10-13 23:32:50 +08:00
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-13 15:22:50 +08:00
48974783da [model]: add ernie4_5_moe support for DeepSpeed Zero3 training (#9262)
575e4099df [misc] add qwen bench script (#9259)
Compare 2 commits »
423A35C7 synced new reference hiyouga/data to 423A35C7/LLaMA-Factory from mirror 2025-10-13 15:22:50 +08:00
423A35C7 synced commits to hiyouga/data at 423A35C7/LLaMA-Factory from mirror 2025-10-13 15:22:50 +08:00
423A35C7 synced and deleted reference refs/tags/exp_qwen2vl at 423A35C7/LLaMA-Factory from mirror 2025-10-13 15:22:50 +08:00
423A35C7 synced new reference exp_qwen2vl to 423A35C7/LLaMA-Factory from mirror 2025-10-11 22:32:50 +08:00
423A35C7 synced commits to exp_qwen2vl at 423A35C7/LLaMA-Factory from mirror 2025-10-11 22:32:50 +08:00
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-10 05:42:49 +08:00
9687b71d3a [v1] init data plugins (#9248)
423A35C7 synced commits to main at 423A35C7/pytorch3d from mirror 2025-10-10 05:12:49 +08:00
fc6a6b8951 separate multigpu tests
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-09 21:32:48 +08:00
1c35db60d6 [v1] support read dataset (#9243)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-08 04:42:50 +08:00
10146029ba [v1] add v1 launcher (#9236)
95b7188090 Merge commit from fork
Compare 2 commits »
423A35C7 synced and deleted reference refs/tags/hiyouga/v1 at 423A35C7/LLaMA-Factory from mirror 2025-10-08 04:42:50 +08:00
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-05 19:32:48 +08:00
d5bb4e6394 [assets] update readme (#9232)
423A35C7 synced new reference hiyouga/v1 to 423A35C7/LLaMA-Factory from mirror 2025-10-05 19:32:48 +08:00
423A35C7 synced commits to hiyouga/v1 at 423A35C7/LLaMA-Factory from mirror 2025-10-05 19:32:48 +08:00
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-05 03:12:48 +08:00
3fe6f0febd [ci] update docker workflow (#9231)
40d3691e9e [misc] fix moe models (#9230)
af8437095a [ci] Change macOS version (#9229)
Compare 3 commits »
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-04 19:02:48 +08:00
2e2f92701f [model] add qwen3-vl-30b (#9227)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-02 18:02:50 +08:00
bcc2c1fd8f [misc] move wechat out (#9223)
7dd910f067 [misc] lint (#9221)
d10d65e4ce [docker] update Dockerfile to set no_proxy and fix pydantic version (#8651)
1c44b60e3e [feat] fp8 training (#8960)
e2b1594d31 [data] fix reasoning template (#9219)
Compare 2789 commits »
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-02 18:02:50 +08:00
7d60b840ef [v1] support switch v1 backend (#9226)
1d96c62df2 [v1] add v1 folders (#9225)
a0d44c650a [misc] add data files (#9224)
bcc2c1fd8f [misc] move wechat out (#9223)
7dd910f067 [misc] lint (#9221)
Compare 2792 commits »
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-02 09:52:50 +08:00
6f743571b1 [misc] move wechat out (#9223)