LLaMA-Factory

423A35C7/LLaMA-Factory

Fork 0

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2025-10-14 15:52:49 +08:00

Commit Graph

Select branches

Hide Pull Requests

main

#1

#1059

#11

#1186

#119

#1252

#1326

#1348

#1353

#1375

#1436

#145

#1454

#1486

#1525

#1544

#1553

#156

#158

#1624

#1689

#1690

#1695

#1699

#1700

#171

#179

#1796

#1800

#1802

#1861

#1864

#1868

#1918

#1932

#1946

#1947

#1953

#1954

#200

#2007

#2019

#2100

#2117

#213

#2163

#2194

#22

#2201

#221

#2226

#2262

#2264

#2266

#2283

#2285

#2319

#2350

#2411

#2423

#2426

#2435

#2445

#2462

#2469

#2474

#2514

#2519

#2525

#2531

#2568

#2570

#2572

#2575

#258

#26

#2608

#2683

#2689

#2730

#2739

#2743

#2746

#2764

#2766

#2830

#2845

#2849

#2872

#2903

#2905

#2919

#2944

#2945

#2963

#2967

#2993

#3004

#3046

#3053

#3057

#306

#3066

#307

#3083

#3103

#3103

#3158

#3159

#3160

#3161

#3201

#3226

#3254

#3256

#3261

#3263

#3267

#3275

#3276

#3287

#3288

#3291

#33

#3332

#3338

#3357

#3371

#3383

#3394

#3412

#3423

#3435

#3449

#3450

#3454

#3471

#3484

#3487

#3490

#3498

#3511

#3513

#3527

#3532

#356

#3578

#3584

#3588

#3596

#3601

#3604

#3651

#3654

#3655

#3661

#3683

#3692

#3702

#3741

#3746

#3748

#3755

#3756

#3785

#3792

#3794

#3799

#3804

#3812

#382

#3829

#3835

#387

#3876

#3921

#3923

#3925

#3930

#3941

#395

#3958

#3976

#3987

#4003

#4006

#4007

#4009

#4011

#4015

#4029

#4043

#4045

#4053

#4066

#4080

#4082

#4083

#4098

#4099

#4119

#4136

#4166

#4167

#4173

#4191

#4204

#4224

#4227

#4234

#4237

#4245

#4246

#4307

#4309

#4314

#4321

#4329

#4334

#434

#4342

#4347

#4348

#4352

#4355

#4377

#4382

#4409

#4417

#4445

#4446

#4461

#451

#4544

#4561

#4580

#4589

#4590

#4636

#4651

#4662

#4663

#4673

#4680

#4680

#4686

#4687

#4691

#4692

#4700

#4706

#4724

#4733

#4746

#4781

#479

#4793

#4804

#4821

#4822

#4877

#4878

#4892

#4939

#4950

#4957

#4961

#4970

#4995

#4996

#5010

#5019

#5032

#5037

#5068

#5072

#5095

#51

#5109

#511

#5111

#5112

#5115

#5118

#5156

#516

#5163

#5170

#5185

#5188

#5193

#5208

#5226

#5230

#5233

#5237

#5242

#5278

#5290

#5317

#5323

#5326

#5339

#5343

#5346

#5365

#5372

#5388

#5405

#5424

#5427

#5438

#5445

#5451

#5458

#5473

#5475

#5480

#5483

#5486

#5507

#5522

#5532

#5533

#5536

#5546

#5547

#5555

#5563

#5574

#5580

#5581

#5585

#5606

#5615

#5639

#5642

#5653

#5665

#5673

#5746

#5752

#5752

#5758

#5781

#5799

#5801

#5816

#5819

#5839

#5852

#5856

#5857

#5871

#5873

#5880

#5889

#5895

#5897

#5901

#5906

#5907

#5909

#5910

#5912

#5913

#5914

#5920

#5922

#5924

#5926

#5927

#5929

#5933

#596

#5970

#5971

#5973

#598

#5982

#5984

#599

#5990

#5993

#6010

#6022

#6046

#6052

#6065

#6078

#6083

#6098

#6103

#6120

#6121

#6123

#6124

#6125

#6126

#6127

#6128

#6129

#6137

#6138

#6140

#6141

#6151

#6152

#6156

#6157

#6160

#6170

#6175

#619

#6190

#6192

#6204

#6224

#6226

#6233

#6238

#6242

#6246

#6251

#6253

#6265

#6275

#629

#6310

#6313

#6317

#6334

#6359

#6362

#6363

#6364

#6365

#6367

#6368

#6369

#6379

#6384

#6388

#6395

#6396

#6401

#6416

#6418

#6420

#6426

#6430

#644

#6441

#6443

#6444

#6457

#6462

#6465

#6471

#6478

#6483

#6492

#6493

#6498

#6503

#6506

#651

#6512

#6513

#6514

#6515

#6524

#6527

#6528

#6542

#6547

#6564

#6565

#657

#6585

#6588

#6597

#6598

#6600

#6601

#6617

#6620

#6624

#6625

#6626

#6628

#6629

#6631

#6632

#6637

#6640

#6641

#6642

#6645

#6648

#6653

#6657

#6684

#6688

#6689

#6690

#6691

#6692

#6693

#6698

#6701

#6710

#6722

#6753

#6767

#6771

#6778

#6779

#678

#6786

#6787

#6788

#6796

#6797

#68

#6801

#6810

#6814

#6830

#6831

#6834

#6843

#6854

#6855

#6857

#6865

#6866

#6868

#6874

#6879

#6890

#6892

#6895

#6896

#6899

#6901

#6902

#6903

#6904

#6905

#6906

#6907

#6913

#6916

#6917

#6918

#6920

#6930

#6931

#6944

#6946

#6954

#6963

#6972

#6975

#6976

#6977

#6982

#6983

#6985

#6998

#7019

#7051

#7053

#7054

#7058

#7060

#7061

#7067

#7074

#7077

#7089

#7106

#7108

#7117

#7120

#7126

#7142

#7143

#7161

#7166

#7174

#7176

#7179

#7181

#7183

#7190

#7193

#7201

#7204

#7205

#7206

#7207

#7209

#7211

#7219

#7229

#7230

#7231

#7235

#7241

#7242

#7244

#7247

#7253

#7254

#7255

#7256

#7257

#7258

#7259

#7264

#7272

#7273

#7275

#7277

#7278

#7287

#7288

#7294

#7295

#7304

#7308

#7312

#7317

#7318

#7330

#7332

#7338

#7340

#7343

#7345

#7347

#7349

#7351

#7361

#7378

#7381

#7395

#7404

#741

#7413

#7419

#7420

#7432

#7436

#7437

#7440

#7441

#7442

#7445

#7448

#7449

#7453

#7455

#7456

#7462

#7466

#7469

#7471

#7481

#7500

#7505

#7509

#7519

#7523

#7530

#7537

#7546

#7553

#7564

#7566

#7567

#7570

#7573

#7576

#7578

#7594

#7609

#7611

#7612

#7623

#7625

#7635

#7638

#7639

#7644

#7645

#7646

#7647

#7654

#7655

#7657

#766

#7660

#7674

#7686

#7694

#7695

#7700

#7704

#7714

#7715

#7719

#7724

#7725

#7728

#7732

#7739

#7740

#7744

#7745

#7746

#7747

#7748

#7749

#7754

#7765

#7786

#7792

#7793

#7794

#7795

#7797

#7801

#7803

#7804

#7808

#7810

#7817

#7826

#7830

#7840

#7854

#786

#7867

#7870

#7872

#7875

#7879

#7883

#7885

#7887

#7910

#7911

#7912

#7913

#7923

#7924

#7928

#7945

#7946

#7958

#7962

#7964

#7966

#7974

#7988

#7992

#8000

#8015

#8039

#8042

#8050

#8051

#8057

#8067

#8077

#8078

#8095

#8099

#8101

#8103

#8108

#8109

#8110

#8124

#8125

#8128

#8129

#8130

#8156

#8159

#8161

#8162

#8167

#8176

#8178

#8179

#8180

#8181

#8183

#8195

#8196

#8197

#8201

#8202

#8203

#8215

#8220

#8227

#8235

#8245

#8248

#8249

#8258

#8264

#8270

#8276

#8286

#8288

#8291

#8293

#8298

#83

#8303

#8311

#8312

#8314

#8325

#8327

#8328

#8333

#8335

#8348

#8362

#8367

#8385

#8386

#8387

#8388

#8389

#8390

#8396

#84

#8403

#8414

#8421

#8422

#8423

#8432

#8433

#8438

#844

#8441

#8448

#8449

#8457

#8458

#8460

#8461

#8462

#8480

#8481

#8505

#8509

#8517

#8519

#8529

#8530

#8532

#8535

#8538

#8539

#8542

#8543

#8546

#8547

#8548

#8554

#8556

#8557

#8559

#8564

#8565

#8567

#8569

#8571

#8587

#86

#8614

#8623

#8627

#8637

#8651

#8680

#8685

#8689

#8721

#8722

#8731

#8736

#8739

#8750

#8752

#8762

#8770

#8773

#8774

#8776

#8783

#8784

#8787

#8788

#8795

#8812

#8813

#8818

#8823

#8826

#8827

#8829

#8839

#8842

#8845

#8851

#8861

#8863

#8866

#8869

#8875

#8876

#8887

#8899

#8906

#8917

#8930

#8960

#8961

#8962

#8970

#8972

#8975

#8976

#8978

#8985

#8992

#900

#9000

#9008

#9018

#9022

#9024

#9028

#9029

#9046

#9071

#9077

#9078

#9086

#9086

#9112

#9117

#9124

#9128

#9129

#9130

#9137

#9143

#9165

#9176

#9177

#9183

#9188

#9196

#9198

#9204

#9215

#9217

#9219

#9221

#9223

#9224

#9225

#9226

#9227

#9229

#9230

#9231

#9232

#9236

#9237

#9243

#9248

#9249

#9259

#9262

#9263

#9265

#975

v0.0.9

v0.1.0

v0.1.1

v0.1.2

v0.1.3

v0.1.4

v0.1.5

v0.1.6

v0.1.7

v0.1.8

v0.2.0

v0.2.1

v0.2.2

v0.3.0

v0.3.2

v0.3.3

v0.4.0

v0.5.0

v0.5.2

v0.5.3

v0.6.0

v0.6.1

v0.6.2

v0.6.3

v0.7.0

v0.7.1

v0.8.0

v0.8.1

v0.8.2

v0.8.3

v0.9.0

v0.9.1

v0.9.2

v0.9.3

151ef48b40 [data] fix function formatter (#7201) ZhangChuanhui 2025-03-07 15:17:23 +08:00
a255c3a476 [misc] fix cli (#7204) hoshi-hiyouga 2025-03-07 15:01:18 +08:00
f4ec4fa6ad [script] fix vllm version (#7193) hoshi-hiyouga 2025-03-06 17:14:17 +08:00
2635794727 [webui] support escape html (#7190) hoshi-hiyouga 2025-03-06 16:52:21 +08:00
d2f845d70d [deps] upgrade vllm (#7183) hoshi-hiyouga 2025-03-06 15:25:08 +08:00
bb8aba5abf [data] fix mm template (#7181) hoshi-hiyouga 2025-03-06 15:18:32 +08:00
9f16c50155 [model] add QwQ 32b (#7179) hoshi-hiyouga 2025-03-06 11:58:36 +08:00
25bb9f5ad9 [trainer] fix swanlab callback (#7176) Ze-Yi LIN 2025-03-06 00:33:37 +08:00
7b985f55db [trainer] update config (#7174) hoshi-hiyouga 2025-03-05 23:32:54 +08:00
fd0357a26d [data] fix qwen2audio plugin (#7166) sirui.li 2025-03-05 18:03:36 +08:00
31f9daa362 [data] use bicubic resampler (#7143) hoshi-hiyouga 2025-03-04 00:17:06 +08:00
15ea576246 [webui] fix webui (#7142) hoshi-hiyouga 2025-03-04 00:01:49 +08:00
19a6916d80 [data] bailing template (#7117) rabbit 2025-03-03 15:33:22 +08:00
585c475f71 [inference] fix hf_engine (#7120) hoshi-hiyouga 2025-03-01 05:22:49 +08:00
e62dae37fe [assets] update wechat (#7106) hoshi-hiyouga 2025-02-28 12:01:04 +08:00
11672f760d [webui] display swanlab exp link (#7089) Ze-Yi LIN 2025-02-27 19:40:54 +08:00
b9f84900ee [npu] update cann base image and torch 2.4 (#7061) leo-pony 2025-02-25 23:32:01 +08:00
5f65558088 [misc] fix project toml (#7067) hoshi-hiyouga 2025-02-25 23:22:48 +08:00
0f54a78144 [script] add seed args (#7058) JieShen 2025-02-25 19:44:57 +08:00
2986bef530 [model] add paligemma2-mix series (#7060) Kingsley 2025-02-25 18:51:16 +08:00
065f7fb5da [data] fix mllama (#7053) hoshi-hiyouga 2025-02-24 22:05:38 +08:00
c1d5073bd3 [model] add models (#7054) hoshi-hiyouga 2025-02-24 22:05:13 +08:00
ee46011b34 [assets] update readme (#7051) hoshi-hiyouga 2025-02-24 20:45:06 +08:00
d55f420206 [assets] update wechat (#7019) hoshi-hiyouga 2025-02-20 20:32:33 +08:00
fcf75633a0 [data] fix MiniCPMV plugin (#6998) Zhangchi Feng 2025-02-19 19:36:04 +08:00
e77ced045d [webui] update css (#6985) hoshi-hiyouga 2025-02-18 18:27:57 +08:00
331f53381f [data] add r1 distill dataset (#6983) hoshi-hiyouga 2025-02-18 17:25:09 +08:00
1d675a287d [version] support transformers 449 (#6982) hoshi-hiyouga 2025-02-18 17:05:40 +08:00
be33ef67fb [misc] fix script (#6977) hoshi-hiyouga 2025-02-18 17:00:46 +08:00
f5cd17881e [data] update vlm args (#6976) hoshi-hiyouga 2025-02-18 02:12:51 +08:00
c09b648934 [data] add min resolution option (#6975) hoshi-hiyouga 2025-02-18 01:40:46 +08:00
f2fd9d1b25 [data] fix predict dataset (#6972) hoshi-hiyouga 2025-02-17 20:29:40 +08:00
167342af8a [data] fix minicpmo template (#6946) Zhangchi Feng 2025-02-15 00:37:41 +08:00
76f9bd1820 [ray] specify ray storage path (#6920) Eric Tang 2025-02-14 05:55:41 -08:00
a893505924 [misc] fix lora regex (#6944) hoshi-hiyouga 2025-02-14 21:38:43 +08:00
ed25e051a9 [misc] fix grad ckpt (#6931) hoshi-hiyouga 2025-02-13 23:27:51 +08:00
5e5fc337f9 [model] add liger kernel to qwen2_5 vl (#6930) hoshi-hiyouga 2025-02-13 23:05:54 +08:00
58e9ca8aa0 [trainer] fix gen_kwarg to eval during training (#5451) Billy Cao 2025-02-13 02:35:06 +08:00
a4c4b8496f [data] evaluate on each dataset (#5522) SrWYG 2025-02-13 02:19:03 +08:00
38c9641777 [data] improve error handling (#6128) Noah 2025-02-13 01:39:41 +08:00
8b8fdb3a85 [misc] update readme (#6918) hoshi-hiyouga 2025-02-13 01:01:41 +08:00
290057069e [misc] update readme (#6917) hoshi-hiyouga 2025-02-13 00:58:10 +08:00
46203856fc [breaking change] refactor data pipeline (#6901) hoshi-hiyouga 2025-02-13 00:39:20 +08:00
80b89978d9 [misc] support for launching LLaMA-Factory with uv run (#6907) Eric Tang 2025-02-12 08:38:44 -08:00
5a221d91f9 [example] fix path to ray example (#6906) Eric Tang 2025-02-12 08:29:32 -08:00
3a3f4072e5 [misc] fix grad ckpt func (#6916) hoshi-hiyouga 2025-02-13 00:17:18 +08:00
0c0cdc26bc [trainer] fix llama3.2 vision kto train (#6904) marko1616 2025-02-12 19:09:14 +08:00
2581cc844b [data] feat: auto template (#6905) hoshi-hiyouga 2025-02-12 00:22:53 +08:00
d58fcd094e [misc] update readme (#6903) hoshi-hiyouga 2025-02-11 22:51:26 +08:00
86063e27ea [data] fix ollama template (#6902) hoshi-hiyouga 2025-02-11 22:43:09 +08:00
88eafd865b [misc] support export ollama modelfile (#6899) hoshi-hiyouga 2025-02-11 19:52:25 +08:00
3f7bd98bfa [data] refactor template (#6896) hoshi-hiyouga 2025-02-11 17:59:25 +08:00
b72c4bd118 support ollama modelfile export (#4686) codingma 2025-02-11 17:52:24 +08:00
808ff89a2d [data] refactor mm plugin (#6895) hoshi-hiyouga 2025-02-11 16:34:49 +08:00
6d7f1299bd [data] fix qwen_2_5_vl video processing (#6868) HJ 2025-02-11 16:14:50 +08:00
0420a608ca [assets] update wechat (#6892) hoshi-hiyouga 2025-02-11 13:56:26 +08:00
2047eab723 [da'ta] fix minicpmv plugin (#6890) Zhangchi Feng 2025-02-11 13:30:44 +08:00
e11b40c344 [data] fix: sharegpt converter (#6879) HJ 2025-02-10 21:59:12 +08:00
b869506a57 [data] fix mllama collator (#6874) hoshi-hiyouga 2025-02-09 22:42:25 +08:00
72d5b06b08 [test] align test cases (#6865) hoshi-hiyouga 2025-02-09 01:03:49 +08:00
94726bdc8d [dataset] add openthought (#6866) hoshi-hiyouga 2025-02-09 00:53:01 +08:00
4d1791e905 [deps] upgrade vllm (#6857) hoshi-hiyouga 2025-02-08 15:02:28 +08:00
528e06ccaa fix qwen2vl plugin (#6855) hoshi-hiyouga 2025-02-08 10:59:10 +08:00
fec641ec82 [misc] allow extra args (#6831) hoshi-hiyouga 2025-02-06 12:38:08 +08:00
8f401e37f8 [model] support audio (#6701) Zhangchi Feng 2025-02-05 04:59:09 +08:00
9feb78e7b4 [data] allow thought in function call (#6797) Yueqi Song 2025-02-05 02:26:23 +08:00
c2022431aa [misc] update license year & fix llama pro (#6814) hoshi-hiyouga 2025-02-05 01:53:33 +08:00
0817c24c04 [data] fix qwen tool template (#6796) Yueqi Song 2025-02-05 00:02:00 +08:00
cfb926fb84 [data] fix minicpmv plugin (#6801) Zhangchi Feng 2025-02-04 21:20:15 +08:00
34746d6151 [readme] update flash attention installation instruction on win platform (#6788) neavo 2025-02-01 12:43:29 +08:00
5bb447b118 [misc] update workflows (#6787) hoshi-hiyouga 2025-02-01 04:54:42 +08:00
a28261a866 [model] add mistral small models (#6786) hoshi-hiyouga 2025-02-01 04:31:38 +08:00
800de98dc8 [model] add qwen2.5 vl models (#6779) hoshi-hiyouga 2025-01-31 03:00:29 +08:00
222423bcef [breaking] support transformers 4.48 (#6628) hoshi-hiyouga 2025-01-31 01:36:33 +08:00
e71737351f [webui] improve webui & reasoning mode (#6778) hoshi-hiyouga 2025-01-31 00:09:21 +08:00
4f298894da [model] add deepseek-R1 & show think process (#6767) qvlehao 2025-01-29 12:16:26 +08:00
a8fae3869d fix: avoid redundant normalization in DPO's SFT loss calculation (#6722) yinpu 2025-01-21 13:38:02 +08:00
db9b977e4f [webui] support ja (#6698) engchina 2025-01-20 19:46:38 +08:00
87d685b59f [model] support yarn (#6693) hoshi-hiyouga 2025-01-18 13:56:09 +08:00
e4046bdd1f [assets] update wechat (#6692) hoshi-hiyouga 2025-01-18 12:35:03 +08:00
5baa3add8c [misc] update mm plugin (#6691) hoshi-hiyouga 2025-01-17 23:04:26 +08:00
332f637592 disable valset by default (#6690) hoshi-hiyouga 2025-01-17 21:09:30 +08:00
31daa6570b [webui] upgrade to gradio 5 (#6688) hoshi-hiyouga 2025-01-17 20:15:42 +08:00
33525a34b6 fix qwen2 moe (#6684) hoshi-hiyouga 2025-01-17 13:46:09 +08:00
3607caa2ad [data] Fix minicpmv/o dpo training (#6657) Zhangchi Feng 2025-01-15 17:30:37 +08:00
0fc2e19279 Update val_size english description (#6653) steveepreston 2025-01-15 11:30:20 +03:30
ef994600db update readme (#6648) hoshi-hiyouga 2025-01-15 11:06:19 +08:00
7638f1070e [optim] clean apollo (#6645) hoshi-hiyouga 2025-01-15 01:42:50 +08:00
c2120432db [optim] add support to APOLLO (#6617) zhuHQ 2025-01-14 10:24:56 -06:00
66184762e8 update readme of MiniCPM-o (#6642) Zhangchi Feng 2025-01-14 21:22:35 +08:00
41a9e231cb lint (#6641) hoshi-hiyouga 2025-01-14 18:40:07 +08:00
1bb06e06df Support InternLM3 Dense 8B Model (#6640) Haian Huang(深度眸) 2025-01-14 18:07:27 +08:00
381f7120e6 Fix tokenizer max length (#6632) Xiaosu Zhu 2025-01-14 17:35:54 +08:00
f7857c83e1 Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 (#6631) Zhangchi Feng 2025-01-14 17:34:58 +08:00
d0da6f40b0 [model] fix mllama any image (#6637) hoshi-hiyouga 2025-01-14 16:47:58 +08:00
28d145a066 pin vllm version to 0.6.5 (#6629) hoshi-hiyouga 2025-01-14 02:44:02 +08:00
ae32c148d1 Support new features of MiniCPM-V (#6626) Zhangchi Feng 2025-01-14 00:26:19 +08:00
2a05941b14 [inference] fix stop token for object detection (#6624) hoshi-hiyouga 2025-01-13 21:34:20 +08:00
11c38b9173 add nf4 qlora support on Ascend NPU (#6601) codingma 2025-01-13 19:43:36 +08:00
73c1c15b62 Fix template name of MiniCPM-V (#6620) Zhangchi Feng 2025-01-13 16:46:48 +08:00