146 Commits

Author SHA1 Message Date
BUAADreamer
31bce63a10 add llava and instructblip
Former-commit-id: cfb485eddff0130422416b50c50e171fccc8103e
2024-04-25 00:22:43 +08:00
BUAADreamer
175b56bced add multimodal LLM BLIP-2 and InstructBLIP
Former-commit-id: 4dcb11eab7bbeac866043d2a7c748b8d06fbd243
2024-04-23 18:45:43 +08:00
hiyouga
d5d6fb3970 update readme
Former-commit-id: db7f3b9784d21ef5c18a11679ad995bb97d61f7c
2024-04-22 17:09:17 +08:00
hiyouga
6a3ee1edd9 fix optimizers
Former-commit-id: fbbe0dba2f244557fff8ac0076a9f312d0aa01ab
2024-04-21 20:40:54 +08:00
hiyouga
ec81d45d27 fix mod stuff
Former-commit-id: f58425ab45727f7859583d4b9fda776715e27ff6
2024-04-21 18:11:10 +08:00
Marco
b6a87e0a3f fix small typo
Former-commit-id: 4fb7e046b3599aebeeba5b150acb6369f70ecb49
2024-04-18 20:33:29 +02:00
Marco
639297a5ef Added Mixture of Depths
Former-commit-id: 620add7b9f634de1a711f7b87b16050adf735e9b
2024-04-18 20:31:24 +02:00
hiyouga
038d524fcd Update parser.py
Former-commit-id: c00f0771a5ab2a422e0300dcb6d88e8609a3b997
2024-04-16 18:09:31 +08:00
hoshi-hiyouga
496396b3bc Merge pull request #3287 from Ledzy/badam
[Feature] Add BAdam algorithm

Former-commit-id: 4d660c5ade384df4444fa0543a39edce6220903d
2024-04-16 17:32:16 +08:00
hoshi-hiyouga
0f6d691aa6 Update parser.py
Former-commit-id: 4660703674233949a8ba8c76bdb17dafc9d620d4
2024-04-16 17:27:25 +08:00
hoshi-hiyouga
d68212d016 Update parser.py
Former-commit-id: 5b59ff421204115ced405a2e3d56ac0ee8c5b788
2024-04-16 17:27:02 +08:00
hoshi-hiyouga
5f48dd545e Update finetuning_args.py
Former-commit-id: ec899cccf3b8710510e496a3cd8e4c302bb99a19
2024-04-16 17:26:30 +08:00
hiyouga
2dc3343b1c support cohere commandR #3184
Former-commit-id: e0dbac28450a0e1e0b84e1577ef785fc762c0b46
2024-04-15 23:26:42 +08:00
Jonery
025f329445 Feature BAdam
Former-commit-id: 06c8908d3fe48907ddb585c5fa15677fc5416f94
2024-04-15 23:15:27 +08:00
hiyouga
fb385b8c26 update examples
Former-commit-id: cce52351b54f70904f33902d9c17411134f9f6eb
2024-04-15 22:14:34 +08:00
hiyouga
ceccad3419 fix #3273
Former-commit-id: efc345c4b0095ec959ea23bbe54c344278780cbe
2024-04-15 15:32:58 +08:00
hiyouga
c9d3cc181a fix #3238
Former-commit-id: 232642a6215de9b57a1627f39b4efea127948a01
2024-04-12 14:28:11 +08:00
hiyouga
88d9f47a0b fix #3116
Former-commit-id: ce77d98872fa377fd4bc961701b07982f4b51491
2024-04-03 14:47:59 +08:00
hiyouga
bf5ffeeae0 simplify readme
Former-commit-id: 92dab8a90bdd82a72a06559943467b56dde12c71
2024-04-02 20:07:43 +08:00
hiyouga
f4be51f356 add moe aux loss control #3085
Former-commit-id: b267aeb53fc49d2eeb0f3fc5ebe55e643f5db377
2024-04-02 14:26:31 +08:00
hiyouga
b7468ea0a8 support infer 4bit model on GPUs #3023
Former-commit-id: eb259cc5738dfb383e4cc5d32579501c580e11b1
2024-04-01 17:34:04 +08:00
hiyouga
2f878bde11 support ORPO
Former-commit-id: 17bf8a2c3a7bb5b83071c8659cfd8751e894e692
2024-03-31 18:29:50 +08:00
hiyouga
e4f3d583df fix #2982
Former-commit-id: 8d603f8820efd1617557f2bc5d9674143abe7c57
2024-03-28 20:22:31 +08:00
hiyouga
c311375b50 fix bug
Former-commit-id: 3164b4f11b72684c8aa2105037cb36c47b6acfd4
2024-03-26 17:30:12 +08:00
hiyouga
ec94e5e876 fix #2961
Former-commit-id: 511f6754026fbbf48bd481018015338a6a3ad92f
2024-03-26 17:26:14 +08:00
hiyouga
196a33cca4 tiny fix
Former-commit-id: 98a42cbdaa4a90dbe5edda1c412c17e628324f52
2024-03-25 23:28:52 +08:00
hiyouga
b18749fb01 add arg check
Former-commit-id: 1484f76a95bcf40e4c668d52fed68d68c9745a75
2024-03-25 22:42:58 +08:00
hiyouga
8b8671817f improve lora+ impl.
Former-commit-id: 72367307dfadf936fb989ebe8bc9f0ff229fb933
2024-03-13 23:32:51 +08:00
齐保元
24c9277488 [FEATURE]: ADD LORA+ ALGORITHM
Former-commit-id: a0965cd62c85545aa2364e244295df2963308354
2024-03-13 19:43:27 +08:00
hiyouga
a74426df0f fix kv cache
Former-commit-id: 96ce76cd2753bc91c781ad13aa8f7a972abe815a
2024-03-13 01:21:50 +08:00
hiyouga
bbf272f96e support QDoRA
Former-commit-id: 19ef4826490b79e0c2aee20ad67430aa0e4724a7
2024-03-12 22:12:42 +08:00
hiyouga
0b7e870b07 fix #2802
Former-commit-id: 8d8956bad542c0e1c0f7edbf4ffc22bb0f8788ae
2024-03-12 17:08:34 +08:00
hiyouga
7124b71676 fix #2782 #2798
Former-commit-id: 07f9b754a7418b489e839bd674aa47094583a92d
2024-03-12 15:53:29 +08:00
hiyouga
566bfad930 update parser
Former-commit-id: be99799413e1ba37807a02838bf2d87fd966bf55
2024-03-10 13:35:20 +08:00
hiyouga
4a4e4b4354 support layerwise galore
Former-commit-id: 8664262cde3919e10eaecbd66e8c5d356856362e
2024-03-10 00:24:11 +08:00
hiyouga
868444e124 allow non-packing pretraining
Former-commit-id: bdb496644ce2c18806fc4fdae1fedcb3e5b5f808
2024-03-09 22:21:46 +08:00
hiyouga
5c00783697 update hardware requirements
Former-commit-id: 393c2de27ce0a2dee793092843ec0afa54f49a6d
2024-03-09 03:58:18 +08:00
hiyouga
7443ac3116 fix chat engine, update webui
Former-commit-id: 5d956e2a5167201aecdfce2794c25d8a2d84e234
2024-03-08 03:01:53 +08:00
hiyouga
2235020cc9 update galore args
Former-commit-id: 0ac6b40a4772b61a3476bb74b976d24c408a2c35
2024-03-08 01:17:32 +08:00
hiyouga
5b50458acf fix galore
Former-commit-id: 33a4c24a8a3c153bc62edf74b9246699a0ae3233
2024-03-08 00:44:51 +08:00
hiyouga
2c010c72b8 support galore
Former-commit-id: 28f78621883917425fabe49f5473778111012127
2024-03-07 22:41:36 +08:00
hiyouga
34533b2f35 support vllm
Former-commit-id: d07ad5cc1cdbc13879afd84f653afdfee03a6933
2024-03-07 20:26:31 +08:00
hiyouga
37e40563f1 fix #2735
Former-commit-id: f74f804a715dfb16bf24a056bc95db6b102f9ed7
2024-03-07 16:15:53 +08:00
hiyouga
e887aface7 fix version checking
Former-commit-id: 3016e6565708637c1d760f2cd5a67cbd8a5a6c26
2024-03-06 14:51:51 +08:00
hiyouga
9561809ce9 improve aqlm optim
Former-commit-id: 259af60d28985b919911587716c24a3ac7f7de64
2024-03-05 20:49:50 +08:00
hiyouga
d1e6e02461 fix #2649
Former-commit-id: 4e5fae2fac85227641bd16159cf296a32e0b18b4
2024-03-01 13:02:41 +08:00
hiyouga
8e7d50dae4 release v0.5.3
Former-commit-id: fa5ab21ebc0ab738178c0c57578db3bda995ae06
2024-02-29 00:34:19 +08:00
hiyouga
5abbca70d3 support DoRA, AWQ, AQLM #2512
Former-commit-id: cfefacaa37453a15c55866d019887f24e886a577
2024-02-28 19:53:28 +08:00
hiyouga
2f738a1db6 fix #2532
Former-commit-id: 3cc10a01a792a92b99b952a45bb21c25097fccf6
2024-02-21 21:55:14 +08:00
hiyouga
0fcb931f18 support lora for llama pro
Former-commit-id: 9aeb404a946795d6c4fa3cb45e3e96ffeec13646
2024-02-21 02:17:22 +08:00