Commit Graph

146 Commits

Author SHA1 Message Date
BUAADreamer
31bce63a10 add llava and instructblip
Former-commit-id: cfb485eddf
2024-04-25 00:22:43 +08:00
BUAADreamer
175b56bced add multimodal LLM BLIP-2 and InstructBLIP
Former-commit-id: 4dcb11eab7
2024-04-23 18:45:43 +08:00
hiyouga
d5d6fb3970 update readme
Former-commit-id: db7f3b9784
2024-04-22 17:09:17 +08:00
hiyouga
6a3ee1edd9 fix optimizers
Former-commit-id: fbbe0dba2f
2024-04-21 20:40:54 +08:00
hiyouga
ec81d45d27 fix mod stuff
Former-commit-id: f58425ab45
2024-04-21 18:11:10 +08:00
Marco
b6a87e0a3f fix small typo
Former-commit-id: 4fb7e046b3
2024-04-18 20:33:29 +02:00
Marco
639297a5ef Added Mixture of Depths
Former-commit-id: 620add7b9f
2024-04-18 20:31:24 +02:00
hiyouga
038d524fcd Update parser.py
Former-commit-id: c00f0771a5
2024-04-16 18:09:31 +08:00
hoshi-hiyouga
496396b3bc Merge pull request #3287 from Ledzy/badam
[Feature] Add BAdam algorithm

Former-commit-id: 4d660c5ade
2024-04-16 17:32:16 +08:00
hoshi-hiyouga
0f6d691aa6 Update parser.py
Former-commit-id: 4660703674
2024-04-16 17:27:25 +08:00
hoshi-hiyouga
d68212d016 Update parser.py
Former-commit-id: 5b59ff4212
2024-04-16 17:27:02 +08:00
hoshi-hiyouga
5f48dd545e Update finetuning_args.py
Former-commit-id: ec899cccf3
2024-04-16 17:26:30 +08:00
hiyouga
2dc3343b1c support cohere commandR #3184
Former-commit-id: e0dbac2845
2024-04-15 23:26:42 +08:00
Jonery
025f329445 Feature BAdam
Former-commit-id: 06c8908d3f
2024-04-15 23:15:27 +08:00
hiyouga
fb385b8c26 update examples
Former-commit-id: cce52351b5
2024-04-15 22:14:34 +08:00
hiyouga
ceccad3419 fix #3273
Former-commit-id: efc345c4b0
2024-04-15 15:32:58 +08:00
hiyouga
c9d3cc181a fix #3238
Former-commit-id: 232642a621
2024-04-12 14:28:11 +08:00
hiyouga
88d9f47a0b fix #3116
Former-commit-id: ce77d98872
2024-04-03 14:47:59 +08:00
hiyouga
bf5ffeeae0 simplify readme
Former-commit-id: 92dab8a90b
2024-04-02 20:07:43 +08:00
hiyouga
f4be51f356 add moe aux loss control #3085
Former-commit-id: b267aeb53f
2024-04-02 14:26:31 +08:00
hiyouga
b7468ea0a8 support infer 4bit model on GPUs #3023
Former-commit-id: eb259cc573
2024-04-01 17:34:04 +08:00
hiyouga
2f878bde11 support ORPO
Former-commit-id: 17bf8a2c3a
2024-03-31 18:29:50 +08:00
hiyouga
e4f3d583df fix #2982
Former-commit-id: 8d603f8820
2024-03-28 20:22:31 +08:00
hiyouga
c311375b50 fix bug
Former-commit-id: 3164b4f11b
2024-03-26 17:30:12 +08:00
hiyouga
ec94e5e876 fix #2961
Former-commit-id: 511f675402
2024-03-26 17:26:14 +08:00
hiyouga
196a33cca4 tiny fix
Former-commit-id: 98a42cbdaa
2024-03-25 23:28:52 +08:00
hiyouga
b18749fb01 add arg check
Former-commit-id: 1484f76a95
2024-03-25 22:42:58 +08:00
hiyouga
8b8671817f improve lora+ impl.
Former-commit-id: 72367307df
2024-03-13 23:32:51 +08:00
齐保元
24c9277488 [FEATURE]: ADD LORA+ ALGORITHM
Former-commit-id: a0965cd62c
2024-03-13 19:43:27 +08:00
hiyouga
a74426df0f fix kv cache
Former-commit-id: 96ce76cd27
2024-03-13 01:21:50 +08:00
hiyouga
bbf272f96e support QDoRA
Former-commit-id: 19ef482649
2024-03-12 22:12:42 +08:00
hiyouga
0b7e870b07 fix #2802
Former-commit-id: 8d8956bad5
2024-03-12 17:08:34 +08:00
hiyouga
7124b71676 fix #2782 #2798
Former-commit-id: 07f9b754a7
2024-03-12 15:53:29 +08:00
hiyouga
566bfad930 update parser
Former-commit-id: be99799413
2024-03-10 13:35:20 +08:00
hiyouga
4a4e4b4354 support layerwise galore
Former-commit-id: 8664262cde
2024-03-10 00:24:11 +08:00
hiyouga
868444e124 allow non-packing pretraining
Former-commit-id: bdb496644c
2024-03-09 22:21:46 +08:00
hiyouga
5c00783697 update hardware requirements
Former-commit-id: 393c2de27c
2024-03-09 03:58:18 +08:00
hiyouga
7443ac3116 fix chat engine, update webui
Former-commit-id: 5d956e2a51
2024-03-08 03:01:53 +08:00
hiyouga
2235020cc9 update galore args
Former-commit-id: 0ac6b40a47
2024-03-08 01:17:32 +08:00
hiyouga
5b50458acf fix galore
Former-commit-id: 33a4c24a8a
2024-03-08 00:44:51 +08:00
hiyouga
2c010c72b8 support galore
Former-commit-id: 28f7862188
2024-03-07 22:41:36 +08:00
hiyouga
34533b2f35 support vllm
Former-commit-id: d07ad5cc1c
2024-03-07 20:26:31 +08:00
hiyouga
37e40563f1 fix #2735
Former-commit-id: f74f804a71
2024-03-07 16:15:53 +08:00
hiyouga
e887aface7 fix version checking
Former-commit-id: 3016e65657
2024-03-06 14:51:51 +08:00
hiyouga
9561809ce9 improve aqlm optim
Former-commit-id: 259af60d28
2024-03-05 20:49:50 +08:00
hiyouga
d1e6e02461 fix #2649
Former-commit-id: 4e5fae2fac
2024-03-01 13:02:41 +08:00
hiyouga
8e7d50dae4 release v0.5.3
Former-commit-id: fa5ab21ebc
2024-02-29 00:34:19 +08:00
hiyouga
5abbca70d3 support DoRA, AWQ, AQLM #2512
Former-commit-id: cfefacaa37
2024-02-28 19:53:28 +08:00
hiyouga
2f738a1db6 fix #2532
Former-commit-id: 3cc10a01a7
2024-02-21 21:55:14 +08:00
hiyouga
0fcb931f18 support lora for llama pro
Former-commit-id: 9aeb404a94
2024-02-21 02:17:22 +08:00