Commit Graph

159 Commits

Author SHA1 Message Date
hoshi-hiyouga
860549b99b update hparam name 2024-04-26 02:49:39 +08:00
BUAADreamer
2d4ded535f modify some style 2024-04-25 21:58:18 +08:00
BUAADreamer
fc0fa9f048 modify style 2024-04-25 21:27:48 +08:00
BUAADreamer
1dcabafe72 modify style 2024-04-25 21:15:16 +08:00
BUAADreamer
43d7ad5ecc Merge branch 'main' of https://github.com/BUAADreamer/LLaMA-Factory 2024-04-25 21:08:40 +08:00
hiyouga
28571da80a vllm + lora support 2024-04-25 20:24:31 +08:00
BUAADreamer
68cdd9a020 Merge branch 'hiyouga:main' into main 2024-04-25 20:02:50 +08:00
BUAADreamer
c6dd89918f merge data part to the text stream 2024-04-25 19:19:59 +08:00
hiyouga
3a7c1286ce add export_device in webui #3333 2024-04-25 19:02:32 +08:00
BUAADreamer
838eb87a96 merge model part to the text stream 2024-04-25 08:20:41 +08:00
BUAADreamer
7ffee90799 remove conflicts 2024-04-25 00:34:22 +08:00
BUAADreamer
cfb485eddf add llava and instructblip 2024-04-25 00:22:43 +08:00
hiyouga
297fb8ead3 support new special token #3420 2024-04-24 23:39:31 +08:00
hiyouga
07737a3d2d reenable sdpa and fast tok by default 2024-04-24 02:18:44 +08:00
BUAADreamer
4dcb11eab7 add multimodal LLM BLIP-2 and InstructBLIP 2024-04-23 18:45:43 +08:00
hiyouga
db7f3b9784 update readme 2024-04-22 17:09:17 +08:00
hiyouga
fbbe0dba2f fix optimizers 2024-04-21 20:40:54 +08:00
hiyouga
f58425ab45 fix mod stuff 2024-04-21 18:11:10 +08:00
Marco
4fb7e046b3 fix small typo 2024-04-18 20:33:29 +02:00
Marco
620add7b9f Added Mixture of Depths 2024-04-18 20:31:24 +02:00
hiyouga
c00f0771a5 Update parser.py 2024-04-16 18:09:31 +08:00
hoshi-hiyouga
4d660c5ade Merge pull request #3287 from Ledzy/badam [Feature] Add BAdam algorithm 2024-04-16 17:32:16 +08:00
hoshi-hiyouga
4660703674 Update parser.py 2024-04-16 17:27:25 +08:00
hoshi-hiyouga
5b59ff4212 Update parser.py 2024-04-16 17:27:02 +08:00
hoshi-hiyouga
ec899cccf3 Update finetuning_args.py 2024-04-16 17:26:30 +08:00
hiyouga
e0dbac2845 support cohere commandR #3184 2024-04-15 23:26:42 +08:00
Jonery
06c8908d3f Feature BAdam 2024-04-15 23:15:27 +08:00
hiyouga
cce52351b5 update examples 2024-04-15 22:14:34 +08:00
hiyouga
efc345c4b0 fix #3273 2024-04-15 15:32:58 +08:00
hiyouga
232642a621 fix #3238 2024-04-12 14:28:11 +08:00
hiyouga
ce77d98872 fix #3116 2024-04-03 14:47:59 +08:00
hiyouga
92dab8a90b simplify readme 2024-04-02 20:07:43 +08:00
hiyouga
b267aeb53f add moe aux loss control #3085 2024-04-02 14:26:31 +08:00
hiyouga
eb259cc573 support infer 4bit model on GPUs #3023 2024-04-01 17:34:04 +08:00
hiyouga
17bf8a2c3a support ORPO 2024-03-31 18:29:50 +08:00
hiyouga
8d603f8820 fix #2982 2024-03-28 20:22:31 +08:00
hiyouga
3164b4f11b fix bug 2024-03-26 17:30:12 +08:00
hiyouga
511f675402 fix #2961 2024-03-26 17:26:14 +08:00
hiyouga
98a42cbdaa tiny fix 2024-03-25 23:28:52 +08:00
hiyouga
1484f76a95 add arg check 2024-03-25 22:42:58 +08:00
hiyouga
72367307df improve lora+ impl. 2024-03-13 23:32:51 +08:00
齐保元
a0965cd62c [FEATURE]: ADD LORA+ ALGORITHM 2024-03-13 19:43:27 +08:00
hiyouga
96ce76cd27 fix kv cache 2024-03-13 01:21:50 +08:00
hiyouga
19ef482649 support QDoRA 2024-03-12 22:12:42 +08:00
hiyouga
8d8956bad5 fix #2802 2024-03-12 17:08:34 +08:00
hiyouga
07f9b754a7 fix #2782 #2798 2024-03-12 15:53:29 +08:00
hiyouga
be99799413 update parser 2024-03-10 13:35:20 +08:00
hiyouga
8664262cde support layerwise galore 2024-03-10 00:24:11 +08:00
hiyouga
bdb496644c allow non-packing pretraining 2024-03-09 22:21:46 +08:00
hiyouga
393c2de27c update hardware requirements 2024-03-09 03:58:18 +08:00