hoshi-hiyouga
|
860549b99b
|
update hparam name
|
2024-04-26 02:49:39 +08:00 |
|
BUAADreamer
|
2d4ded535f
|
modify some style
|
2024-04-25 21:58:18 +08:00 |
|
BUAADreamer
|
fc0fa9f048
|
modify style
|
2024-04-25 21:27:48 +08:00 |
|
BUAADreamer
|
1dcabafe72
|
modify style
|
2024-04-25 21:15:16 +08:00 |
|
BUAADreamer
|
43d7ad5ecc
|
Merge branch 'main' of https://github.com/BUAADreamer/LLaMA-Factory
|
2024-04-25 21:08:40 +08:00 |
|
hiyouga
|
28571da80a
|
vllm + lora support
|
2024-04-25 20:24:31 +08:00 |
|
BUAADreamer
|
68cdd9a020
|
Merge branch 'hiyouga:main' into main
|
2024-04-25 20:02:50 +08:00 |
|
BUAADreamer
|
c6dd89918f
|
merge data part to the text stream
|
2024-04-25 19:19:59 +08:00 |
|
hiyouga
|
3a7c1286ce
|
add export_device in webui #3333
|
2024-04-25 19:02:32 +08:00 |
|
BUAADreamer
|
838eb87a96
|
merge model part to the text stream
|
2024-04-25 08:20:41 +08:00 |
|
BUAADreamer
|
7ffee90799
|
remove conflicts
|
2024-04-25 00:34:22 +08:00 |
|
BUAADreamer
|
cfb485eddf
|
add llava and instructblip
|
2024-04-25 00:22:43 +08:00 |
|
hiyouga
|
297fb8ead3
|
support new special token #3420
|
2024-04-24 23:39:31 +08:00 |
|
hiyouga
|
07737a3d2d
|
reenable sdpa and fast tok by default
|
2024-04-24 02:18:44 +08:00 |
|
BUAADreamer
|
4dcb11eab7
|
add multimodal LLM BLIP-2 and InstructBLIP
|
2024-04-23 18:45:43 +08:00 |
|
hiyouga
|
db7f3b9784
|
update readme
|
2024-04-22 17:09:17 +08:00 |
|
hiyouga
|
fbbe0dba2f
|
fix optimizers
|
2024-04-21 20:40:54 +08:00 |
|
hiyouga
|
f58425ab45
|
fix mod stuff
|
2024-04-21 18:11:10 +08:00 |
|
Marco
|
4fb7e046b3
|
fix small typo
|
2024-04-18 20:33:29 +02:00 |
|
Marco
|
620add7b9f
|
Added Mixture of Depths
|
2024-04-18 20:31:24 +02:00 |
|
hiyouga
|
c00f0771a5
|
Update parser.py
|
2024-04-16 18:09:31 +08:00 |
|
hoshi-hiyouga
|
4d660c5ade
|
Merge pull request #3287 from Ledzy/badam
[Feature] Add BAdam algorithm
|
2024-04-16 17:32:16 +08:00 |
|
hoshi-hiyouga
|
4660703674
|
Update parser.py
|
2024-04-16 17:27:25 +08:00 |
|
hoshi-hiyouga
|
5b59ff4212
|
Update parser.py
|
2024-04-16 17:27:02 +08:00 |
|
hoshi-hiyouga
|
ec899cccf3
|
Update finetuning_args.py
|
2024-04-16 17:26:30 +08:00 |
|
hiyouga
|
e0dbac2845
|
support cohere commandR #3184
|
2024-04-15 23:26:42 +08:00 |
|
Jonery
|
06c8908d3f
|
Feature BAdam
|
2024-04-15 23:15:27 +08:00 |
|
hiyouga
|
cce52351b5
|
update examples
|
2024-04-15 22:14:34 +08:00 |
|
hiyouga
|
efc345c4b0
|
fix #3273
|
2024-04-15 15:32:58 +08:00 |
|
hiyouga
|
232642a621
|
fix #3238
|
2024-04-12 14:28:11 +08:00 |
|
hiyouga
|
ce77d98872
|
fix #3116
|
2024-04-03 14:47:59 +08:00 |
|
hiyouga
|
92dab8a90b
|
simplify readme
|
2024-04-02 20:07:43 +08:00 |
|
hiyouga
|
b267aeb53f
|
add moe aux loss control #3085
|
2024-04-02 14:26:31 +08:00 |
|
hiyouga
|
eb259cc573
|
support infer 4bit model on GPUs #3023
|
2024-04-01 17:34:04 +08:00 |
|
hiyouga
|
17bf8a2c3a
|
support ORPO
|
2024-03-31 18:29:50 +08:00 |
|
hiyouga
|
8d603f8820
|
fix #2982
|
2024-03-28 20:22:31 +08:00 |
|
hiyouga
|
3164b4f11b
|
fix bug
|
2024-03-26 17:30:12 +08:00 |
|
hiyouga
|
511f675402
|
fix #2961
|
2024-03-26 17:26:14 +08:00 |
|
hiyouga
|
98a42cbdaa
|
tiny fix
|
2024-03-25 23:28:52 +08:00 |
|
hiyouga
|
1484f76a95
|
add arg check
|
2024-03-25 22:42:58 +08:00 |
|
hiyouga
|
72367307df
|
improve lora+ impl.
|
2024-03-13 23:32:51 +08:00 |
|
齐保元
|
a0965cd62c
|
[FEATURE]: ADD LORA+ ALGORITHM
|
2024-03-13 19:43:27 +08:00 |
|
hiyouga
|
96ce76cd27
|
fix kv cache
|
2024-03-13 01:21:50 +08:00 |
|
hiyouga
|
19ef482649
|
support QDoRA
|
2024-03-12 22:12:42 +08:00 |
|
hiyouga
|
8d8956bad5
|
fix #2802
|
2024-03-12 17:08:34 +08:00 |
|
hiyouga
|
07f9b754a7
|
fix #2782 #2798
|
2024-03-12 15:53:29 +08:00 |
|
hiyouga
|
be99799413
|
update parser
|
2024-03-10 13:35:20 +08:00 |
|
hiyouga
|
8664262cde
|
support layerwise galore
|
2024-03-10 00:24:11 +08:00 |
|
hiyouga
|
bdb496644c
|
allow non-packing pretraining
|
2024-03-09 22:21:46 +08:00 |
|
hiyouga
|
393c2de27c
|
update hardware requirements
|
2024-03-09 03:58:18 +08:00 |
|