BUAADreamer | 31bce63a10 | add llava and instructblip (Former-commit-id: cfb485eddf) | 2024-04-25 00:22:43 +08:00
BUAADreamer | 175b56bced | add multimodal LLM BLIP-2 and InstructBLIP (Former-commit-id: 4dcb11eab7) | 2024-04-23 18:45:43 +08:00
hiyouga | d5d6fb3970 | update readme (Former-commit-id: db7f3b9784) | 2024-04-22 17:09:17 +08:00
hiyouga | 6a3ee1edd9 | fix optimizers (Former-commit-id: fbbe0dba2f) | 2024-04-21 20:40:54 +08:00
hiyouga | ec81d45d27 | fix mod stuff (Former-commit-id: f58425ab45) | 2024-04-21 18:11:10 +08:00
Marco | b6a87e0a3f | fix small typo (Former-commit-id: 4fb7e046b3) | 2024-04-18 20:33:29 +02:00
Marco | 639297a5ef | Added Mixture of Depths (Former-commit-id: 620add7b9f) | 2024-04-18 20:31:24 +02:00
hiyouga | 038d524fcd | Update parser.py (Former-commit-id: c00f0771a5) | 2024-04-16 18:09:31 +08:00
hoshi-hiyouga | 496396b3bc | Merge pull request #3287 from Ledzy/badam; [Feature] Add BAdam algorithm (Former-commit-id: 4d660c5ade) | 2024-04-16 17:32:16 +08:00
hoshi-hiyouga | 0f6d691aa6 | Update parser.py (Former-commit-id: 4660703674) | 2024-04-16 17:27:25 +08:00
hoshi-hiyouga | d68212d016 | Update parser.py (Former-commit-id: 5b59ff4212) | 2024-04-16 17:27:02 +08:00
hoshi-hiyouga | 5f48dd545e | Update finetuning_args.py (Former-commit-id: ec899cccf3) | 2024-04-16 17:26:30 +08:00
hiyouga | 2dc3343b1c | support cohere commandR #3184 (Former-commit-id: e0dbac2845) | 2024-04-15 23:26:42 +08:00
Jonery | 025f329445 | Feature BAdam (Former-commit-id: 06c8908d3f) | 2024-04-15 23:15:27 +08:00
hiyouga | fb385b8c26 | update examples (Former-commit-id: cce52351b5) | 2024-04-15 22:14:34 +08:00
hiyouga | ceccad3419 | fix #3273 (Former-commit-id: efc345c4b0) | 2024-04-15 15:32:58 +08:00
hiyouga | c9d3cc181a | fix #3238 (Former-commit-id: 232642a621) | 2024-04-12 14:28:11 +08:00
hiyouga | 88d9f47a0b | fix #3116 (Former-commit-id: ce77d98872) | 2024-04-03 14:47:59 +08:00
hiyouga | bf5ffeeae0 | simplify readme (Former-commit-id: 92dab8a90b) | 2024-04-02 20:07:43 +08:00
hiyouga | f4be51f356 | add moe aux loss control #3085 (Former-commit-id: b267aeb53f) | 2024-04-02 14:26:31 +08:00
hiyouga | b7468ea0a8 | support infer 4bit model on GPUs #3023 (Former-commit-id: eb259cc573) | 2024-04-01 17:34:04 +08:00
hiyouga | 2f878bde11 | support ORPO (Former-commit-id: 17bf8a2c3a) | 2024-03-31 18:29:50 +08:00
hiyouga | e4f3d583df | fix #2982 (Former-commit-id: 8d603f8820) | 2024-03-28 20:22:31 +08:00
hiyouga | c311375b50 | fix bug (Former-commit-id: 3164b4f11b) | 2024-03-26 17:30:12 +08:00
hiyouga | ec94e5e876 | fix #2961 (Former-commit-id: 511f675402) | 2024-03-26 17:26:14 +08:00
hiyouga | 196a33cca4 | tiny fix (Former-commit-id: 98a42cbdaa) | 2024-03-25 23:28:52 +08:00
hiyouga | b18749fb01 | add arg check (Former-commit-id: 1484f76a95) | 2024-03-25 22:42:58 +08:00
hiyouga | 8b8671817f | improve lora+ impl. (Former-commit-id: 72367307df) | 2024-03-13 23:32:51 +08:00
齐保元 | 24c9277488 | [FEATURE]: ADD LORA+ ALGORITHM (Former-commit-id: a0965cd62c) | 2024-03-13 19:43:27 +08:00
hiyouga | a74426df0f | fix kv cache (Former-commit-id: 96ce76cd27) | 2024-03-13 01:21:50 +08:00
hiyouga | bbf272f96e | support QDoRA (Former-commit-id: 19ef482649) | 2024-03-12 22:12:42 +08:00
hiyouga | 0b7e870b07 | fix #2802 (Former-commit-id: 8d8956bad5) | 2024-03-12 17:08:34 +08:00
hiyouga | 7124b71676 | fix #2782 #2798 (Former-commit-id: 07f9b754a7) | 2024-03-12 15:53:29 +08:00
hiyouga | 566bfad930 | update parser (Former-commit-id: be99799413) | 2024-03-10 13:35:20 +08:00
hiyouga | 4a4e4b4354 | support layerwise galore (Former-commit-id: 8664262cde) | 2024-03-10 00:24:11 +08:00
hiyouga | 868444e124 | allow non-packing pretraining (Former-commit-id: bdb496644c) | 2024-03-09 22:21:46 +08:00
hiyouga | 5c00783697 | update hardware requirements (Former-commit-id: 393c2de27c) | 2024-03-09 03:58:18 +08:00
hiyouga | 7443ac3116 | fix chat engine, update webui (Former-commit-id: 5d956e2a51) | 2024-03-08 03:01:53 +08:00
hiyouga | 2235020cc9 | update galore args (Former-commit-id: 0ac6b40a47) | 2024-03-08 01:17:32 +08:00
hiyouga | 5b50458acf | fix galore (Former-commit-id: 33a4c24a8a) | 2024-03-08 00:44:51 +08:00
hiyouga | 2c010c72b8 | support galore (Former-commit-id: 28f7862188) | 2024-03-07 22:41:36 +08:00
hiyouga | 34533b2f35 | support vllm (Former-commit-id: d07ad5cc1c) | 2024-03-07 20:26:31 +08:00
hiyouga | 37e40563f1 | fix #2735 (Former-commit-id: f74f804a71) | 2024-03-07 16:15:53 +08:00
hiyouga | e887aface7 | fix version checking (Former-commit-id: 3016e65657) | 2024-03-06 14:51:51 +08:00
hiyouga | 9561809ce9 | improve aqlm optim (Former-commit-id: 259af60d28) | 2024-03-05 20:49:50 +08:00
hiyouga | d1e6e02461 | fix #2649 (Former-commit-id: 4e5fae2fac) | 2024-03-01 13:02:41 +08:00
hiyouga | 8e7d50dae4 | release v0.5.3 (Former-commit-id: fa5ab21ebc) | 2024-02-29 00:34:19 +08:00
hiyouga | 5abbca70d3 | support DoRA, AWQ, AQLM #2512 (Former-commit-id: cfefacaa37) | 2024-02-28 19:53:28 +08:00
hiyouga | 2f738a1db6 | fix #2532 (Former-commit-id: 3cc10a01a7) | 2024-02-21 21:55:14 +08:00
hiyouga | 0fcb931f18 | support lora for llama pro (Former-commit-id: 9aeb404a94) | 2024-02-21 02:17:22 +08:00