0xez
3f50d572ed
Update README.md, fix the release date of the paper
...
Former-commit-id: 675ba41562d812f169c6b2775e57a3f38fc8deee
2024-03-21 22:14:48 +08:00
hiyouga
63c83f3802
add citation
...
Former-commit-id: 5eaa50fa01a7172408840255d18bcc0ab43a01fb
2024-03-21 17:04:10 +08:00
hiyouga
0684e315be
paper release
...
Former-commit-id: 0581bfdbc7d6c764e63f8d54271da7663ca354d9
2024-03-21 13:49:17 +08:00
hiyouga
ada7e20eb4
update readme
...
Former-commit-id: bfe7a9128952bacef93d5478938d3e088bd0480d
2024-03-21 00:48:42 +08:00
hiyouga
7999836fb6
support fsdp + qlora
...
Former-commit-id: 84082251621e1470b3b5406a56d0a967780a1804
2024-03-21 00:36:06 +08:00
hiyouga
8717e98200
fix #2777 #2895
...
Former-commit-id: 9bec3c98a22c91b1c28fda757db51eb780291641
2024-03-20 17:59:45 +08:00
khazic
13bf8b1f91
Updated README with new information
...
Former-commit-id: 0531dac30d5cbee56b73e06230cd0a62928ee9ca
2024-03-20 14:21:16 +08:00
刘一博
5b8725399e
Updated README with new information
...
Former-commit-id: df9b4fb90a076c18f533da32beb7c42ae5b9ed22
2024-03-20 14:11:28 +08:00
hiyouga
8b8671817f
improve lora+ impl.
...
Former-commit-id: 72367307dfadf936fb989ebe8bc9f0ff229fb933
2024-03-13 23:32:51 +08:00
hiyouga
14ed926a2d
support olmo
...
Former-commit-id: b3247d6a1604f4cbeb0d7c163d0082ce91afb870
2024-03-12 18:30:38 +08:00
hoshi-hiyouga
52f14211e3
Merge pull request #2743 from S3Studio/DockerizeSupport
...
Add dockerize support
Former-commit-id: c901aa63ff4fb6daea7f7da467782e8bf6224d4d
2024-03-12 00:05:49 +08:00
hiyouga
4a4e4b4354
support layerwise galore
...
Former-commit-id: 8664262cde3919e10eaecbd66e8c5d356856362e
2024-03-10 00:24:11 +08:00
hiyouga
17e50bcbb1
add GaLore results
...
Former-commit-id: 818726e9bcdedfbd330ea7a60e02ee5b03aed459
2024-03-09 04:11:55 +08:00
hiyouga
5c00783697
update hardware requirements
...
Former-commit-id: 393c2de27ce0a2dee793092843ec0afa54f49a6d
2024-03-09 03:58:18 +08:00
hiyouga
398c261c7c
fix aqlm version
...
Former-commit-id: 10be2f0eccc3963a985afcd24e5b8b8fc638b1c3
2024-03-09 00:09:09 +08:00
S3Studio
de41334055
Add dockerize support
...
Tested with the Qwen:1.8B model and the alpaca_data_zh dataset. Some Python libraries were added to the Dockerfile in response to exception messages raised during testing.
Former-commit-id: 3d911ae713b901d6680a9f9ac82569cc5878f820
2024-03-08 10:47:28 +08:00
hiyouga
b268215a0e
update readme
...
Former-commit-id: 4a2cc60b9440d245141e9317c35a0ac4c687dbdb
2024-03-08 03:06:21 +08:00
hiyouga
5b50458acf
fix galore
...
Former-commit-id: 33a4c24a8a3c153bc62edf74b9246699a0ae3233
2024-03-08 00:44:51 +08:00
hiyouga
f373290012
add Yi-9B model
...
Former-commit-id: 57452a4aa1d37a047d659f002c1aaa6246f64178
2024-03-07 23:11:57 +08:00
hiyouga
cb2bf680c9
add galore examples
...
Former-commit-id: 7230e1177daf4d96a1205565ab9335085cc8f3a7
2024-03-07 22:53:45 +08:00
hiyouga
2c010c72b8
support galore
...
Former-commit-id: 28f78621883917425fabe49f5473778111012127
2024-03-07 22:41:36 +08:00
hiyouga
1af71f548c
update readme
...
Former-commit-id: 725f7cd70fce502728f785282f1c0d59f23ff434
2024-03-07 20:34:49 +08:00
hiyouga
583d956bda
tiny fix
...
Former-commit-id: 77211d984385247bf7f5f8edea34e9a080a3dc9f
2024-03-07 20:29:34 +08:00
hiyouga
34533b2f35
support vllm
...
Former-commit-id: d07ad5cc1cdbc13879afd84f653afdfee03a6933
2024-03-07 20:26:31 +08:00
hiyouga
31c618f1f7
tiny fix
...
Former-commit-id: 0048a2021e94d068f7c6054df0b9569ae4912eb1
2024-03-06 17:25:08 +08:00
hiyouga
8b21a60d9c
fix add tokens
...
Former-commit-id: 9658c63cd94d28bba730a19f73397580b9865d6b
2024-03-06 15:04:02 +08:00
hiyouga
e887aface7
fix version checking
...
Former-commit-id: 3016e6565708637c1d760f2cd5a67cbd8a5a6c26
2024-03-06 14:51:51 +08:00
hiyouga
02eac3fd09
update readme
...
Former-commit-id: df9e6bb0636160a93f1d4e9562a2e31a08009be3
2024-03-05 03:20:23 +08:00
hiyouga
b04316d9a8
update readme
...
Former-commit-id: 24a79bd50f972008ca1a862edaff7cdd6c211cef
2024-03-04 19:29:26 +08:00
hiyouga
d966aee105
update readme
...
Former-commit-id: 7c227e07dd88051d43e21f44acfed94af5ccd7f6
2024-03-03 01:41:07 +08:00
hiyouga
9ae1514a75
update readme, add starcoder2, cosmopedia
...
Former-commit-id: 894d183214417b10af64d6add7be082d63e8b1f3
2024-03-03 01:01:46 +08:00
hoshi-hiyouga
dcd0d92978
Update README.md
...
Former-commit-id: 4bf7eb72e089b2afd27ed9268b12e661d638d9db
2024-03-03 00:48:47 +08:00
hoshi-hiyouga
5f9b1ad80c
Update README.md
...
Former-commit-id: 585c884ea98d7a40e7b5f4be1278099be139287a
2024-03-03 00:48:06 +08:00
hiyouga
caa6aa9dc5
add colab demo
...
Former-commit-id: 318315c76d029a4db414f79bb42151cec37fd30c
2024-03-02 19:58:21 +08:00
hiyouga
92464eaf30
add twitter
...
Former-commit-id: bb16502c3343471cffd59e65e15105e2464ca390
2024-02-29 17:45:30 +08:00
hiyouga
8e7d50dae4
release v0.5.3
...
Former-commit-id: fa5ab21ebc0ab738178c0c57578db3bda995ae06
2024-02-29 00:34:19 +08:00
hiyouga
845e750abd
add examples
...
Former-commit-id: 804c1e7083e56b4a132d4d820ea9d8d50e5499e9
2024-02-28 23:19:25 +08:00
hiyouga
57f85add58
update chatglm3 template
...
Former-commit-id: 38d8b2cef8d70ce8c390de0317559df7f04b4a5d
2024-02-28 21:11:23 +08:00
hiyouga
9846071c67
update readme
...
Former-commit-id: a2dccce06a9810df89ba6dee68929c3e6f487ef0
2024-02-28 20:50:01 +08:00
hiyouga
5abbca70d3
support DoRA, AWQ, AQLM #2512
...
Former-commit-id: cfefacaa37453a15c55866d019887f24e886a577
2024-02-28 19:53:28 +08:00
hiyouga
3af5fea981
update readme
...
Former-commit-id: 3ba10545937cd9a1dea9cf65d98dd174a205337d
2024-02-26 17:25:47 +08:00
hiyouga
116de2ce48
update readme
...
Former-commit-id: 261f631a1cc708a5713e40102ad558c5dfa6a379
2024-02-25 16:26:08 +08:00
hiyouga
757564caa1
add papers
...
Former-commit-id: aca948da8fcd682e43cd2064eeb9a368af221833
2024-02-25 15:34:47 +08:00
hiyouga
f3feddd502
add papers
...
Former-commit-id: ad76482cf9d4407787fb7fde526c584948a2c0e9
2024-02-25 15:18:58 +08:00
hiyouga
1845e03921
support gemma
...
Former-commit-id: c99e19641a9b893da0a3277bd41bd1d3996d1913
2024-02-21 23:27:36 +08:00
hiyouga
ec42d2c385
tiny fix
...
Former-commit-id: daa318535097a51bdb8546960a9e4b4681c11dfe
2024-02-21 18:30:29 +08:00
hoshi-hiyouga
48dab3ad37
Update README.md
...
Former-commit-id: 869fd208a81efd8a2e4785549684978fc2e17d64
2024-02-20 16:07:55 +08:00
codemayq
e649ddd99f
1. update the version of pre-built bitsandbytes library
...
2. add pre-built flash-attn library
Former-commit-id: d47e40633a9428175db9319f6778eb7c98df02e0
2024-02-20 11:28:25 +08:00
codemayq
16599f2376
1. update the version of pre-built bitsandbytes library
...
2. add pre-built flash-attn library
Former-commit-id: 95f53a46bd1d0012d90a1b6198d35807de1dc100
2024-02-20 11:26:22 +08:00
hiyouga
96265ec154
support llama pro #2338 , add rslora
...
Former-commit-id: 7924ffc55d98e33bfbfbca303e46c8f476435673
2024-02-15 02:27:36 +08:00