Commit Graph

562 Commits

Author SHA1 Message Date
khazic
13bf8b1f91 Updated README with new information
Former-commit-id: 0531dac30d
2024-03-20 14:21:16 +08:00
刘一博
5b8725399e Updated README with new information
Former-commit-id: df9b4fb90a
2024-03-20 14:11:28 +08:00
hiyouga
8b8671817f improve lora+ impl.
Former-commit-id: 72367307df
2024-03-13 23:32:51 +08:00
hiyouga
14ed926a2d support olmo
Former-commit-id: b3247d6a16
2024-03-12 18:30:38 +08:00
hoshi-hiyouga
52f14211e3 Merge pull request #2743 from S3Studio/DockerizeSupport
Add dockerize support

Former-commit-id: c901aa63ff
2024-03-12 00:05:49 +08:00
hiyouga
4a4e4b4354 support layerwise galore
Former-commit-id: 8664262cde
2024-03-10 00:24:11 +08:00
hiyouga
17e50bcbb1 add GaLore results
Former-commit-id: 818726e9bc
2024-03-09 04:11:55 +08:00
hiyouga
5c00783697 update hardware requirements
Former-commit-id: 393c2de27c
2024-03-09 03:58:18 +08:00
hiyouga
398c261c7c fix aqlm version
Former-commit-id: 10be2f0ecc
2024-03-09 00:09:09 +08:00
S3Studio
de41334055 Add dockerize support
Already tested with the model of Qwen:1.8B and the dataset of alpaca_data_zh. Some python libraries are added to the Dockerfile as a result of the exception messages displayed throughout test procedure.


Former-commit-id: 3d911ae713
2024-03-08 10:47:28 +08:00
hiyouga
b268215a0e update readme
Former-commit-id: 4a2cc60b94
2024-03-08 03:06:21 +08:00
hiyouga
5b50458acf fix galore
Former-commit-id: 33a4c24a8a
2024-03-08 00:44:51 +08:00
hiyouga
f373290012 add Yi-9B model
Former-commit-id: 57452a4aa1
2024-03-07 23:11:57 +08:00
hiyouga
cb2bf680c9 add galore examples
Former-commit-id: 7230e1177d
2024-03-07 22:53:45 +08:00
hiyouga
2c010c72b8 support galore
Former-commit-id: 28f7862188
2024-03-07 22:41:36 +08:00
hiyouga
1af71f548c update readme
Former-commit-id: 725f7cd70f
2024-03-07 20:34:49 +08:00
hiyouga
583d956bda tiny fix
Former-commit-id: 77211d9843
2024-03-07 20:29:34 +08:00
hiyouga
34533b2f35 support vllm
Former-commit-id: d07ad5cc1c
2024-03-07 20:26:31 +08:00
hiyouga
31c618f1f7 tiny fix
Former-commit-id: 0048a2021e
2024-03-06 17:25:08 +08:00
hiyouga
8b21a60d9c fix add tokens
Former-commit-id: 9658c63cd9
2024-03-06 15:04:02 +08:00
hiyouga
e887aface7 fix version checking
Former-commit-id: 3016e65657
2024-03-06 14:51:51 +08:00
hiyouga
02eac3fd09 update readme
Former-commit-id: df9e6bb063
2024-03-05 03:20:23 +08:00
hiyouga
b04316d9a8 update readme
Former-commit-id: 24a79bd50f
2024-03-04 19:29:26 +08:00
hiyouga
d966aee105 update readme
Former-commit-id: 7c227e07dd
2024-03-03 01:41:07 +08:00
hiyouga
9ae1514a75 update readme, add starcoder2, cosmopedia
Former-commit-id: 894d183214
2024-03-03 01:01:46 +08:00
hoshi-hiyouga
dcd0d92978 Update README.md
Former-commit-id: 4bf7eb72e0
2024-03-03 00:48:47 +08:00
hoshi-hiyouga
5f9b1ad80c Update README.md
Former-commit-id: 585c884ea9
2024-03-03 00:48:06 +08:00
hiyouga
caa6aa9dc5 add colab demo
Former-commit-id: 318315c76d
2024-03-02 19:58:21 +08:00
hiyouga
92464eaf30 add twitter
Former-commit-id: bb16502c33
2024-02-29 17:45:30 +08:00
hiyouga
8e7d50dae4 release v0.5.3
Former-commit-id: fa5ab21ebc
2024-02-29 00:34:19 +08:00
hiyouga
845e750abd add examples
Former-commit-id: 804c1e7083
2024-02-28 23:19:25 +08:00
hiyouga
57f85add58 update chatglm3 template
Former-commit-id: 38d8b2cef8
2024-02-28 21:11:23 +08:00
hiyouga
9846071c67 update readme
Former-commit-id: a2dccce06a
2024-02-28 20:50:01 +08:00
hiyouga
5abbca70d3 support DoRA, AWQ, AQLM #2512
Former-commit-id: cfefacaa37
2024-02-28 19:53:28 +08:00
hiyouga
3af5fea981 update readme
Former-commit-id: 3ba1054593
2024-02-26 17:25:47 +08:00
hiyouga
116de2ce48 update readme
Former-commit-id: 261f631a1c
2024-02-25 16:26:08 +08:00
hiyouga
757564caa1 add papers
Former-commit-id: aca948da8f
2024-02-25 15:34:47 +08:00
hiyouga
f3feddd502 add papers
Former-commit-id: ad76482cf9
2024-02-25 15:18:58 +08:00
hiyouga
1845e03921 support gemma
Former-commit-id: c99e19641a
2024-02-21 23:27:36 +08:00
hiyouga
ec42d2c385 tiny fix
Former-commit-id: daa3185350
2024-02-21 18:30:29 +08:00
hoshi-hiyouga
48dab3ad37 Update README.md
Former-commit-id: 869fd208a8
2024-02-20 16:07:55 +08:00
codemayq
e649ddd99f 1. update the version of pre-built bitsandbytes library
2. add pre-built flash-attn library


Former-commit-id: d47e40633a
2024-02-20 11:28:25 +08:00
codemayq
16599f2376 1. update the version of pre-built bitsandbytes library
2. add pre-built flash-attn library


Former-commit-id: 95f53a46bd
2024-02-20 11:26:22 +08:00
hiyouga
96265ec154 support llama pro #2338 , add rslora
Former-commit-id: 7924ffc55d
2024-02-15 02:27:36 +08:00
hiyouga
db2051684b improve aligner
Former-commit-id: 7d2dc83c5e
2024-02-10 16:39:19 +08:00
hiyouga
36f092b53f improve fix tokenizer
Former-commit-id: 54ea9684ed
2024-02-09 14:53:14 +08:00
hoshi-hiyouga
186ba72d72 Merge pull request #2423 from mayflower/main
Support for german sft and dpo

Former-commit-id: d0daaa01f9
2024-02-07 15:58:20 +08:00
hiyouga
dcfb9b5cfa support qwen1.5
Former-commit-id: ccabb5b04a
2024-02-06 00:10:51 +08:00
Johann-Peter Hartmann
c264eb4793 Add support for german datasets
Former-commit-id: d9a8301ed4
2024-01-30 10:18:01 +01:00
hiyouga
9f11bdfe8a release v0.5.0 (real)
Former-commit-id: a0d59aa4ec
2024-01-21 01:54:49 +08:00