hiyouga
2dc3343b1c
support cohere commandR #3184
...
Former-commit-id: e0dbac28450a0e1e0b84e1577ef785fc762c0b46
2024-04-15 23:26:42 +08:00
hiyouga
431e9804ee
release v0.6.2
...
Former-commit-id: 9d4c949461d232a959c14859ae7fef191faab711
2024-04-11 20:08:51 +08:00
hiyouga
527cce1eb5
update readme
...
Former-commit-id: a88fe8c1af5e17c2ac459b225a2e03b2cb013783
2024-04-07 00:48:24 +08:00
hiyouga
a6d347726f
fix requires for windows
...
Former-commit-id: 7f6e4126041f2eaa577426310622a115049fbde7
2024-04-03 21:56:43 +08:00
hiyouga
4b23159f53
update vllm example
...
Former-commit-id: 49a2dfaf90982079239d116fcfac9ca72a8fc2b5
2024-04-02 22:45:20 +08:00
hiyouga
3b16f97c6c
update readme
...
Former-commit-id: 66b0fe4e964ad4d882935910f5e512666c54c2b2
2024-04-02 22:17:48 +08:00
hiyouga
03c538ebb3
add zh readme
...
Former-commit-id: 7765f337c7f0d7b8866beecb990c346700146c67
2024-04-02 20:58:45 +08:00
hiyouga
135c4e3512
update readme
...
Former-commit-id: 11a6c1bad65a86b0f3d9c5e5df84d246d7d368df
2024-04-02 20:37:37 +08:00
hiyouga
291ac11156
update readme
...
Former-commit-id: 949e5fe63811afb05cc687a2117295d494979b69
2024-04-02 20:22:11 +08:00
hiyouga
bf5ffeeae0
simplify readme
...
Former-commit-id: 92dab8a90bdd82a72a06559943467b56dde12c71
2024-04-02 20:07:43 +08:00
hiyouga
8d987b7af7
add qwen1.5 moe
...
Former-commit-id: 54b7d349088a828d32551fde56b3467a82df1b9b
2024-04-01 21:49:40 +08:00
hiyouga
34f1de0574
fix #3077
...
Former-commit-id: aee634cd20e6dfdfbe2fbb47ae57f62b2da2bf9a
2024-04-01 21:35:18 +08:00
hiyouga
ddad9be81d
update readme
...
Former-commit-id: 099db6acc08f9a0da18f642455f29012607a920f
2024-03-31 18:46:34 +08:00
hiyouga
2f878bde11
support ORPO
...
Former-commit-id: 17bf8a2c3a7bb5b83071c8659cfd8751e894e692
2024-03-31 18:29:50 +08:00
hiyouga
1421db282a
update readme
...
Former-commit-id: c1fe6ce782a5167bdf020c8af96a268a236f61db
2024-03-28 22:02:32 +08:00
hiyouga
b265002c19
add project
...
Former-commit-id: 1e43319f9cdf53741b4bf6c61749c409db119388
2024-03-28 20:24:27 +08:00
hiyouga
a9d5b4b68e
update readme
...
Former-commit-id: 6c94305e4746c9a735ff62a6428e295d1a67da52
2024-03-28 18:35:11 +08:00
hiyouga
89c400633a
update trainers
...
Former-commit-id: 8c77b1091296e204dc3c8c1f157c288ca5b236bd
2024-03-28 18:16:27 +08:00
hiyouga
e90c3769e5
update readme
...
Former-commit-id: 7b3d8188f5e4416f326b1dc98ad941020461f67f
2024-03-25 23:06:13 +08:00
hoshi-hiyouga
94fb50c52a
Merge pull request #2967 from Tsumugii24/main
...
Update README_zh.md
Former-commit-id: f633ac6646306f448bc77c1e261c98f15421df75
2024-03-25 23:02:22 +08:00
Tsumugii24
03c387c543
Update README.md
...
Former-commit-id: 1704599503a4c6921a8e78c2b4b940232ca1ba5d
2024-03-25 22:54:38 +08:00
hiyouga
27151b8c65
release v0.6.0
...
Former-commit-id: 6f2b563f125fe51ee32753e58f902a4911ab757c
2024-03-25 22:38:56 +08:00
hiyouga
58aa576ae5
fix #2941
...
Former-commit-id: a1c8c98c5fecfc0dd0ed1be33ee8dd2ade05b708
2024-03-24 00:28:44 +08:00
0xez
3f50d572ed
Update README.md, fix the release date of the paper
...
Former-commit-id: 675ba41562d812f169c6b2775e57a3f38fc8deee
2024-03-21 22:14:48 +08:00
hiyouga
63c83f3802
add citation
...
Former-commit-id: 5eaa50fa01a7172408840255d18bcc0ab43a01fb
2024-03-21 17:04:10 +08:00
hiyouga
0684e315be
paper release
...
Former-commit-id: 0581bfdbc7d6c764e63f8d54271da7663ca354d9
2024-03-21 13:49:17 +08:00
hiyouga
ada7e20eb4
update readme
...
Former-commit-id: bfe7a9128952bacef93d5478938d3e088bd0480d
2024-03-21 00:48:42 +08:00
hiyouga
7999836fb6
support fsdp + qlora
...
Former-commit-id: 84082251621e1470b3b5406a56d0a967780a1804
2024-03-21 00:36:06 +08:00
hiyouga
8717e98200
fix #2777 #2895
...
Former-commit-id: 9bec3c98a22c91b1c28fda757db51eb780291641
2024-03-20 17:59:45 +08:00
khazic
13bf8b1f91
Updated README with new information
...
Former-commit-id: 0531dac30d5cbee56b73e06230cd0a62928ee9ca
2024-03-20 14:21:16 +08:00
刘一博
5b8725399e
Updated README with new information
...
Former-commit-id: df9b4fb90a076c18f533da32beb7c42ae5b9ed22
2024-03-20 14:11:28 +08:00
hiyouga
8b8671817f
improve lora+ impl.
...
Former-commit-id: 72367307dfadf936fb989ebe8bc9f0ff229fb933
2024-03-13 23:32:51 +08:00
hiyouga
14ed926a2d
support olmo
...
Former-commit-id: b3247d6a1604f4cbeb0d7c163d0082ce91afb870
2024-03-12 18:30:38 +08:00
hoshi-hiyouga
52f14211e3
Merge pull request #2743 from S3Studio/DockerizeSupport
...
Add dockerize support
Former-commit-id: c901aa63ff4fb6daea7f7da467782e8bf6224d4d
2024-03-12 00:05:49 +08:00
hiyouga
4a4e4b4354
support layerwise galore
...
Former-commit-id: 8664262cde3919e10eaecbd66e8c5d356856362e
2024-03-10 00:24:11 +08:00
hiyouga
17e50bcbb1
add GaLore results
...
Former-commit-id: 818726e9bcdedfbd330ea7a60e02ee5b03aed459
2024-03-09 04:11:55 +08:00
hiyouga
5c00783697
update hardware requirements
...
Former-commit-id: 393c2de27ce0a2dee793092843ec0afa54f49a6d
2024-03-09 03:58:18 +08:00
hiyouga
398c261c7c
fix aqlm version
...
Former-commit-id: 10be2f0eccc3963a985afcd24e5b8b8fc638b1c3
2024-03-09 00:09:09 +08:00
S3Studio
de41334055
Add dockerize support
...
Already tested with the model of Qwen:1.8B and the dataset of alpaca_data_zh. Some python libraries are added to the Dockerfile as a result of the exception messages displayed throughout test procedure.
Former-commit-id: 3d911ae713b901d6680a9f9ac82569cc5878f820
2024-03-08 10:47:28 +08:00
hiyouga
b268215a0e
update readme
...
Former-commit-id: 4a2cc60b9440d245141e9317c35a0ac4c687dbdb
2024-03-08 03:06:21 +08:00
hiyouga
5b50458acf
fix galore
...
Former-commit-id: 33a4c24a8a3c153bc62edf74b9246699a0ae3233
2024-03-08 00:44:51 +08:00
hiyouga
f373290012
add Yi-9B model
...
Former-commit-id: 57452a4aa1d37a047d659f002c1aaa6246f64178
2024-03-07 23:11:57 +08:00
hiyouga
cb2bf680c9
add galore examples
...
Former-commit-id: 7230e1177daf4d96a1205565ab9335085cc8f3a7
2024-03-07 22:53:45 +08:00
hiyouga
2c010c72b8
support galore
...
Former-commit-id: 28f78621883917425fabe49f5473778111012127
2024-03-07 22:41:36 +08:00
hiyouga
1af71f548c
update readme
...
Former-commit-id: 725f7cd70fce502728f785282f1c0d59f23ff434
2024-03-07 20:34:49 +08:00
hiyouga
583d956bda
tiny fix
...
Former-commit-id: 77211d984385247bf7f5f8edea34e9a080a3dc9f
2024-03-07 20:29:34 +08:00
hiyouga
34533b2f35
support vllm
...
Former-commit-id: d07ad5cc1cdbc13879afd84f653afdfee03a6933
2024-03-07 20:26:31 +08:00
hiyouga
31c618f1f7
tiny fix
...
Former-commit-id: 0048a2021e94d068f7c6054df0b9569ae4912eb1
2024-03-06 17:25:08 +08:00
hiyouga
8b21a60d9c
fix add tokens
...
Former-commit-id: 9658c63cd94d28bba730a19f73397580b9865d6b
2024-03-06 15:04:02 +08:00
hiyouga
e887aface7
fix version checking
...
Former-commit-id: 3016e6565708637c1d760f2cd5a67cbd8a5a6c26
2024-03-06 14:51:51 +08:00