| Author | Commit | Message | Former-commit-id | Date |
|---|---|---|---|---|
| hiyouga | 485a80d294 | tiny fix | 2289436567a7860d25d9da0afb39e4a3e5e83839 | 2024-06-17 17:47:25 +08:00 |
| hiyouga | 32f45c9e91 | support pissa | ef8e45f2eaf466c54e9a671512a2974575677b08 | 2024-06-16 01:08:12 +08:00 |
| hiyouga | 46f441dd37 | update examples | 19681f93db399d695aa8e35f8ec2a9e720875baa | 2024-06-13 03:15:06 +08:00 |
| hiyouga | 937f49ec3d | lora modules: all by default | 52c4ae87c7f4312704c31ef26b079b2c5b95ea5f | 2024-06-06 03:53:28 +08:00 |
| hiyouga | 35379c7c0e | update train hparams | 1ca9fce55b55bf209f4b76152b586731932a3f39 | 2024-06-06 01:49:20 +08:00 |
| hiyouga | 2ac2cde03e | tiny fix | f9d50501aac1f60a3b445ca3fee9aa60995461ee | 2024-06-04 00:31:10 +08:00 |
| hiyouga | 82d744716a | fix #4005 #4013 | 8608fa268cde5cddf8d0c6c2eb2cb5fa246c1831 | 2024-06-03 19:12:29 +08:00 |
| hiyouga | 2bff90719b | improve KTO impl., replace datasets | e56a57ddcf061de6e4acc8679f7dbf0b68364986 | 2024-05-18 03:44:56 +08:00 |
| hiyouga | 92b3697e2c | update badam example #3764 | a3730fd0a96bab869be6d695031182dabaea8137 | 2024-05-17 02:21:10 +08:00 |
| hiyouga | a3320f26cf | update examples | 3b5f138155d96b346bda18e465cf60ec7d99e19c | 2024-05-17 01:02:00 +08:00 |
| hiyouga | 538c79fd8f | fix #3694 | 3d1b818cb6a77b7603724fbeb756b468aa74e7ea | 2024-05-16 00:35:28 +08:00 |
| hiyouga | e4972c8fc4 | update examples | 779603055ae9216ff549f5285caac8c0c0a1e9fb | 2024-05-13 20:39:36 +08:00 |
| hiyouga | 50c71dd29f | update example docs | 102cd42768d9eb2cf1219309a25b41e26149067e | 2024-05-06 22:51:02 +08:00 |
| hiyouga | 5c9da798b5 | update docs | a4a2e94241bea6f96590f6cb8ca8b5cddee1917e | 2024-05-06 21:47:00 +08:00 |
| Oscar | d0597897bf | Fix badam example outdated argument | 29aa188cc774cb72367f706f1cd4c07bc5a9f241 | 2024-05-05 23:35:19 +08:00 |
| hiyouga | ce8200ad98 | update webui and add CLIs | 1368dda22ab875914c9dd86ee5146a4f6a4736ad | 2024-05-03 02:58:23 +08:00 |
| hiyouga | ba06eb65ca | update readme and examples | 27dd9bf201c24f7804811398bc2758966ec78432 | 2024-04-22 00:37:32 +08:00 |
| hiyouga | be716972fe | remove extras | d67e972f8c3d5273e589c8c85c0a1620f59785c5 | 2024-04-22 00:35:41 +08:00 |
| hiyouga | d16561e7a4 | fix bug in galore optimizer | c05ac23261a5a8ba893c2918a43dc7777307407b | 2024-04-21 18:53:22 +08:00 |
| hiyouga | f8e219dc81 | fix mod stuff | cf3988226e6398c67bb2955578e436fc505aa5c5 | 2024-04-21 18:11:10 +08:00 |
| Marco | 44cda2eece | Added Mixture of Depths | 75dd98b9abc847e22cb263c17ebcd2ca5dd98345 | 2024-04-18 20:31:24 +02:00 |
| hoshi-hiyouga | de728d0371 | Update sft.sh | 2b4b1562e91bbb02e345e71b7721da9333c0791b | 2024-04-16 17:25:40 +08:00 |
| Jonery | 6dd6b3e396 | resolve gradient checkpointing issue. | 6df9135d063bb6102f0cbcdf0d702076f5febbae | 2024-04-16 12:05:27 +08:00 |
| Jonery | d4d471450f | Feature BAdam | d8d2807fbcf587c37f7fd34a23e9397d2775ceed | 2024-04-15 23:15:27 +08:00 |
| hiyouga | 276f2cb24e | update examples | 369294b31c8a03a1cafcee83eb31a817007d3c49 | 2024-04-15 22:14:34 +08:00 |
| hiyouga | 38b59664e6 | update examples | c078582a759f6bce6e760cd39a05883f7eb194fe | 2024-04-02 20:51:21 +08:00 |
| hiyouga | 933a084999 | update examples | bf36b16e48d6438de6d0b2f2bfe33f7895699b9d | 2024-04-02 20:41:49 +08:00 |
| hiyouga | c1510d19c7 | update readme | 9b8e7ccdab167f53fb897e1940562682324e8ff0 | 2024-04-02 20:37:37 +08:00 |
| hiyouga | b12176d818 | simplify readme | 0da6ec2d516326fe9c7583ba71cd1778eb838178 | 2024-04-02 20:07:43 +08:00 |
| hiyouga | 59e6ebf039 | update trainers | d0dd6eefed0b86895ed00a7cafb331e5193db645 | 2024-03-28 18:16:27 +08:00 |
| hiyouga | 46f99ff277 | improve lora+ impl. | 332bad25455a70ad9204e7dd384bb086d789aa39 | 2024-03-13 23:32:51 +08:00 |
| 齐保元 | 3c91e86268 | [FEATURE]: ADD LORA+ ALGORITHM | c35b3c3b1e27171f8a703f88ede1dc8a84c80a56 | 2024-03-13 19:43:27 +08:00 |
| hiyouga | 7ff8a064f3 | support layerwise galore | d43a4da0947897d0be3f62fad3107754d4c89f2b | 2024-03-10 00:24:11 +08:00 |
| hiyouga | 8ed1463236 | update examples | 38592faa258f7331afb95bc5db4b9bf37f08105d | 2024-03-09 02:30:37 +08:00 |
| hiyouga | 9b97b23ce7 | fix aqlm version | 05673f81f0295c76957f3247c62f95fda322a63e | 2024-03-09 00:09:09 +08:00 |
| hiyouga | 53ab28533e | fix example params | 0280748528488d7bee6b9074025255453966124c | 2024-03-08 20:41:43 +08:00 |
| hiyouga | e416cecf62 | fix galore | 62a3ceeef8f60caef43ccc7f971a0c9184e21296 | 2024-03-08 00:44:51 +08:00 |
| hiyouga | bf812fbe40 | add galore examples | aabf1b99f39aae535401b2f65f0d629def6e39f5 | 2024-03-07 22:53:45 +08:00 |