hiyouga
|
ba06eb65ca
|
update readme and examples
Former-commit-id: 27dd9bf201c24f7804811398bc2758966ec78432
|
2024-04-22 00:37:32 +08:00 |
|
hiyouga
|
be716972fe
|
remove extras
Former-commit-id: d67e972f8c3d5273e589c8c85c0a1620f59785c5
|
2024-04-22 00:35:41 +08:00 |
|
hiyouga
|
d16561e7a4
|
fix bug in galore optimizer
Former-commit-id: c05ac23261a5a8ba893c2918a43dc7777307407b
|
2024-04-21 18:53:22 +08:00 |
|
hiyouga
|
f8e219dc81
|
fix mod stuff
Former-commit-id: cf3988226e6398c67bb2955578e436fc505aa5c5
|
2024-04-21 18:11:10 +08:00 |
|
Marco
|
44cda2eece
|
Added Mixture of Depths
Former-commit-id: 75dd98b9abc847e22cb263c17ebcd2ca5dd98345
|
2024-04-18 20:31:24 +02:00 |
|
hoshi-hiyouga
|
de728d0371
|
Update sft.sh
Former-commit-id: 2b4b1562e91bbb02e345e71b7721da9333c0791b
|
2024-04-16 17:25:40 +08:00 |
|
Jonery
|
6dd6b3e396
|
resolve gradient checkpointing issue.
Former-commit-id: 6df9135d063bb6102f0cbcdf0d702076f5febbae
|
2024-04-16 12:05:27 +08:00 |
|
Jonery
|
d4d471450f
|
Feature BAdam
Former-commit-id: d8d2807fbcf587c37f7fd34a23e9397d2775ceed
|
2024-04-15 23:15:27 +08:00 |
|
hiyouga
|
276f2cb24e
|
update examples
Former-commit-id: 369294b31c8a03a1cafcee83eb31a817007d3c49
|
2024-04-15 22:14:34 +08:00 |
|
hiyouga
|
38b59664e6
|
update examples
Former-commit-id: c078582a759f6bce6e760cd39a05883f7eb194fe
|
2024-04-02 20:51:21 +08:00 |
|
hiyouga
|
933a084999
|
update examples
Former-commit-id: bf36b16e48d6438de6d0b2f2bfe33f7895699b9d
|
2024-04-02 20:41:49 +08:00 |
|
hiyouga
|
c1510d19c7
|
update readme
Former-commit-id: 9b8e7ccdab167f53fb897e1940562682324e8ff0
|
2024-04-02 20:37:37 +08:00 |
|
hiyouga
|
b12176d818
|
simplify readme
Former-commit-id: 0da6ec2d516326fe9c7583ba71cd1778eb838178
|
2024-04-02 20:07:43 +08:00 |
|
hiyouga
|
59e6ebf039
|
update trainers
Former-commit-id: d0dd6eefed0b86895ed00a7cafb331e5193db645
|
2024-03-28 18:16:27 +08:00 |
|
hiyouga
|
46f99ff277
|
improve lora+ impl.
Former-commit-id: 332bad25455a70ad9204e7dd384bb086d789aa39
|
2024-03-13 23:32:51 +08:00 |
|
齐保元
|
3c91e86268
|
[FEATURE]: ADD LORA+ ALGORITHM
Former-commit-id: c35b3c3b1e27171f8a703f88ede1dc8a84c80a56
|
2024-03-13 19:43:27 +08:00 |
|
hiyouga
|
7ff8a064f3
|
support layerwise galore
Former-commit-id: d43a4da0947897d0be3f62fad3107754d4c89f2b
|
2024-03-10 00:24:11 +08:00 |
|
hiyouga
|
8ed1463236
|
update examples
Former-commit-id: 38592faa258f7331afb95bc5db4b9bf37f08105d
|
2024-03-09 02:30:37 +08:00 |
|
hiyouga
|
9b97b23ce7
|
fix aqlm version
Former-commit-id: 05673f81f0295c76957f3247c62f95fda322a63e
|
2024-03-09 00:09:09 +08:00 |
|
hiyouga
|
53ab28533e
|
fix example params
Former-commit-id: 0280748528488d7bee6b9074025255453966124c
|
2024-03-08 20:41:43 +08:00 |
|
hiyouga
|
e416cecf62
|
fix galore
Former-commit-id: 62a3ceeef8f60caef43ccc7f971a0c9184e21296
|
2024-03-08 00:44:51 +08:00 |
|
hiyouga
|
bf812fbe40
|
add galore examples
Former-commit-id: aabf1b99f39aae535401b2f65f0d629def6e39f5
|
2024-03-07 22:53:45 +08:00 |
|