hoshi-hiyouga
|
c3c0efbaa0
|
[misc] fix packing and eval plot (#7623)
|
2025-04-07 18:20:57 +08:00 |
|
hoshi-hiyouga
|
332f637592
|
disable valset by default (#6690)
Former-commit-id: a1a94f364e33d1d73852f74eda4fa581e6b16533
|
2025-01-17 21:09:30 +08:00 |
|
hoshi-hiyouga
|
7638f1070e
|
[optim] clean apollo (#6645)
* clean apollo code
* update readme
Former-commit-id: 38b8ec4a99189483124b54df9d6bc6b0d318855a
|
2025-01-15 01:42:50 +08:00 |
|
Yaser Afshar
|
6f1c8dacea
|
Add missing key to init_kwargs
Former-commit-id: 03fc4621dad132164596a58d3e8693787b7e1aca
|
2024-12-17 12:34:05 +00:00 |
|
hiyouga
|
1e6f96508a
|
add vllm config
Former-commit-id: 95365f0ce4f362bde7de8b679b54b548d7055bfb
|
2024-11-10 21:28:18 +08:00 |
|
hiyouga
|
59cbce1a46
|
add adam_mini to readme
Former-commit-id: d610c6bcf8a8ba6f4236f5d11f79571b83f4fb11
|
2024-08-09 20:02:03 +08:00 |
|
hiyouga
|
48f0819327
|
fix #4944
Former-commit-id: 9e8cf3b21a0b12d1413c3c7f3d60399784909242
|
2024-07-24 16:42:51 +08:00 |
|
hiyouga
|
35379c7c0e
|
update train hparams
Former-commit-id: 1ca9fce55b55bf209f4b76152b586731932a3f39
|
2024-06-06 01:49:20 +08:00 |
|
hiyouga
|
82d744716a
|
fix #4005 #4013
Former-commit-id: 8608fa268cde5cddf8d0c6c2eb2cb5fa246c1831
|
2024-06-03 19:12:29 +08:00 |
|
hiyouga
|
2bff90719b
|
improve KTO impl., replace datasets
Former-commit-id: e56a57ddcf061de6e4acc8679f7dbf0b68364986
|
2024-05-18 03:44:56 +08:00 |
|
hiyouga
|
a3320f26cf
|
update examples
Former-commit-id: 3b5f138155d96b346bda18e465cf60ec7d99e19c
|
2024-05-17 01:02:00 +08:00 |
|
hiyouga
|
e4972c8fc4
|
update examples
Former-commit-id: 779603055ae9216ff549f5285caac8c0c0a1e9fb
|
2024-05-13 20:39:36 +08:00 |
|
hiyouga
|
50c71dd29f
|
update example docs
Former-commit-id: 102cd42768d9eb2cf1219309a25b41e26149067e
|
2024-05-06 22:51:02 +08:00 |
|
hiyouga
|
ce8200ad98
|
update webui and add CLIs
Former-commit-id: 1368dda22ab875914c9dd86ee5146a4f6a4736ad
|
2024-05-03 02:58:23 +08:00 |
|
hiyouga
|
ba06eb65ca
|
update readme and examples
Former-commit-id: 27dd9bf201c24f7804811398bc2758966ec78432
|
2024-04-22 00:37:32 +08:00 |
|
hiyouga
|
be716972fe
|
remove extras
Former-commit-id: d67e972f8c3d5273e589c8c85c0a1620f59785c5
|
2024-04-22 00:35:41 +08:00 |
|
hiyouga
|
d16561e7a4
|
fix bug in galore optimizer
Former-commit-id: c05ac23261a5a8ba893c2918a43dc7777307407b
|
2024-04-21 18:53:22 +08:00 |
|
hiyouga
|
f8e219dc81
|
fix mod stuff
Former-commit-id: cf3988226e6398c67bb2955578e436fc505aa5c5
|
2024-04-21 18:11:10 +08:00 |
|
hiyouga
|
59e6ebf039
|
update trainers
Former-commit-id: d0dd6eefed0b86895ed00a7cafb331e5193db645
|
2024-03-28 18:16:27 +08:00 |
|
hiyouga
|
7ff8a064f3
|
support layerwise galore
Former-commit-id: d43a4da0947897d0be3f62fad3107754d4c89f2b
|
2024-03-10 00:24:11 +08:00 |
|
hiyouga
|
9b97b23ce7
|
fix aqlm version
Former-commit-id: 05673f81f0295c76957f3247c62f95fda322a63e
|
2024-03-09 00:09:09 +08:00 |
|
hiyouga
|
53ab28533e
|
fix example params
Former-commit-id: 0280748528488d7bee6b9074025255453966124c
|
2024-03-08 20:41:43 +08:00 |
|
hiyouga
|
e416cecf62
|
fix galore
Former-commit-id: 62a3ceeef8f60caef43ccc7f971a0c9184e21296
|
2024-03-08 00:44:51 +08:00 |
|
hiyouga
|
bf812fbe40
|
add galore examples
Former-commit-id: aabf1b99f39aae535401b2f65f0d629def6e39f5
|
2024-03-07 22:53:45 +08:00 |
|