Commit Graph

23 Commits

Author SHA1 Message Date
hoshi-hiyouga
77bbf65905 disable valset by default (#6690) 2025-01-17 21:09:30 +08:00
hoshi-hiyouga
7a04021d04 [optim] clean apollo (#6645)
* clean apollo code

* update readme
2025-01-15 01:42:50 +08:00
Yaser Afshar
1c8ad22a5f Add missing key to init_kwargs 2024-12-17 12:34:05 +00:00
hiyouga
58ab4579dc add vllm config 2024-11-10 21:28:18 +08:00
hiyouga
e2a28f51c6 add adam_mini to readme 2024-08-09 20:02:03 +08:00
hiyouga
1bbd49faae fix #4944 2024-07-24 16:42:51 +08:00
hiyouga
dc4a00dd63 update train hparams 2024-06-06 01:49:20 +08:00
hiyouga
eed33862bc fix #4005 #4013 2024-06-03 19:12:29 +08:00
hiyouga
c450ee87a3 improve KTO impl., replace datasets 2024-05-18 03:44:56 +08:00
hiyouga
ddec9e1b84 update examples 2024-05-17 01:02:00 +08:00
hiyouga
dae83f4199 update examples 2024-05-13 20:39:36 +08:00
hiyouga
f02f87c6fb update example docs 2024-05-06 22:51:02 +08:00
hiyouga
245fe47ece update webui and add CLIs 2024-05-03 02:58:23 +08:00
hiyouga
a1f1fac33b update readme and examples 2024-04-22 00:37:32 +08:00
hiyouga
ddbd29d777 remove extras 2024-04-22 00:35:41 +08:00
hiyouga
5c62881c5a fix bug in galore optimizer 2024-04-21 18:53:22 +08:00
hiyouga
f58425ab45 fix mod stuff 2024-04-21 18:11:10 +08:00
hiyouga
8c77b10912 update trainers 2024-03-28 18:16:27 +08:00
hiyouga
8664262cde support layerwise galore 2024-03-10 00:24:11 +08:00
hiyouga
10be2f0ecc fix aqlm version 2024-03-09 00:09:09 +08:00
hiyouga
8a45213440 fix example params 2024-03-08 20:41:43 +08:00
hiyouga
33a4c24a8a fix galore 2024-03-08 00:44:51 +08:00
hiyouga
7230e1177d add galore examples 2024-03-07 22:53:45 +08:00