24 Commits

Author SHA1 Message Date
hoshi-hiyouga
5817cda37e
[misc] fix packing and eval plot (#7623) 2025-04-07 18:20:57 +08:00
hoshi-hiyouga
bbf334f823 disable valset by default (#6690)
Former-commit-id: 77bbf659053e1b205974eb6df69998fee0305d26
2025-01-17 21:09:30 +08:00
hoshi-hiyouga
9ef85f8fc4 [optim] clean apollo (#6645)
* clean apollo code

* update readme

Former-commit-id: 7a04021d0461caea2c7b82169839340b7f51f463
2025-01-15 01:42:50 +08:00
Yaser Afshar
76ebd62ac1 Add missing key to init_kwargs
Former-commit-id: 1c8ad22a5f167bf4e1c845e273583e5cb3a0214e
2024-12-17 12:34:05 +00:00
hiyouga
0d18cca0db add vllm config
Former-commit-id: 58ab4579dc81a1dcea2bf5938ba3f3116cecfc76
2024-11-10 21:28:18 +08:00
hiyouga
5eacd17090 add adam_mini to readme
Former-commit-id: e2a28f51c635d64ff9de65a37087d89356bdedcc
2024-08-09 20:02:03 +08:00
hiyouga
fae881b854 fix #4944
Former-commit-id: 1bbd49faaef438f49cb5340166cb13faee8fb854
2024-07-24 16:42:51 +08:00
hiyouga
00b3fb4d14 update train hparams
Former-commit-id: dc4a00dd63769dc02d898c8bad2c158e4e5c0447
2024-06-06 01:49:20 +08:00
hiyouga
e4ce59243b fix #4005 #4013
Former-commit-id: eed33862bc733361f3c28b3c95dc0eb4ea00884c
2024-06-03 19:12:29 +08:00
hiyouga
13d7b48efe improve KTO impl., replace datasets
Former-commit-id: c450ee87a35ff9235f9b695b0de2e042b2971178
2024-05-18 03:44:56 +08:00
hiyouga
dfff5119b4 update examples
Former-commit-id: ddec9e1b842d407790637e9b0b181f8b26926db9
2024-05-17 01:02:00 +08:00
hiyouga
3318b6e188 update examples
Former-commit-id: dae83f419919305cb23bb2b9da1277a1616179c5
2024-05-13 20:39:36 +08:00
hiyouga
92cafef325 update example docs
Former-commit-id: f02f87c6fbd20adae105c83526baa23dba2042fd
2024-05-06 22:51:02 +08:00
hiyouga
289d1f3679 update webui and add CLIs
Former-commit-id: 245fe47ece22a4b7822449b126715aaa8ec25aba
2024-05-03 02:58:23 +08:00
hiyouga
d8deb0f99e update readme and examples
Former-commit-id: a1f1fac33b2a727b38e8ba52d68a224814d4848b
2024-04-22 00:37:32 +08:00
hiyouga
92e24a73cb remove extras
Former-commit-id: ddbd29d77702f7b82051d930e3eac1b47f5c6d35
2024-04-22 00:35:41 +08:00
hiyouga
9e45f82be7 fix bug in galore optimizer
Former-commit-id: 5c62881c5a59cfcc5a76d365263c8ad8c817ce49
2024-04-21 18:53:22 +08:00
hiyouga
ec81d45d27 fix mod stuff
Former-commit-id: f58425ab45727f7859583d4b9fda776715e27ff6
2024-04-21 18:11:10 +08:00
hiyouga
89c400633a update trainers
Former-commit-id: 8c77b1091296e204dc3c8c1f157c288ca5b236bd
2024-03-28 18:16:27 +08:00
hiyouga
4a4e4b4354 support layerwise galore
Former-commit-id: 8664262cde3919e10eaecbd66e8c5d356856362e
2024-03-10 00:24:11 +08:00
hiyouga
398c261c7c fix aqlm version
Former-commit-id: 10be2f0eccc3963a985afcd24e5b8b8fc638b1c3
2024-03-09 00:09:09 +08:00
hiyouga
ccec17f773 fix example params
Former-commit-id: 8a45213440ffc960947dd69ecf3b092aa724bef3
2024-03-08 20:41:43 +08:00
hiyouga
5b50458acf fix galore
Former-commit-id: 33a4c24a8a3c153bc62edf74b9246699a0ae3233
2024-03-08 00:44:51 +08:00
hiyouga
cb2bf680c9 add galore examples
Former-commit-id: 7230e1177daf4d96a1205565ab9335085cc8f3a7
2024-03-07 22:53:45 +08:00