hoshi-hiyouga
|
5817cda37e
|
[misc] fix packing and eval plot (#7623)
|
2025-04-07 18:20:57 +08:00 |
|
hoshi-hiyouga
|
bbf334f823
|
disable valset by default (#6690)
Former-commit-id: 77bbf659053e1b205974eb6df69998fee0305d26
|
2025-01-17 21:09:30 +08:00 |
|
hoshi-hiyouga
|
9ef85f8fc4
|
[optim] clean apollo (#6645)
* clean apollo code
* update readme
Former-commit-id: 7a04021d0461caea2c7b82169839340b7f51f463
|
2025-01-15 01:42:50 +08:00 |
|
Yaser Afshar
|
76ebd62ac1
|
Add missing key to init_kwargs
Former-commit-id: 1c8ad22a5f167bf4e1c845e273583e5cb3a0214e
|
2024-12-17 12:34:05 +00:00 |
|
hiyouga
|
0d18cca0db
|
add vllm config
Former-commit-id: 58ab4579dc81a1dcea2bf5938ba3f3116cecfc76
|
2024-11-10 21:28:18 +08:00 |
|
hiyouga
|
5eacd17090
|
add adam_mini to readme
Former-commit-id: e2a28f51c635d64ff9de65a37087d89356bdedcc
|
2024-08-09 20:02:03 +08:00 |
|
hiyouga
|
fae881b854
|
fix #4944
Former-commit-id: 1bbd49faaef438f49cb5340166cb13faee8fb854
|
2024-07-24 16:42:51 +08:00 |
|
hiyouga
|
00b3fb4d14
|
update train hparams
Former-commit-id: dc4a00dd63769dc02d898c8bad2c158e4e5c0447
|
2024-06-06 01:49:20 +08:00 |
|
hiyouga
|
e4ce59243b
|
fix #4005 #4013
Former-commit-id: eed33862bc733361f3c28b3c95dc0eb4ea00884c
|
2024-06-03 19:12:29 +08:00 |
|
hiyouga
|
13d7b48efe
|
improve KTO impl., replace datasets
Former-commit-id: c450ee87a35ff9235f9b695b0de2e042b2971178
|
2024-05-18 03:44:56 +08:00 |
|
hiyouga
|
dfff5119b4
|
update examples
Former-commit-id: ddec9e1b842d407790637e9b0b181f8b26926db9
|
2024-05-17 01:02:00 +08:00 |
|
hiyouga
|
3318b6e188
|
update examples
Former-commit-id: dae83f419919305cb23bb2b9da1277a1616179c5
|
2024-05-13 20:39:36 +08:00 |
|
hiyouga
|
92cafef325
|
update example docs
Former-commit-id: f02f87c6fbd20adae105c83526baa23dba2042fd
|
2024-05-06 22:51:02 +08:00 |
|
hiyouga
|
289d1f3679
|
update webui and add CLIs
Former-commit-id: 245fe47ece22a4b7822449b126715aaa8ec25aba
|
2024-05-03 02:58:23 +08:00 |
|
hiyouga
|
d8deb0f99e
|
update readme and examples
Former-commit-id: a1f1fac33b2a727b38e8ba52d68a224814d4848b
|
2024-04-22 00:37:32 +08:00 |
|
hiyouga
|
92e24a73cb
|
remove extras
Former-commit-id: ddbd29d77702f7b82051d930e3eac1b47f5c6d35
|
2024-04-22 00:35:41 +08:00 |
|
hiyouga
|
9e45f82be7
|
fix bug in galore optimizer
Former-commit-id: 5c62881c5a59cfcc5a76d365263c8ad8c817ce49
|
2024-04-21 18:53:22 +08:00 |
|
hiyouga
|
ec81d45d27
|
fix mod stuff
Former-commit-id: f58425ab45727f7859583d4b9fda776715e27ff6
|
2024-04-21 18:11:10 +08:00 |
|
hiyouga
|
89c400633a
|
update trainers
Former-commit-id: 8c77b1091296e204dc3c8c1f157c288ca5b236bd
|
2024-03-28 18:16:27 +08:00 |
|
hiyouga
|
4a4e4b4354
|
support layerwise galore
Former-commit-id: 8664262cde3919e10eaecbd66e8c5d356856362e
|
2024-03-10 00:24:11 +08:00 |
|
hiyouga
|
398c261c7c
|
fix aqlm version
Former-commit-id: 10be2f0eccc3963a985afcd24e5b8b8fc638b1c3
|
2024-03-09 00:09:09 +08:00 |
|
hiyouga
|
ccec17f773
|
fix example params
Former-commit-id: 8a45213440ffc960947dd69ecf3b092aa724bef3
|
2024-03-08 20:41:43 +08:00 |
|
hiyouga
|
5b50458acf
|
fix galore
Former-commit-id: 33a4c24a8a3c153bc62edf74b9246699a0ae3233
|
2024-03-08 00:44:51 +08:00 |
|
hiyouga
|
cb2bf680c9
|
add galore examples
Former-commit-id: 7230e1177daf4d96a1205565ab9335085cc8f3a7
|
2024-03-07 22:53:45 +08:00 |
|