hoshi-hiyouga
|
002f58ef8e
|
[model] add QwQ 32b (#7179)
Former-commit-id: 64a6fb9b5056166265abc5acbddffb64cd8b5256
|
2025-03-06 11:58:36 +08:00 |
|
Ze-Yi LIN
|
c67d2b9327
|
[trainer] fix swanlab callback (#7176)
Former-commit-id: 8ad03258e16309158368384e2a0a707845536133
|
2025-03-06 00:33:37 +08:00 |
|
Ze-Yi LIN
|
210cdb9557
|
[webui] display swanlab exp link (#7089)
* webui add swanlab link
* change callback name
* update
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 891c4875039e8e3b7d0de025ee61c4ff003ff0c4
|
2025-02-27 19:40:54 +08:00 |
|
Eric Tang
|
e55ec42d3c
|
[ray] specify ray storage path (#6920)
Former-commit-id: 6edd4992d700fec56800a638f1cac0f87990c581
|
2025-02-14 21:55:41 +08:00 |
|
hoshi-hiyouga
|
9ef85f8fc4
|
[optim] clean apollo (#6645)
* clean apollo code
* update readme
Former-commit-id: 7a04021d0461caea2c7b82169839340b7f51f463
|
2025-01-15 01:42:50 +08:00 |
|
zhuHQ
|
763f9b9df0
|
[optim] add support to APOLLO (#6617)
Former-commit-id: d9189f9f0b23ff6929044919208e0e813ca95b1c
|
2025-01-15 00:24:56 +08:00 |
|
hiyouga
|
b4174021d6
|
refactor ray integration, support save ckpt
Former-commit-id: d8cac6f54663e6cffeddf2c65e3da454e7b86a75
|
2025-01-07 09:39:10 +00:00 |
|
hiyouga
|
8c57169eb7
|
fix #6546
Former-commit-id: 870f23d7eaff1e32a73fee4eb972163c85ba7b67
|
2025-01-07 06:30:44 +00:00 |
|
hiyouga
|
47c2d91933
|
support report custom args
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
|
2024-12-21 21:42:45 +00:00 |
|
ZeYi Lin
|
8f786ee938
|
feat: ui improve
Former-commit-id: 5f6dafd70e962b8fe9a294d555133002135f80df
|
2024-12-20 11:03:02 +08:00 |
|
ZeYi Lin
|
dd22454fc5
|
fix: bugs
Former-commit-id: d0eb64d5e3472a166c9adac4cb4ba06bdd663e46
|
2024-12-19 21:08:16 +08:00 |
|
ZeYi Lin
|
53103f55b6
|
feat: optimize frontend
Former-commit-id: 8c2df41b937f491f7ebf593b20c65a19738c7642
|
2024-12-19 19:04:19 +08:00 |
|
ZeYi Lin
|
cc5cde734b
|
feat: swanlab params
Former-commit-id: d5cf87990e5bea920ecd1561def09fa17cf328b1
|
2024-12-19 18:47:27 +08:00 |
|
hiyouga
|
1a48340680
|
add swanlab
Former-commit-id: 96f8f103e58a8ff307b0ce36c967de04f452434a
|
2024-12-19 07:12:31 +00:00 |
|
hiyouga
|
e83cb17f97
|
support rank0 logger
Former-commit-id: c38aa29336f286266553da4909a7267d7ef21f37
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
0d8aa6e6ef
|
use pre-commit
Former-commit-id: 21db8ed2f4a0eba203754a92ce0741538e8ee709
|
2024-10-29 09:07:46 +00:00 |
|
hiyouga
|
7ccb86b215
|
add docstrings, refactor logger
Former-commit-id: 54c69059379d77dc9046c144cbe2d0253de3a4da
|
2024-09-08 00:56:56 +08:00 |
|
hiyouga
|
5eacd17090
|
add adam_mini to readme
Former-commit-id: e2a28f51c635d64ff9de65a37087d89356bdedcc
|
2024-08-09 20:02:03 +08:00 |
|
moontidef
|
44f7c4dd56
|
feat: add support for adammini
Former-commit-id: 82bc15dc795f95768b81c25eaaabdc613da30cd8
|
2024-08-07 10:08:22 +08:00 |
|
moontidef
|
b0d32b2041
|
fix: rename optimzer to optimizer
Former-commit-id: 40908a36fae3393715f75156867c11e6373fabad
|
2024-08-07 10:05:01 +08:00 |
|
hiyouga
|
14bc7b0551
|
fix up
Former-commit-id: 29ebcd75d55f70f2891632eba187b643cc3a9e51
|
2024-07-15 01:04:56 +08:00 |
|
hiyouga
|
835f0578c2
|
refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0fb9624807d809832f097301982d192
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
a225b5a70c
|
tiny fix about badam
Former-commit-id: 095fab58d3692607c9e78747b4218ae1abcf5aaf
|
2024-06-25 01:54:53 +08:00 |
|
hoshi-hiyouga
|
fe6ef6400c
|
Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
Former-commit-id: d0f953bf5bdbfd49acc82ff055bd54889241761a
|
2024-06-25 01:49:13 +08:00 |
|
hiyouga
|
7735456561
|
fix templates
Former-commit-id: 4cff6a4ad55b24bf57db6be5cf817180c1ea5626
|
2024-06-19 17:44:05 +08:00 |
|
Jonery
|
c779899f7b
|
Cleaner integration.
Former-commit-id: 5c2ff1b749a265dd3c979189ec491d8ac911a6f6
|
2024-06-19 12:29:40 +08:00 |
|
Jonery
|
5d59f6562a
|
Merge remote-tracking branch 'upstream/main'
Former-commit-id: ea1f3ba5e030504e07053484f50f4cbdb37808bc
|
2024-06-17 18:44:51 +08:00 |
|
hiyouga
|
ce4a27a5f7
|
fix tol
Former-commit-id: 46093b5786611d99adf1fd3d42926a728fc629f8
|
2024-06-16 01:38:44 +08:00 |
|
hiyouga
|
f25b8626bf
|
support pissa
Former-commit-id: 8c1046d78ac6c8f9429b73617e35e1eccb35138f
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
2946153cea
|
add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
81ed4d8abf
|
fix #4209
DeepSpeed ZeRO3 has inflight param error when calling model.eval()
Former-commit-id: cf9f2d6c42b5a37038c9eededbb767eae6a3f67d
|
2024-06-13 02:25:50 +08:00 |
|
hiyouga
|
5834651c4a
|
fix #4198
Former-commit-id: 89f2bd8c8c035181927bd530a7ffc733407d674c
|
2024-06-11 15:38:38 +08:00 |
|
hiyouga
|
d3196318be
|
fix #4120
Former-commit-id: f9e818d79cf686cb34789327add7ed1f749966c6
|
2024-06-07 04:18:05 +08:00 |
|
hiyouga
|
8da149ba40
|
rename files
Former-commit-id: 74f96efef9bcd63f65d0190c901ff9be54ccd350
|
2024-06-07 00:09:06 +08:00 |
|