Kdump
01166841cf
[trainer] fix wsd scheduler ( #7304 )
...
* [trainer] Warmup_stable_decay supports setting the number of stable and decay steps according to the warmup_ratio ratio
* Update trainer_utils.py
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-03-26 15:25:02 +08:00
hoshi-hiyouga
9ccfb97a2c
[misc] update format ( #7277 )
2025-03-13 02:53:08 +08:00
hoshi-hiyouga
7c1640ed5f
[misc] upgrade format to py39 ( #7256 )
2025-03-12 00:08:41 +08:00
Ze-Yi LIN
0a43bc1960
[tracking] add swanlab_logdir param ( #7219 )
...
* feat: add swanlab_logdir param
* fix
Former-commit-id: a1e76af3d9cf64a6c016bb2333fc815fd4be73cf
2025-03-11 00:53:07 +08:00
hoshi-hiyouga
002f58ef8e
[model] add QwQ 32b ( #7179 )
...
Former-commit-id: 64a6fb9b5056166265abc5acbddffb64cd8b5256
2025-03-06 11:58:36 +08:00
Ze-Yi LIN
c67d2b9327
[trainer] fix swanlab callback ( #7176 )
...
Former-commit-id: 8ad03258e16309158368384e2a0a707845536133
2025-03-06 00:33:37 +08:00
Ze-Yi LIN
210cdb9557
[webui] display swanlab exp link ( #7089 )
...
* webui add swanlab link
* change callback name
* update
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 891c4875039e8e3b7d0de025ee61c4ff003ff0c4
2025-02-27 19:40:54 +08:00
Eric Tang
e55ec42d3c
[ray] specify ray storage path ( #6920 )
...
Former-commit-id: 6edd4992d700fec56800a638f1cac0f87990c581
2025-02-14 21:55:41 +08:00
hoshi-hiyouga
9ef85f8fc4
[optim] clean apollo ( #6645 )
...
* clean apollo code
* update readme
Former-commit-id: 7a04021d0461caea2c7b82169839340b7f51f463
2025-01-15 01:42:50 +08:00
zhuHQ
763f9b9df0
[optim] add support to APOLLO ( #6617 )
...
Former-commit-id: d9189f9f0b23ff6929044919208e0e813ca95b1c
2025-01-15 00:24:56 +08:00
hiyouga
b4174021d6
refactor ray integration, support save ckpt
...
Former-commit-id: d8cac6f54663e6cffeddf2c65e3da454e7b86a75
2025-01-07 09:39:10 +00:00
hiyouga
8c57169eb7
fix #6546
...
Former-commit-id: 870f23d7eaff1e32a73fee4eb972163c85ba7b67
2025-01-07 06:30:44 +00:00
hiyouga
47c2d91933
support report custom args
...
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
2024-12-21 21:42:45 +00:00
ZeYi Lin
8f786ee938
feat: ui improve
...
Former-commit-id: 5f6dafd70e962b8fe9a294d555133002135f80df
2024-12-20 11:03:02 +08:00
ZeYi Lin
dd22454fc5
fix: bugs
...
Former-commit-id: d0eb64d5e3472a166c9adac4cb4ba06bdd663e46
2024-12-19 21:08:16 +08:00
ZeYi Lin
53103f55b6
feat: optimize frontend
...
Former-commit-id: 8c2df41b937f491f7ebf593b20c65a19738c7642
2024-12-19 19:04:19 +08:00
ZeYi Lin
cc5cde734b
feat: swanlab params
...
Former-commit-id: d5cf87990e5bea920ecd1561def09fa17cf328b1
2024-12-19 18:47:27 +08:00
hiyouga
1a48340680
add swanlab
...
Former-commit-id: 96f8f103e58a8ff307b0ce36c967de04f452434a
2024-12-19 07:12:31 +00:00
hiyouga
e83cb17f97
support rank0 logger
...
Former-commit-id: c38aa29336f286266553da4909a7267d7ef21f37
2024-11-02 18:31:04 +08:00
hiyouga
0d8aa6e6ef
use pre-commit
...
Former-commit-id: 21db8ed2f4a0eba203754a92ce0741538e8ee709
2024-10-29 09:07:46 +00:00
hiyouga
7ccb86b215
add docstrings, refactor logger
...
Former-commit-id: 54c69059379d77dc9046c144cbe2d0253de3a4da
2024-09-08 00:56:56 +08:00
hiyouga
5eacd17090
add adam_mini to readme
...
Former-commit-id: e2a28f51c635d64ff9de65a37087d89356bdedcc
2024-08-09 20:02:03 +08:00
moontidef
44f7c4dd56
feat: add support for adammini
...
Former-commit-id: 82bc15dc795f95768b81c25eaaabdc613da30cd8
2024-08-07 10:08:22 +08:00
moontidef
b0d32b2041
fix: rename optimzer to optimizer
...
Former-commit-id: 40908a36fae3393715f75156867c11e6373fabad
2024-08-07 10:05:01 +08:00
hiyouga
14bc7b0551
fix up
...
Former-commit-id: 29ebcd75d55f70f2891632eba187b643cc3a9e51
2024-07-15 01:04:56 +08:00
hiyouga
835f0578c2
refactor pissa, improve llamaboard
...
Former-commit-id: 8baf3b22b0fb9624807d809832f097301982d192
2024-06-28 01:04:24 +08:00
hiyouga
a225b5a70c
tiny fix about badam
...
Former-commit-id: 095fab58d3692607c9e78747b4218ae1abcf5aaf
2024-06-25 01:54:53 +08:00
hoshi-hiyouga
fe6ef6400c
Merge pull request #4352 from Ledzy/main
...
[Enhancement] Support ZeRO-3 when using BAdam
Former-commit-id: d0f953bf5bdbfd49acc82ff055bd54889241761a
2024-06-25 01:49:13 +08:00
hiyouga
7735456561
fix templates
...
Former-commit-id: 4cff6a4ad55b24bf57db6be5cf817180c1ea5626
2024-06-19 17:44:05 +08:00
Jonery
c779899f7b
Cleaner integration.
...
Former-commit-id: 5c2ff1b749a265dd3c979189ec491d8ac911a6f6
2024-06-19 12:29:40 +08:00
Jonery
5d59f6562a
Merge remote-tracking branch 'upstream/main'
...
Former-commit-id: ea1f3ba5e030504e07053484f50f4cbdb37808bc
2024-06-17 18:44:51 +08:00
hiyouga
ce4a27a5f7
fix tol
...
Former-commit-id: 46093b5786611d99adf1fd3d42926a728fc629f8
2024-06-16 01:38:44 +08:00
hiyouga
f25b8626bf
support pissa
...
Former-commit-id: 8c1046d78ac6c8f9429b73617e35e1eccb35138f
2024-06-16 01:08:12 +08:00
hiyouga
2946153cea
add license
...
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
2024-06-15 17:54:33 +08:00
hiyouga
81ed4d8abf
fix #4209
...
DeepSpeed ZeRO3 has inflight param error when calling model.eval()
Former-commit-id: cf9f2d6c42b5a37038c9eededbb767eae6a3f67d
2024-06-13 02:25:50 +08:00
hiyouga
5834651c4a
fix #4198
...
Former-commit-id: 89f2bd8c8c035181927bd530a7ffc733407d674c
2024-06-11 15:38:38 +08:00
hiyouga
d3196318be
fix #4120
...
Former-commit-id: f9e818d79cf686cb34789327add7ed1f749966c6
2024-06-07 04:18:05 +08:00
hiyouga
8da149ba40
rename files
...
Former-commit-id: 74f96efef9bcd63f65d0190c901ff9be54ccd350
2024-06-07 00:09:06 +08:00