hoshi-hiyouga
|
264538cb26
|
[misc] upgrade format to py39 (#7256)
|
2025-03-12 00:08:41 +08:00 |
|
Ze-Yi LIN
|
18968405d0
|
[tracking] add swanlab_logdir param (#7219)
* feat: add swanlab_logdir param
* fix
Former-commit-id: 9215ad488b6ac6cd57fe8fa4acdacceb63f68ca5
|
2025-03-11 00:53:07 +08:00 |
|
hoshi-hiyouga
|
9f16c50155
|
[model] add QwQ 32b (#7179)
Former-commit-id: 8897e48b8cd55407812453ddd4ff98ac7bdc4e91
|
2025-03-06 11:58:36 +08:00 |
|
Ze-Yi LIN
|
25bb9f5ad9
|
[trainer] fix swanlab callback (#7176)
Former-commit-id: 6d9acf4bd30db24499118aee16bd19cb19ba9e3d
|
2025-03-06 00:33:37 +08:00 |
|
Ze-Yi LIN
|
11672f760d
|
[webui] display swanlab exp link (#7089)
* webui add swanlab link
* change callback name
* update
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 27a4b93871c63b839c92940766bd7e0177972c9b
|
2025-02-27 19:40:54 +08:00 |
|
Eric Tang
|
76f9bd1820
|
[ray] specify ray storage path (#6920)
Former-commit-id: 4be6b66b1eaa79955e936ce2b747a8837ecd1e49
|
2025-02-14 21:55:41 +08:00 |
|
hoshi-hiyouga
|
7638f1070e
|
[optim] clean apollo (#6645)
* clean apollo code
* update readme
Former-commit-id: 38b8ec4a99189483124b54df9d6bc6b0d318855a
|
2025-01-15 01:42:50 +08:00 |
|
zhuHQ
|
c2120432db
|
[optim] add support to APOLLO (#6617)
Former-commit-id: 5a252e5a458457adbd19da3b68a3897ad2962824
|
2025-01-15 00:24:56 +08:00 |
|
hiyouga
|
944a2aec4d
|
refactor ray integration, support save ckpt
Former-commit-id: 2f50b27e608b2092bfceab6c6e84e6631e973ee2
|
2025-01-07 09:39:10 +00:00 |
|
hiyouga
|
d8bd46f1bf
|
fix #6546
Former-commit-id: 6fcf2f10faf3b1614896b091591eeef96d717e64
|
2025-01-07 06:30:44 +00:00 |
|
hiyouga
|
a897d46049
|
support report custom args
Former-commit-id: d41254c40a1c5cacf9377096adb27efa9bdb79ea
|
2024-12-21 21:42:45 +00:00 |
|
ZeYi Lin
|
e5d9d8c55d
|
feat: ui improve
Former-commit-id: 6a1effb1741a13ae5238b0e9b429b4cbe3b6534f
|
2024-12-20 11:03:02 +08:00 |
|
ZeYi Lin
|
925e421bde
|
fix: bugs
Former-commit-id: a2297f97f7587c77d55fbce9ffa81dc60d0b04a1
|
2024-12-19 21:08:16 +08:00 |
|
ZeYi Lin
|
44895ebe36
|
feat: optimize frontend
Former-commit-id: 4a78603c141d9bd78bcaf81261b443cf082bf51f
|
2024-12-19 19:04:19 +08:00 |
|
ZeYi Lin
|
44dfbf9dbd
|
feat: swanlab params
Former-commit-id: 761b3bdb03e27826fde2ca86d4e37b53c2bbc777
|
2024-12-19 18:47:27 +08:00 |
|
hiyouga
|
7eeeffdb8a
|
add swanlab
Former-commit-id: c85a77c8a8824a56a67d56b97b4877fcd6edeb3d
|
2024-12-19 07:12:31 +00:00 |
|
hiyouga
|
093eda2ad6
|
support rank0 logger
Former-commit-id: 84528eabe560091bfd866b6a0ca864085af7529b
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
248d5daaff
|
use pre-commit
Former-commit-id: 7cfede95df22a9ff236788f04159b6b16b8d04bb
|
2024-10-29 09:07:46 +00:00 |
|
hiyouga
|
7f71276ad8
|
add docstrings, refactor logger
Former-commit-id: c34e489d71f8f539028543ccf8ee92cecedd6276
|
2024-09-08 00:56:56 +08:00 |
|
hiyouga
|
59cbce1a46
|
add adam_mini to readme
Former-commit-id: d610c6bcf8a8ba6f4236f5d11f79571b83f4fb11
|
2024-08-09 20:02:03 +08:00 |
|
moontidef
|
8f42d7df56
|
feat: add support for adammini
Former-commit-id: a2d5fafb705ff44db1711e972490f0abebc2012b
|
2024-08-07 10:08:22 +08:00 |
|
moontidef
|
33a90b9026
|
fix: rename optimzer to optimizer
Former-commit-id: 186dc1fde822e6a603ac273538741ea3853f243e
|
2024-08-07 10:05:01 +08:00 |
|
hiyouga
|
e4d11a117b
|
fix up
Former-commit-id: 43a56cb331fae899ca35b0c312730d4ab79d0c42
|
2024-07-15 01:04:56 +08:00 |
|
hiyouga
|
46f0189e88
|
refactor pissa, improve llamaboard
Former-commit-id: 619556e46c19718f702c97df5d570a2a4c5fb13a
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
9fd7a410bb
|
tiny fix about badam
Former-commit-id: 03f49267c7406e36aee35639f86e6e0383897090
|
2024-06-25 01:54:53 +08:00 |
|
hoshi-hiyouga
|
bfb2ad7c79
|
Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
Former-commit-id: 0dc75275efa7d7540b472783a52ea6aeaa503c0b
|
2024-06-25 01:49:13 +08:00 |
|
hiyouga
|
3e0fa4a8da
|
fix templates
Former-commit-id: 6f357d59b73309c5955683008632e7f320e7dcb1
|
2024-06-19 17:44:05 +08:00 |
|
Jonery
|
fa3150548e
|
Cleaner integration.
Former-commit-id: 26d4b05d424bd71f570195dd433258caf6465d92
|
2024-06-19 12:29:40 +08:00 |
|
Jonery
|
95ae30f678
|
Merge remote-tracking branch 'upstream/main'
Former-commit-id: 37834a7e79473ccf50ad7f67745b97c274c326d9
|
2024-06-17 18:44:51 +08:00 |
|
hiyouga
|
727943f078
|
fix tol
Former-commit-id: bdb54bcb477126687db789bd89f2df84e424a2a3
|
2024-06-16 01:38:44 +08:00 |
|
hiyouga
|
32f45c9e91
|
support pissa
Former-commit-id: ef8e45f2eaf466c54e9a671512a2974575677b08
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
bb88536166
|
add license
Former-commit-id: 69cfc98d7c81756a5ab6bf962240e393e449fef0
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
103a507b39
|
fix #4209
DeepSpeed ZeRO3 has inflight param error when calling model.eval()
Former-commit-id: 4be013f18ea6a35b5a11db98db5f0670ffb41619
|
2024-06-13 02:25:50 +08:00 |
|
hiyouga
|
820b6e7b32
|
fix #4198
Former-commit-id: 945d2c6cc73542adf9272ebd9aa332ea2c1c7361
|
2024-06-11 15:38:38 +08:00 |
|
hiyouga
|
d0edcde4ea
|
fix #4120
Former-commit-id: 2a44da678a5e360a9c0f9056397ac9e801329321
|
2024-06-07 04:18:05 +08:00 |
|
hiyouga
|
fcb134e144
|
rename files
Former-commit-id: e1a8431770fc36c0c9ee7fed4abbc3d7fdcc5efd
|
2024-06-07 00:09:06 +08:00 |
|