| hoshi-hiyouga | 9ccfb97a2c | [misc] update format (#7277) | 2025-03-13 02:53:08 +08:00 |
| hoshi-hiyouga | 7c1640ed5f | [misc] upgrade format to py39 (#7256) | 2025-03-12 00:08:41 +08:00 |
| hoshi-hiyouga | f6779b0e0c | [breaking] support transformers 4.48 (#6628) (Former-commit-id: 15357cdad953bba1f2d294819f56b9746ed1b891) | 2025-01-31 01:36:33 +08:00 |
| hiyouga | 3bcb4633ca | fix #6448 (Former-commit-id: 27198679829fb766c7eef468ae4311fdced695a2) | 2024-12-27 16:54:39 +00:00 |
| hiyouga | 47c2d91933 | support report custom args (Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4) | 2024-12-21 21:42:45 +00:00 |
| hoshi-hiyouga | 547f76e56e | Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab. feat: add swanlab for experiment tracking and visualization. (Former-commit-id: 947e22a4a30d8eb7b612da53bbf538ead7dd27b7) | 2024-12-21 14:09:33 +08:00 |
| ZeYi Lin | cc703b58f5 | fix: by hiyouga suggestion (Former-commit-id: 3a7ea2048a41eafc41fdca944e142f5a0f35a5b3) | 2024-12-20 16:43:03 +08:00 |
| hiyouga | 95d3c2620b | support disable shuffling (Former-commit-id: c7cedc7569973a2879c689637b2923e8b26f1a81) | 2024-12-19 08:53:21 +00:00 |
| hiyouga | 6f1e450739 | fix mrope (Former-commit-id: 2811814fc42fb214b3e8be1055f9f57ffd0ffb12) | 2024-12-12 15:08:17 +00:00 |
| hiyouga | e83cb17f97 | support rank0 logger (Former-commit-id: c38aa29336f286266553da4909a7267d7ef21f37) | 2024-11-02 18:31:04 +08:00 |
| hiyouga | 584ce3a105 | fix incorrect loss value for vlms (Former-commit-id: 30567a1487727473950104718e626ff660f10cbb) | 2024-10-30 08:56:46 +00:00 |
| hiyouga | 825ea1c72d | fix #5747 (Former-commit-id: ae045c884f8ac2aa0ea27592e0757b7bca2dba13) | 2024-10-29 10:47:04 +00:00 |
| hiyouga | 7ccb86b215 | add docstrings, refactor logger (Former-commit-id: 54c69059379d77dc9046c144cbe2d0253de3a4da) | 2024-09-08 00:56:56 +08:00 |
| hiyouga | 51a0016873 | optimize predict vram (Former-commit-id: a244f143f48a01910ce1cd56c0855ef11d62a72a) | 2024-08-30 23:08:45 +08:00 |
| moontidef | b0d32b2041 | fix: rename optimzer to optimizer (Former-commit-id: 40908a36fae3393715f75156867c11e6373fabad) | 2024-08-07 10:05:01 +08:00 |
| hiyouga | ea2d3f6c18 | remove rlhf support for chatglm2&3 (Former-commit-id: 821bb6660e57c29ebf6ac482e78dd2efb8d72437) | 2024-07-02 23:03:17 +08:00 |
| hiyouga | 4828bed837 | upcast logits (Former-commit-id: c13ae2df19ed4cdc849bef55d04225e1a98c19b5) | 2024-07-02 22:32:05 +08:00 |
| hiyouga | cc31014002 | improve rlhf (Former-commit-id: c47ab6c07287fb260ea49b8b7af46bdd416f88f7) | 2024-07-02 22:23:08 +08:00 |
| hiyouga | d3b7c489f2 | add Gemma2 models (Former-commit-id: 6f63050e1b61742d5f7e48bdc62c46748031d7cb) | 2024-06-28 01:26:50 +08:00 |
| hiyouga | 835f0578c2 | refactor pissa, improve llamaboard (Former-commit-id: 8baf3b22b0fb9624807d809832f097301982d192) | 2024-06-28 01:04:24 +08:00 |
| hiyouga | a225b5a70c | tiny fix about badam (Former-commit-id: 095fab58d3692607c9e78747b4218ae1abcf5aaf) | 2024-06-25 01:54:53 +08:00 |
| Jonery | c779899f7b | Cleaner integration. (Former-commit-id: 5c2ff1b749a265dd3c979189ec491d8ac911a6f6) | 2024-06-19 12:29:40 +08:00 |
| Jonery | 3a5eacb4cf | Support distributed BAdam. (Former-commit-id: 0f72aac8c9227e33ad20d2b1641b1c9faae16a5f) | 2024-06-18 12:27:47 +08:00 |
| hiyouga | c0c6b8075a | tiny fix (Former-commit-id: 38b6b0f52edeb8ba45aa03b415b3c0c1b0e0c1e4) | 2024-06-16 01:06:41 +08:00 |
| hiyouga | 2946153cea | add license (Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56) | 2024-06-15 17:54:33 +08:00 |
| hiyouga | 8da149ba40 | rename files (Former-commit-id: 74f96efef9bcd63f65d0190c901ff9be54ccd350) | 2024-06-07 00:09:06 +08:00 |
| hiyouga | cae823ddf0 | rename package (Former-commit-id: 308edbc4260d45907b4a9d3a45ec21d83e48aacb) | 2024-05-16 18:39:08 +08:00 |
|