hoshi-hiyouga
|
b0c8ba73e0
|
[deps] update to transformers 4.52 (#8125)
|
2025-05-21 05:16:18 +08:00 |
|
hoshi-hiyouga
|
610f164c69
|
[trainer] fix pt loss (#7748)
* fix pt loss
* robust
* fix
* test
|
2025-04-17 03:15:35 +08:00 |
|
hoshi-hiyouga
|
42e090d38b
|
[trainer] fix vlm loss for transformers 4.49 (#7448)
|
2025-03-24 10:24:05 +08:00 |
|
hoshi-hiyouga
|
9ccfb97a2c
|
[misc] update format (#7277)
|
2025-03-13 02:53:08 +08:00 |
|
hoshi-hiyouga
|
7c1640ed5f
|
[misc] upgrade format to py39 (#7256)
|
2025-03-12 00:08:41 +08:00 |
|
Billy Cao
|
48173b606c
|
[trainer] fix gen_kwarg to eval during training (#5451)
* Correctly pass gen_kwarg to eval during model runs
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 11eac71c13cd432322b69ae74a3b8fa17af31bc4
|
2025-02-13 02:35:06 +08:00 |
|
hoshi-hiyouga
|
f6779b0e0c
|
[breaking] support transformers 4.48 (#6628)
Former-commit-id: 15357cdad953bba1f2d294819f56b9746ed1b891
|
2025-01-31 01:36:33 +08:00 |
|
hoshi-hiyouga
|
d8cba9464f
|
[inference] fix stop token for object detection (#6624)
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689c54ebb2af264de808502e5a8ba0f2b
|
2025-01-13 21:34:20 +08:00 |
|
hiyouga
|
da8721a70e
|
fix #6499
Former-commit-id: 1800f8c72dfa618c71c84a3a18ecdef4d82754f7
|
2025-01-02 11:28:54 +00:00 |
|
hiyouga
|
3bcb4633ca
|
fix #6448
Former-commit-id: 27198679829fb766c7eef468ae4311fdced695a2
|
2024-12-27 16:54:39 +00:00 |
|
hiyouga
|
47c2d91933
|
support report custom args
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
|
2024-12-21 21:42:45 +00:00 |
|
hoshi-hiyouga
|
547f76e56e
|
Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
Former-commit-id: 947e22a4a30d8eb7b612da53bbf538ead7dd27b7
|
2024-12-21 14:09:33 +08:00 |
|
hiyouga
|
8524dcaa4a
|
fix #6391
Former-commit-id: d4c1fda1ad19e73484d8d51d81e490cdb8781955
|
2024-12-19 12:16:38 +00:00 |
|
hiyouga
|
95d3c2620b
|
support disable shuffling
Former-commit-id: c7cedc7569973a2879c689637b2923e8b26f1a81
|
2024-12-19 08:53:21 +00:00 |
|
hiyouga
|
1a48340680
|
add swanlab
Former-commit-id: 96f8f103e58a8ff307b0ce36c967de04f452434a
|
2024-12-19 07:12:31 +00:00 |
|
hiyouga
|
a94a1eac67
|
support control eos, fix #6345
Former-commit-id: eda76de32bab103c650f246327d214539ae6f291
|
2024-12-17 10:42:05 +00:00 |
|
hiyouga
|
50ca43c3fb
|
fix #6348
Former-commit-id: 142191e4664cb1b920aff2f51d1bac6180f2c24b
|
2024-12-17 10:06:46 +00:00 |
|
hiyouga
|
6f1e450739
|
fix mrope
Former-commit-id: 2811814fc42fb214b3e8be1055f9f57ffd0ffb12
|
2024-12-12 15:08:17 +00:00 |
|
hiyouga
|
235cdcacee
|
support batch infer in vllm
Former-commit-id: 1324d158f954d777f1fbf09f46149c372704b388
|
2024-12-04 13:50:00 +00:00 |
|
hiyouga
|
c2766af6f4
|
fix dpo metrics
Former-commit-id: 4270f7dfb9a12471c91f6c03dce7ca6fd88566e1
|
2024-11-02 20:59:01 +08:00 |
|
hiyouga
|
e83cb17f97
|
support rank0 logger
Former-commit-id: c38aa29336f286266553da4909a7267d7ef21f37
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
584ce3a105
|
fix incorrect loss value for vlms
Former-commit-id: 30567a1487727473950104718e626ff660f10cbb
|
2024-10-30 08:56:46 +00:00 |
|
hiyouga
|
7ccb86b215
|
add docstrings, refactor logger
Former-commit-id: 54c69059379d77dc9046c144cbe2d0253de3a4da
|
2024-09-08 00:56:56 +08:00 |
|
hoshi-hiyouga
|
5af92971bc
|
fix trainer predict
Former-commit-id: 99fd9637bdc25f41fd1abc8a162f1069cb9060d4
|
2024-09-02 10:15:29 +08:00 |
|
hiyouga
|
51a0016873
|
optimize predict vram
Former-commit-id: a244f143f48a01910ce1cd56c0855ef11d62a72a
|
2024-08-30 23:08:45 +08:00 |
|
moontidef
|
b0d32b2041
|
fix: rename optimzer to optimizer
Former-commit-id: 40908a36fae3393715f75156867c11e6373fabad
|
2024-08-07 10:05:01 +08:00 |
|
hiyouga
|
54e786346e
|
add eval acc
Former-commit-id: 1856a08e87b150fa4bffcb0af703ed84d848e24b
|
2024-07-01 03:51:20 +08:00 |
|
hiyouga
|
835f0578c2
|
refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0fb9624807d809832f097301982d192
|
2024-06-28 01:04:24 +08:00 |
|
hzhaoy
|
e1751f6398
|
fix #4579
Former-commit-id: 677c86594e4ea904fde0a557852daf54636b06ae
|
2024-06-27 13:49:57 +08:00 |
|
hiyouga
|
a225b5a70c
|
tiny fix about badam
Former-commit-id: 095fab58d3692607c9e78747b4218ae1abcf5aaf
|
2024-06-25 01:54:53 +08:00 |
|
Jonery
|
c779899f7b
|
Cleaner integration.
Former-commit-id: 5c2ff1b749a265dd3c979189ec491d8ac911a6f6
|
2024-06-19 12:29:40 +08:00 |
|
Jonery
|
3a5eacb4cf
|
Support distributed BAdam.
Former-commit-id: 0f72aac8c9227e33ad20d2b1641b1c9faae16a5f
|
2024-06-18 12:27:47 +08:00 |
|
Jonery
|
5d59f6562a
|
Merge remote-tracking branch 'upstream/main'
Former-commit-id: ea1f3ba5e030504e07053484f50f4cbdb37808bc
|
2024-06-17 18:44:51 +08:00 |
|
Jonery
|
756566342d
|
adapt for badam with ds zero3
Former-commit-id: 33b437277846d4f0b64c13a0bc892ef4f345a21e
|
2024-06-17 18:18:10 +08:00 |
|
hiyouga
|
f25b8626bf
|
support pissa
Former-commit-id: 8c1046d78ac6c8f9429b73617e35e1eccb35138f
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
2946153cea
|
add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
ab66ae8cd2
|
fix #4295
Former-commit-id: 78589cf90c6e12e612f269b1c771f19f3dad83d2
|
2024-06-15 04:34:55 +08:00 |
|
hiyouga
|
8da149ba40
|
rename files
Former-commit-id: 74f96efef9bcd63f65d0190c901ff9be54ccd350
|
2024-06-07 00:09:06 +08:00 |
|
hiyouga
|
cae823ddf0
|
rename package
Former-commit-id: 308edbc4260d45907b4a9d3a45ec21d83e48aacb
|
2024-05-16 18:39:08 +08:00 |
|