hoshi-hiyouga
|
0a0cfeb782
|
[breaking] bump transformers to 4.45.0 & improve ci (#7746)
* update ci
* fix
* fix
* fix
* fix
* fix
|
2025-04-17 02:36:48 +08:00 |
|
Shawn Tao
|
8f5f4cc559
|
[trainer] fix key error (#7635)
|
2025-04-08 18:39:50 +08:00 |
|
hoshi-hiyouga
|
5817cda37e
|
[misc] fix packing and eval plot (#7623)
|
2025-04-07 18:20:57 +08:00 |
|
hoshi-hiyouga
|
42e090d38b
|
[trainer] fix vlm loss for transformers 4.49 (#7448)
|
2025-03-24 10:24:05 +08:00 |
|
hoshi-hiyouga
|
9ccfb97a2c
|
[misc] update format (#7277)
|
2025-03-13 02:53:08 +08:00 |
|
hoshi-hiyouga
|
7c1640ed5f
|
[misc] upgrade format to py39 (#7256)
|
2025-03-12 00:08:41 +08:00 |
|
Billy Cao
|
48173b606c
|
[trainer] fix gen_kwarg to eval during training (#5451)
* Correctly pass gen_kwarg to eval during model runs
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 11eac71c13cd432322b69ae74a3b8fa17af31bc4
|
2025-02-13 02:35:06 +08:00 |
|
hoshi-hiyouga
|
1fee69f874
|
[misc] update license year & fix llama pro (#6814)
* fix llamapro script
* change year
Former-commit-id: e2dc5b952aa22835d5220ba624f44676138b65ac
|
2025-02-05 01:53:33 +08:00 |
|
hoshi-hiyouga
|
f6779b0e0c
|
[breaking] support transformers 4.48 (#6628)
Former-commit-id: 15357cdad953bba1f2d294819f56b9746ed1b891
|
2025-01-31 01:36:33 +08:00 |
|
hoshi-hiyouga
|
d8cba9464f
|
[inference] fix stop token for object detection (#6624)
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689c54ebb2af264de808502e5a8ba0f2b
|
2025-01-13 21:34:20 +08:00 |
|
hiyouga
|
da542fad18
|
improve log
Former-commit-id: 47e17dd689840ca9b3c5f34448e5f80265336cca
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
da8721a70e
|
fix #6499
Former-commit-id: 1800f8c72dfa618c71c84a3a18ecdef4d82754f7
|
2025-01-02 11:28:54 +00:00 |
|
hiyouga
|
3bcb4633ca
|
fix #6448
Former-commit-id: 27198679829fb766c7eef468ae4311fdced695a2
|
2024-12-27 16:54:39 +00:00 |
|
hiyouga
|
47c2d91933
|
support report custom args
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
|
2024-12-21 21:42:45 +00:00 |
|
hoshi-hiyouga
|
547f76e56e
|
Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
Former-commit-id: 947e22a4a30d8eb7b612da53bbf538ead7dd27b7
|
2024-12-21 14:09:33 +08:00 |
|
hiyouga
|
8524dcaa4a
|
fix #6391
Former-commit-id: d4c1fda1ad19e73484d8d51d81e490cdb8781955
|
2024-12-19 12:16:38 +00:00 |
|
hiyouga
|
95d3c2620b
|
support disable shuffling
Former-commit-id: c7cedc7569973a2879c689637b2923e8b26f1a81
|
2024-12-19 08:53:21 +00:00 |
|
hiyouga
|
1a48340680
|
add swanlab
Former-commit-id: 96f8f103e58a8ff307b0ce36c967de04f452434a
|
2024-12-19 07:12:31 +00:00 |
|
hiyouga
|
a94a1eac67
|
support control eos, fix #6345
Former-commit-id: eda76de32bab103c650f246327d214539ae6f291
|
2024-12-17 10:42:05 +00:00 |
|
hiyouga
|
50ca43c3fb
|
fix #6348
Former-commit-id: 142191e4664cb1b920aff2f51d1bac6180f2c24b
|
2024-12-17 10:06:46 +00:00 |
|
hiyouga
|
6f1e450739
|
fix mrope
Former-commit-id: 2811814fc42fb214b3e8be1055f9f57ffd0ffb12
|
2024-12-12 15:08:17 +00:00 |
|
hiyouga
|
235cdcacee
|
support batch infer in vllm
Former-commit-id: 1324d158f954d777f1fbf09f46149c372704b388
|
2024-12-04 13:50:00 +00:00 |
|
Ting
|
e27a0c3d53
|
code refactor
Former-commit-id: 40627c601efc9f144a227dded8c6b40babff4e8b
|
2024-11-19 20:33:18 +08:00 |
|
Ting
|
bf2b8df540
|
update
Former-commit-id: ef6e14550dd76810285cee9c268590d1d9423e54
|
2024-11-19 19:10:07 +08:00 |
|
Ting
|
7ad5b5c088
|
support efficient tokens calculation on sft/dpo
Former-commit-id: b9f00286d8a017ed9fd2876986da3b4d7034ef07
|
2024-11-19 17:15:47 +08:00 |
|
hiyouga
|
c2766af6f4
|
fix dpo metrics
Former-commit-id: 4270f7dfb9a12471c91f6c03dce7ca6fd88566e1
|
2024-11-02 20:59:01 +08:00 |
|
hiyouga
|
e83cb17f97
|
support rank0 logger
Former-commit-id: c38aa29336f286266553da4909a7267d7ef21f37
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
584ce3a105
|
fix incorrect loss value for vlms
Former-commit-id: 30567a1487727473950104718e626ff660f10cbb
|
2024-10-30 08:56:46 +00:00 |
|
hiyouga
|
7ccb86b215
|
add docstrings, refactor logger
Former-commit-id: 54c69059379d77dc9046c144cbe2d0253de3a4da
|
2024-09-08 00:56:56 +08:00 |
|
hiyouga
|
d5ea05cfff
|
update get template
Former-commit-id: dabad5570bf4a6b1044c963d8f27717030f373ef
|
2024-09-04 22:36:20 +08:00 |
|
hiyouga
|
22deca0e9e
|
lazy image load
Former-commit-id: 47ea97fb1ba77de2e8a561904aa8fdc27c3f5025
|
2024-09-04 02:27:08 +08:00 |
|
hiyouga
|
982585e375
|
lint
Former-commit-id: 22959bcdd3b124a642e2acaadc050e36d0520f52
|
2024-09-03 00:46:25 +08:00 |
|
hiyouga
|
6e98872622
|
fix #5324
Former-commit-id: a61c8c4890962f3847b19eff31b170cd7f54316c
|
2024-09-02 23:56:21 +08:00 |
|
hoshi-hiyouga
|
5af92971bc
|
fix trainer predict
Former-commit-id: 99fd9637bdc25f41fd1abc8a162f1069cb9060d4
|
2024-09-02 10:15:29 +08:00 |
|
hoshi-hiyouga
|
5c9972a2d5
|
remove .cpu()
Former-commit-id: a6c6750e8af5bc1ece1dfe6111d3e484fd19ee75
|
2024-09-02 10:10:53 +08:00 |
|
hiyouga
|
f31e7e0dfc
|
remove visual_inputs, fix qlora
Former-commit-id: a025c3df61db154bef13033518903bbf846f4fc8
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
51a0016873
|
optimize predict vram
Former-commit-id: a244f143f48a01910ce1cd56c0855ef11d62a72a
|
2024-08-30 23:08:45 +08:00 |
|
moontidef
|
b0d32b2041
|
fix: rename optimzer to optimizer
Former-commit-id: 40908a36fae3393715f75156867c11e6373fabad
|
2024-08-07 10:05:01 +08:00 |
|
hiyouga
|
3c7b10b1fa
|
fix metrics #4786
Former-commit-id: beec77a0898a39d94f41c23920415f5b4873a23a
|
2024-07-17 00:47:00 +08:00 |
|
hiyouga
|
e90fae61f4
|
support batch_eval_metrics, fix #4826
Former-commit-id: d774b94f124923829b2eae428e25199d503ebfcb
|
2024-07-17 00:33:00 +08:00 |
|
hiyouga
|
84e6715423
|
fix #4820
Former-commit-id: fd8cc490084aba9b5155eaaaf26129efd2871fa3
|
2024-07-15 22:32:07 +08:00 |
|
hiyouga
|
14bc7b0551
|
fix up
Former-commit-id: 29ebcd75d55f70f2891632eba187b643cc3a9e51
|
2024-07-15 01:04:56 +08:00 |
|
codingma
|
74f0d02eb8
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
Former-commit-id: 76f3bbcfc0e11aa41f8f5cbebc60b77b987f7901
|
2024-07-05 15:52:10 +08:00 |
|
hiyouga
|
7b3c1f29ff
|
fix packing for eager/sdpa attn
Former-commit-id: 6fd6aa4530f81a2ed306eeb2a5167607288b62c6
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
cc31014002
|
improve rlhf
Former-commit-id: c47ab6c07287fb260ea49b8b7af46bdd416f88f7
|
2024-07-02 22:23:08 +08:00 |
|
hiyouga
|
2cf03017a0
|
tiny fix
Former-commit-id: 73280b7dc7f8b3210bb08dfc3cf34760190f585a
|
2024-07-01 05:43:17 +08:00 |
|
hiyouga
|
54e786346e
|
add eval acc
Former-commit-id: 1856a08e87b150fa4bffcb0af703ed84d848e24b
|
2024-07-01 03:51:20 +08:00 |
|
hiyouga
|
835f0578c2
|
refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0fb9624807d809832f097301982d192
|
2024-06-28 01:04:24 +08:00 |
|
hzhaoy
|
e1751f6398
|
fix #4579
Former-commit-id: 677c86594e4ea904fde0a557852daf54636b06ae
|
2024-06-27 13:49:57 +08:00 |
|
hiyouga
|
a225b5a70c
|
tiny fix about badam
Former-commit-id: 095fab58d3692607c9e78747b4218ae1abcf5aaf
|
2024-06-25 01:54:53 +08:00 |
|