Commit Graph

142 Commits

Author SHA1 Message Date
hoshi-hiyouga
e2dc5b952a [misc] update license year & fix llama pro (#6814)
* fix llamapro script

* change year
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
15357cdad9 [breaking] support transformers 4.48 (#6628) 2025-01-31 01:36:33 +08:00
yinpu
0f45982bac fix: avoid redundant normalization in DPO's SFT loss calculation (#6722) 2025-01-21 13:38:02 +08:00
hoshi-hiyouga
7a04021d04 [optim] clean apollo (#6645)
* clean apollo code

* update readme
2025-01-15 01:42:50 +08:00
zhuHQ
d9189f9f0b [optim] add support to APOLLO (#6617) 2025-01-15 00:24:56 +08:00
hoshi-hiyouga
e3e2c8c689 [inference] fix stop token for object detection (#6624)
* fix stop token

* update minicpm data pipeline

* fix npu qlora examples
2025-01-13 21:34:20 +08:00
hiyouga
47e17dd689 imporve log 2025-01-08 09:56:10 +00:00
hiyouga
c46675d5e5 fix llamaboard with ray 2025-01-07 09:59:24 +00:00
hiyouga
d8cac6f546 refactor ray integration, support save ckpt 2025-01-07 09:39:10 +00:00
Eric Tang
1e8e7be0a5 run style check 2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
163ddb680b drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
2025-01-07 08:55:44 +00:00
hiyouga
870f23d7ea fix #6546 2025-01-07 06:30:44 +00:00
hiyouga
1800f8c72d fix #6499 2025-01-02 11:28:54 +00:00
hiyouga
6f5bb3b8e5 fix #6482 2024-12-30 06:03:07 +00:00
hiyouga
2719867982 fix #6448 2024-12-27 16:54:39 +00:00
hiyouga
5111cac6f8 support report custom args 2024-12-21 21:42:45 +00:00
hoshi-hiyouga
947e22a4a3 Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
2024-12-21 14:09:33 +08:00
ZeYi Lin
3a7ea2048a fix: by hiyouga suggestion 2024-12-20 16:43:03 +08:00
ZeYi Lin
5f6dafd70e feat: ui improve 2024-12-20 11:03:02 +08:00
ZeYi Lin
d0eb64d5e3 fix: bugs 2024-12-19 21:08:16 +08:00
hiyouga
d4c1fda1ad fix #6391 2024-12-19 12:16:38 +00:00
ZeYi Lin
8c2df41b93 feat: optimize frontend 2024-12-19 19:04:19 +08:00
ZeYi Lin
d5cf87990e feat: swanlab params 2024-12-19 18:47:27 +08:00
hiyouga
c7cedc7569 support disable shuffling 2024-12-19 08:53:21 +00:00
hiyouga
96f8f103e5 add swanlab 2024-12-19 07:12:31 +00:00
hiyouga
eda76de32b support control eos, fix #6345 2024-12-17 10:42:05 +00:00
hiyouga
142191e466 fix #6348 2024-12-17 10:06:46 +00:00
hiyouga
2811814fc4 fix mrope 2024-12-12 15:08:17 +00:00
hiyouga
1324d158f9 support batch infer in vllm 2024-12-04 13:50:00 +00:00
hoshi-hiyouga
bd639a137e Merge pull request #6078 from wtmlon/support-efficient-tokens-calculation
support effective tokens calculation on sft/dpo
2024-11-20 13:43:15 +08:00
Ting
40627c601e code refactor 2024-11-19 20:33:18 +08:00
Ting
f566ecc8d1 update 2024-11-19 19:12:10 +08:00
Ting
ef6e14550d update 2024-11-19 19:10:07 +08:00
Ting
b9f00286d8 support efficient tokens calculation on sft/dpo 2024-11-19 17:15:47 +08:00
hoshi-hiyouga
dc82821872 fix #6050 2024-11-16 16:11:16 +08:00
hiyouga
4270f7dfb9 fix dpo metrics 2024-11-02 20:59:01 +08:00
hiyouga
c38aa29336 support rank0 logger 2024-11-02 18:31:04 +08:00
hiyouga
93d3b8f43f update tests 2024-11-02 12:41:44 +08:00
hiyouga
30567a1487 fix incorrect loss value for vlms 2024-10-30 08:56:46 +00:00
hiyouga
23dbe9a099 fix #5749 2024-10-29 13:02:13 +00:00
hiyouga
51e5f96247 fix pissa 2024-10-29 12:18:45 +00:00
hiyouga
ae045c884f fix #5747 2024-10-29 10:47:04 +00:00
hiyouga
21db8ed2f4 use pre-commit 2024-10-29 09:07:46 +00:00
hoshi-hiyouga
74a79cc059 fix test 2024-10-22 12:35:36 +08:00
hiyouga
451d271718 tiny fix 2024-10-08 17:48:56 +08:00
Chengcheng Pei
573e3183e6 1, log exceptions in details; 2, check processor is None before calling it. 2024-09-25 12:59:48 -07:00
hiyouga
c7e51ff187 fix #5411 2024-09-11 17:36:42 +08:00
hiyouga
90d6df6222 release v0.9.0 (real) 2024-09-09 01:00:25 +08:00
hiyouga
54c6905937 add docstrings, refactor logger 2024-09-08 00:56:56 +08:00
hoshi-hiyouga
e9bda48c6d fix #5366 2024-09-05 18:08:09 +08:00