Ze-Yi LIN
|
210cdb9557
|
[webui] display swanlab exp link (#7089)
* webui add swanlab link
* change callback name
* update
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 891c4875039e8e3b7d0de025ee61c4ff003ff0c4
|
2025-02-27 19:40:54 +08:00 |
|
Eric Tang
|
e55ec42d3c
|
[ray] specify ray storage path (#6920)
Former-commit-id: 6edd4992d700fec56800a638f1cac0f87990c581
|
2025-02-14 21:55:41 +08:00 |
|
Billy Cao
|
48173b606c
|
[trainer] fix gen_kwarg to eval during training (#5451)
* Correctly pass gen_kwarg to eval during model runs
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 11eac71c13cd432322b69ae74a3b8fa17af31bc4
|
2025-02-13 02:35:06 +08:00 |
|
marko1616
|
bae934dea3
|
[trainer] fix llama3.2 vision kto train (#6904)
Former-commit-id: b7fd1e9c00c77a4c2a0f2f347767d22bd47213f1
|
2025-02-12 19:09:14 +08:00 |
|
hoshi-hiyouga
|
197aa3baf4
|
[data] fix ollama template (#6902)
* fix ollama template
* add meta info
* use half precision
Former-commit-id: e1a7c1242cd1e0a1ca9ee7d04377a53872488126
|
2025-02-11 22:43:09 +08:00 |
|
hoshi-hiyouga
|
c6be9e242c
|
[misc] support export ollama modelfile (#6899)
* support export ollama modelfile
* update config
* add system and num ctx
Former-commit-id: 9184a6e0ed7ff5f632c848f861bfa448c4cd06fc
|
2025-02-11 19:52:25 +08:00 |
|
hoshi-hiyouga
|
ff6658ad27
|
[deps] upgrade vllm (#6857)
Former-commit-id: 5f38bcaba921dbdee27b4be4709fcec06fa37c9e
|
2025-02-08 15:02:28 +08:00 |
|
hoshi-hiyouga
|
1fee69f874
|
[misc] update license year & fix llama pro (#6814)
* fix llamapro script
* change year
Former-commit-id: e2dc5b952aa22835d5220ba624f44676138b65ac
|
2025-02-05 01:53:33 +08:00 |
|
hoshi-hiyouga
|
f6779b0e0c
|
[breaking] support transformers 4.48 (#6628)
Former-commit-id: 15357cdad953bba1f2d294819f56b9746ed1b891
|
2025-01-31 01:36:33 +08:00 |
|
yinpu
|
aa7c07caf0
|
fix: avoid redundant normalization in DPO's SFT loss calculation (#6722)
Former-commit-id: 0f45982bac6b65533a94054ea5f792cb0f9e5a1f
|
2025-01-21 13:38:02 +08:00 |
|
hoshi-hiyouga
|
9ef85f8fc4
|
[optim] clean apollo (#6645)
* clean apollo code
* update readme
Former-commit-id: 7a04021d0461caea2c7b82169839340b7f51f463
|
2025-01-15 01:42:50 +08:00 |
|
zhuHQ
|
763f9b9df0
|
[optim] add support to APOLLO (#6617)
Former-commit-id: d9189f9f0b23ff6929044919208e0e813ca95b1c
|
2025-01-15 00:24:56 +08:00 |
|
hoshi-hiyouga
|
d8cba9464f
|
[inference] fix stop token for object detection (#6624)
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689c54ebb2af264de808502e5a8ba0f2b
|
2025-01-13 21:34:20 +08:00 |
|
hiyouga
|
da542fad18
|
improve log
Former-commit-id: 47e17dd689840ca9b3c5f34448e5f80265336cca
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
0c1ad5f3fb
|
fix llamaboard with ray
Former-commit-id: c46675d5e56d175c27d705ef0068fb47dc89a872
|
2025-01-07 09:59:24 +00:00 |
|
hiyouga
|
b4174021d6
|
refactor ray integration, support save ckpt
Former-commit-id: d8cac6f54663e6cffeddf2c65e3da454e7b86a75
|
2025-01-07 09:39:10 +00:00 |
|
Eric Tang
|
bba52e258e
|
run style check
Former-commit-id: 1e8e7be0a535e55888f58bbe2c38bc1c382e9012
|
2025-01-07 08:55:44 +00:00 |
|
Kourosh Hakhamaneshi
|
1217240918
|
drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
Former-commit-id: 163ddb680b6f84a4424a887a3b8a5d668044e87c
|
2025-01-07 08:55:44 +00:00 |
|
hiyouga
|
8c57169eb7
|
fix #6546
Former-commit-id: 870f23d7eaff1e32a73fee4eb972163c85ba7b67
|
2025-01-07 06:30:44 +00:00 |
|
hiyouga
|
da8721a70e
|
fix #6499
Former-commit-id: 1800f8c72dfa618c71c84a3a18ecdef4d82754f7
|
2025-01-02 11:28:54 +00:00 |
|
hiyouga
|
813f5919a3
|
fix #6482
Former-commit-id: 6f5bb3b8e5b6eb7fdfd7b0ca8eba789ab741a7b6
|
2024-12-30 06:03:07 +00:00 |
|
hiyouga
|
3bcb4633ca
|
fix #6448
Former-commit-id: 27198679829fb766c7eef468ae4311fdced695a2
|
2024-12-27 16:54:39 +00:00 |
|
hiyouga
|
47c2d91933
|
support report custom args
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
|
2024-12-21 21:42:45 +00:00 |
|
hoshi-hiyouga
|
547f76e56e
|
Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
Former-commit-id: 947e22a4a30d8eb7b612da53bbf538ead7dd27b7
|
2024-12-21 14:09:33 +08:00 |
|
ZeYi Lin
|
cc703b58f5
|
fix: by hiyouga suggestion
Former-commit-id: 3a7ea2048a41eafc41fdca944e142f5a0f35a5b3
|
2024-12-20 16:43:03 +08:00 |
|
ZeYi Lin
|
8f786ee938
|
feat: ui improve
Former-commit-id: 5f6dafd70e962b8fe9a294d555133002135f80df
|
2024-12-20 11:03:02 +08:00 |
|
ZeYi Lin
|
dd22454fc5
|
fix: bugs
Former-commit-id: d0eb64d5e3472a166c9adac4cb4ba06bdd663e46
|
2024-12-19 21:08:16 +08:00 |
|
hiyouga
|
8524dcaa4a
|
fix #6391
Former-commit-id: d4c1fda1ad19e73484d8d51d81e490cdb8781955
|
2024-12-19 12:16:38 +00:00 |
|
ZeYi Lin
|
53103f55b6
|
feat: optimize frontend
Former-commit-id: 8c2df41b937f491f7ebf593b20c65a19738c7642
|
2024-12-19 19:04:19 +08:00 |
|
ZeYi Lin
|
cc5cde734b
|
feat: swanlab params
Former-commit-id: d5cf87990e5bea920ecd1561def09fa17cf328b1
|
2024-12-19 18:47:27 +08:00 |
|
hiyouga
|
95d3c2620b
|
support disable shuffling
Former-commit-id: c7cedc7569973a2879c689637b2923e8b26f1a81
|
2024-12-19 08:53:21 +00:00 |
|
hiyouga
|
1a48340680
|
add swanlab
Former-commit-id: 96f8f103e58a8ff307b0ce36c967de04f452434a
|
2024-12-19 07:12:31 +00:00 |
|
hiyouga
|
a94a1eac67
|
support control eos, fix #6345
Former-commit-id: eda76de32bab103c650f246327d214539ae6f291
|
2024-12-17 10:42:05 +00:00 |
|
hiyouga
|
50ca43c3fb
|
fix #6348
Former-commit-id: 142191e4664cb1b920aff2f51d1bac6180f2c24b
|
2024-12-17 10:06:46 +00:00 |
|
hiyouga
|
6f1e450739
|
fix mrope
Former-commit-id: 2811814fc42fb214b3e8be1055f9f57ffd0ffb12
|
2024-12-12 15:08:17 +00:00 |
|
hiyouga
|
235cdcacee
|
support batch infer in vllm
Former-commit-id: 1324d158f954d777f1fbf09f46149c372704b388
|
2024-12-04 13:50:00 +00:00 |
|
hoshi-hiyouga
|
302e4e22bf
|
Merge pull request #6078 from wtmlon/support-efficient-tokens-calculation
support effective tokens calculation on sft/dpo
Former-commit-id: bd639a137e6f46e1a0005cc91572f5f1ec894f74
|
2024-11-20 13:43:15 +08:00 |
|
Ting
|
e27a0c3d53
|
code refactor
Former-commit-id: 40627c601efc9f144a227dded8c6b40babff4e8b
|
2024-11-19 20:33:18 +08:00 |
|
Ting
|
32656bc50d
|
update
Former-commit-id: f566ecc8d1f04615351acbe4f8480b75b2daed42
|
2024-11-19 19:12:10 +08:00 |
|
Ting
|
bf2b8df540
|
update
Former-commit-id: ef6e14550dd76810285cee9c268590d1d9423e54
|
2024-11-19 19:10:07 +08:00 |
|
Ting
|
7ad5b5c088
|
support efficient tokens calculation on sft/dpo
Former-commit-id: b9f00286d8a017ed9fd2876986da3b4d7034ef07
|
2024-11-19 17:15:47 +08:00 |
|
hoshi-hiyouga
|
9815d1712c
|
fix #6050
Former-commit-id: dc828218726704ff0453a2d13535663ac6ad7833
|
2024-11-16 16:11:16 +08:00 |
|
hiyouga
|
c2766af6f4
|
fix dpo metrics
Former-commit-id: 4270f7dfb9a12471c91f6c03dce7ca6fd88566e1
|
2024-11-02 20:59:01 +08:00 |
|
hiyouga
|
e83cb17f97
|
support rank0 logger
Former-commit-id: c38aa29336f286266553da4909a7267d7ef21f37
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
3f7c874594
|
update tests
Former-commit-id: 93d3b8f43faf4a81b809d2f7d897e39bdb5475c3
|
2024-11-02 12:41:44 +08:00 |
|
hiyouga
|
584ce3a105
|
fix incorrect loss value for vlms
Former-commit-id: 30567a1487727473950104718e626ff660f10cbb
|
2024-10-30 08:56:46 +00:00 |
|
hiyouga
|
13c7e873e0
|
fix #5749
Former-commit-id: 23dbe9a09999fe0f9eb2902a40e33b36db4ca584
|
2024-10-29 13:02:13 +00:00 |
|
hiyouga
|
d183966a5d
|
fix pissa
Former-commit-id: 51e5f962474739bbf396782afdaa68743636fe90
|
2024-10-29 12:18:45 +00:00 |
|
hiyouga
|
825ea1c72d
|
fix #5747
Former-commit-id: ae045c884f8ac2aa0ea27592e0757b7bca2dba13
|
2024-10-29 10:47:04 +00:00 |
|
hiyouga
|
0d8aa6e6ef
|
use pre-commit
Former-commit-id: 21db8ed2f4a0eba203754a92ce0741538e8ee709
|
2024-10-29 09:07:46 +00:00 |
|