hoshi-hiyouga
|
64a6fb9b50
|
[model] add QwQ 32b (#7179)
|
2025-03-06 11:58:36 +08:00 |
|
Ze-Yi LIN
|
8ad03258e1
|
[trainer] fix swanlab callback (#7176)
|
2025-03-06 00:33:37 +08:00 |
|
hoshi-hiyouga
|
b4b89b4ff3
|
[trainer] update config (#7174)
|
2025-03-05 23:32:54 +08:00 |
|
Ze-Yi LIN
|
891c487503
|
[webui] display swanlab exp link (#7089)
* webui add swanlab link
* change callback name
* update
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
|
2025-02-27 19:40:54 +08:00 |
|
Eric Tang
|
6edd4992d7
|
[ray] specify ray storage path (#6920)
|
2025-02-14 21:55:41 +08:00 |
|
Billy Cao
|
11eac71c13
|
[trainer] fix gen_kwarg to eval during training (#5451)
* Correctly pass gen_kwarg to eval during model runs
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
|
2025-02-13 02:35:06 +08:00 |
|
marko1616
|
b7fd1e9c00
|
[trainer] fix llama3.2 vision kto train (#6904)
|
2025-02-12 19:09:14 +08:00 |
|
hoshi-hiyouga
|
e1a7c1242c
|
[data] fix ollama template (#6902)
* fix ollama template
* add meta info
* use half precision
|
2025-02-11 22:43:09 +08:00 |
|
hoshi-hiyouga
|
9184a6e0ed
|
[misc] support export ollama modelfile (#6899)
* support export ollama modelfile
* update config
* add system and num ctx
|
2025-02-11 19:52:25 +08:00 |
|
hoshi-hiyouga
|
5f38bcaba9
|
[deps] upgrade vllm (#6857)
|
2025-02-08 15:02:28 +08:00 |
|
hoshi-hiyouga
|
e2dc5b952a
|
[misc] update license year & fix llama pro (#6814)
* fix llamapro script
* change year
|
2025-02-05 01:53:33 +08:00 |
|
hoshi-hiyouga
|
15357cdad9
|
[breaking] support transformers 4.48 (#6628)
|
2025-01-31 01:36:33 +08:00 |
|
yinpu
|
0f45982bac
|
fix: avoid redundant normalization in DPO's SFT loss calculation (#6722)
|
2025-01-21 13:38:02 +08:00 |
|
hoshi-hiyouga
|
7a04021d04
|
[optim] clean apollo (#6645)
* clean apollo code
* update readme
|
2025-01-15 01:42:50 +08:00 |
|
zhuHQ
|
d9189f9f0b
|
[optim] add support to APOLLO (#6617)
|
2025-01-15 00:24:56 +08:00 |
|
hoshi-hiyouga
|
e3e2c8c689
|
[inference] fix stop token for object detection (#6624)
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
|
2025-01-13 21:34:20 +08:00 |
|
hiyouga
|
47e17dd689
|
imporve log
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
c46675d5e5
|
fix llamaboard with ray
|
2025-01-07 09:59:24 +00:00 |
|
hiyouga
|
d8cac6f546
|
refactor ray integration, support save ckpt
|
2025-01-07 09:39:10 +00:00 |
|
Eric Tang
|
1e8e7be0a5
|
run style check
|
2025-01-07 08:55:44 +00:00 |
|
Kourosh Hakhamaneshi
|
163ddb680b
|
drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
|
2025-01-07 08:55:44 +00:00 |
|
hiyouga
|
870f23d7ea
|
fix #6546
|
2025-01-07 06:30:44 +00:00 |
|
hiyouga
|
1800f8c72d
|
fix #6499
|
2025-01-02 11:28:54 +00:00 |
|
hiyouga
|
6f5bb3b8e5
|
fix #6482
|
2024-12-30 06:03:07 +00:00 |
|
hiyouga
|
2719867982
|
fix #6448
|
2024-12-27 16:54:39 +00:00 |
|
hiyouga
|
5111cac6f8
|
support report custom args
|
2024-12-21 21:42:45 +00:00 |
|
hoshi-hiyouga
|
947e22a4a3
|
Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
|
2024-12-21 14:09:33 +08:00 |
|
ZeYi Lin
|
3a7ea2048a
|
fix: by hiyouga suggestion
|
2024-12-20 16:43:03 +08:00 |
|
ZeYi Lin
|
5f6dafd70e
|
feat: ui improve
|
2024-12-20 11:03:02 +08:00 |
|
ZeYi Lin
|
d0eb64d5e3
|
fix: bugs
|
2024-12-19 21:08:16 +08:00 |
|
hiyouga
|
d4c1fda1ad
|
fix #6391
|
2024-12-19 12:16:38 +00:00 |
|
ZeYi Lin
|
8c2df41b93
|
feat: optimize frontend
|
2024-12-19 19:04:19 +08:00 |
|
ZeYi Lin
|
d5cf87990e
|
feat: swanlab params
|
2024-12-19 18:47:27 +08:00 |
|
hiyouga
|
c7cedc7569
|
support disable shuffling
|
2024-12-19 08:53:21 +00:00 |
|
hiyouga
|
96f8f103e5
|
add swanlab
|
2024-12-19 07:12:31 +00:00 |
|
hiyouga
|
eda76de32b
|
support control eos, fix #6345
|
2024-12-17 10:42:05 +00:00 |
|
hiyouga
|
142191e466
|
fix #6348
|
2024-12-17 10:06:46 +00:00 |
|
hiyouga
|
2811814fc4
|
fix mrope
|
2024-12-12 15:08:17 +00:00 |
|
hiyouga
|
1324d158f9
|
support batch infer in vllm
|
2024-12-04 13:50:00 +00:00 |
|
hoshi-hiyouga
|
bd639a137e
|
Merge pull request #6078 from wtmlon/support-efficient-tokens-calculation
support effective tokens calculation on sft/dpo
|
2024-11-20 13:43:15 +08:00 |
|
Ting
|
40627c601e
|
code refactor
|
2024-11-19 20:33:18 +08:00 |
|
Ting
|
f566ecc8d1
|
update
|
2024-11-19 19:12:10 +08:00 |
|
Ting
|
ef6e14550d
|
update
|
2024-11-19 19:10:07 +08:00 |
|
Ting
|
b9f00286d8
|
support efficient tokens calculation on sft/dpo
|
2024-11-19 17:15:47 +08:00 |
|
hoshi-hiyouga
|
dc82821872
|
fix #6050
|
2024-11-16 16:11:16 +08:00 |
|
hiyouga
|
4270f7dfb9
|
fix dpo metrics
|
2024-11-02 20:59:01 +08:00 |
|
hiyouga
|
c38aa29336
|
support rank0 logger
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
93d3b8f43f
|
update tests
|
2024-11-02 12:41:44 +08:00 |
|
hiyouga
|
30567a1487
|
fix incorrect loss value for vlms
|
2024-10-30 08:56:46 +00:00 |
|
hiyouga
|
23dbe9a099
|
fix #5749
|
2024-10-29 13:02:13 +00:00 |
|