| hiyouga | a400d896a1 | update wechat | 2024-12-30 13:54:22 +00:00 |
| hoshi-hiyouga | 91467ed313 | Merge pull request #5507 from piamo/main (Add deepseek-v2.5 template) | 2024-12-30 21:08:25 +08:00 |
| hoshi-hiyouga | 40805b0cc0 | Merge pull request #6483 from hiyouga/hiyouga/fix_paligemma_infer ([model] update vllm & fix paligemma dtype) | 2024-12-30 16:34:32 +08:00 |
| hiyouga | 6f5bb3b8e5 | fix #6482 | 2024-12-30 06:03:07 +00:00 |
| hoshi-hiyouga | b55890291b | Merge pull request #6465 from hiyouga/hiyouga/fix_eval_loss ([trainer] fix eval loss) | 2024-12-28 01:02:56 +08:00 |
| hiyouga | 2719867982 | fix #6448 | 2024-12-27 16:54:39 +00:00 |
| hoshi-hiyouga | f68074d87b | Merge pull request #6457 from youkaichao/module-run ([misc] enable module run) | 2024-12-26 23:41:37 +08:00 |
| youkaichao | c39d81cd1d | Update cli.py | 2024-12-26 23:22:09 +08:00 |
| hoshi-hiyouga | cd56f88ff2 | Merge pull request #6443 from hiyouga/hiyouga/add_qvq ([modle] add qvq) | 2024-12-25 15:53:19 +08:00 |
| hiyouga | ee0e400f41 | add qvq #6439 | 2024-12-25 07:52:41 +00:00 |
| hoshi-hiyouga | cbd494ddaf | Merge pull request #6430 from hiyouga/hiyouga/upd_wechat ([assets] update wechat) | 2024-12-24 16:13:20 +08:00 |
| hiyouga | 83202c9027 | update wechat | 2024-12-24 08:12:53 +00:00 |
| hoshi-hiyouga | b9f73fc5ca | Merge pull request #6426 from hiyouga/hiyouga/update_readme ([assets] update readme) | 2024-12-23 22:17:19 +08:00 |
| hiyouga | 8fd38d273e | update readme | 2024-12-23 14:08:59 +00:00 |
| hoshi-hiyouga | c23a4d0658 | Merge pull request #5922 from Tuyohai/main (support granite3 models) | 2024-12-23 16:46:02 +08:00 |
| hoshi-hiyouga | d58746eca2 | Merge pull request #6418 from hiyouga/hiyouga/add_report ([trainer] add custom args to experimental logger) | 2024-12-22 05:47:55 +08:00 |
| hiyouga | 5111cac6f8 | support report custom args | 2024-12-21 21:42:45 +00:00 |
| hiyouga | 84cd1188ac | fix paligemma infer | 2024-12-21 20:24:32 +00:00 |
| hoshi-hiyouga | a2ad0738a2 | Merge pull request #6416 from Zeyi-Lin/main (docs: use swanlab) | 2024-12-22 04:08:26 +08:00 |
| ZeYi Lin | 744ef8c268 | docs: use swanlab | 2024-12-21 20:59:25 +08:00 |
| hoshi-hiyouga | 947e22a4a3 | Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab (feat: add swanlab for experiment tracking and visualization.) | 2024-12-21 14:09:33 +08:00 |
| ZeYi Lin | 82e5d75014 | fix: project blank | 2024-12-20 18:26:02 +08:00 |
| ZeYi Lin | 3a7ea2048a | fix: by hiyouga suggestion | 2024-12-20 16:43:03 +08:00 |
| ZeYi Lin | 5f6dafd70e | feat: ui improve | 2024-12-20 11:03:02 +08:00 |
| ZeYi Lin | 0a52962db3 | fix: text | 2024-12-19 21:26:02 +08:00 |
| ZeYi Lin | d0eb64d5e3 | fix: bugs | 2024-12-19 21:08:16 +08:00 |
| hoshi-hiyouga | c6e3c14a93 | Merge pull request #6395 from hiyouga/hiyouga/fix_genkwargs ([generate] fix generate kwargs) | 2024-12-19 20:24:17 +08:00 |
| ZeYi Lin | 7eb49e5ffa | docs: config framework | 2024-12-19 20:22:36 +08:00 |
| ZeYi Lin | 3306919629 | fix: string | 2024-12-19 20:18:59 +08:00 |
| hiyouga | d4c1fda1ad | fix #6391 | 2024-12-19 12:16:38 +00:00 |
| ZeYi Lin | 8c2df41b93 | feat: optimize frontend | 2024-12-19 19:04:19 +08:00 |
| ZeYi Lin | d5cf87990e | feat: swanlab params | 2024-12-19 18:47:27 +08:00 |
| hoshi-hiyouga | ffbb4dbdb0 | Merge pull request #6388 from hiyouga/hiyouga/shuffle_control ([trainer] support disable shuffling) | 2024-12-19 17:00:12 +08:00 |
| hiyouga | c7cedc7569 | support disable shuffling | 2024-12-19 08:53:21 +00:00 |
| hiyouga | 96f8f103e5 | add swanlab | 2024-12-19 07:12:31 +00:00 |
| hoshi-hiyouga | 6ccd64ecd9 | Merge pull request #6384 from hiyouga/hiyouga/fix_webui ([webui] fix webui args) | 2024-12-19 14:57:52 +08:00 |
| hiyouga | 369cca8110 | fix webui | 2024-12-19 06:48:03 +00:00 |
| hoshi-hiyouga | 933647e680 | Merge pull request #6379 from hiyouga/hiyouga/add_paligemma2 ([model] add paligemma2) | 2024-12-18 17:03:11 +08:00 |
| hiyouga | d3509050dc | add paligemma2 | 2024-12-18 08:57:26 +00:00 |
| hoshi-hiyouga | 015f213788 | Merge pull request #6313 from ge-xing/main (support telechat2 model) | 2024-12-18 16:16:17 +08:00 |
| hoshi-hiyouga | af33627502 | Merge pull request #6369 from hiyouga/hiyouga/template ([template] support qwen2 tool template) | 2024-12-18 04:23:49 +08:00 |
| hiyouga | 98795854e3 | support qwen tool format | 2024-12-17 20:12:06 +00:00 |
| hiyouga | bcc413cf64 | change default replace jinja to false | 2024-12-17 19:27:10 +00:00 |
| hoshi-hiyouga | 2fad3792d9 | Merge pull request #5473 from AlongWY/mistral (Support Mistral format tools) | 2024-12-18 03:23:24 +08:00 |
| ylfeng | 115924af47 | Support Mistral format tools | 2024-12-17 19:13:26 +00:00 |
| hoshi-hiyouga | 8974a0a185 | Merge pull request #6368 from hiyouga/hiyouga/fix_llama_template ([template] fix llama3 tool template) | 2024-12-18 01:10:48 +08:00 |
| hiyouga | df5655f61c | fix llama3 tool template | 2024-12-17 17:05:10 +00:00 |
| hoshi-hiyouga | e12c80ace8 | Merge pull request #6367 from hiyouga/hiyouga/add_model ([model&template] add llama3.3 & support llama3 tool prompt) | 2024-12-18 00:13:28 +08:00 |
| hiyouga | b24ae55ebf | support llama3 tool prompt | 2024-12-17 15:52:37 +00:00 |
| hoshi-hiyouga | 2a832e489b | Merge pull request #5819 from yafshar/remote_code (Add trust_remote_code Parameter and Set Default to False) | 2024-12-17 21:10:24 +08:00 |