hiyouga
c83b74ab9e
add qvq #6439
...
Former-commit-id: ee0e400f41
2024-12-25 07:52:41 +00:00
hoshi-hiyouga
c5780f5eaa
Merge pull request #6430 from hiyouga/hiyouga/upd_wechat
...
[assets] update wechat
Former-commit-id: cbd494ddaf
2024-12-24 16:13:20 +08:00
hiyouga
4cd1d05429
update wechat
...
Former-commit-id: 83202c9027
2024-12-24 08:12:53 +00:00
hoshi-hiyouga
459219a260
Merge pull request #6426 from hiyouga/hiyouga/update_readme
...
[assets] update readme
Former-commit-id: b9f73fc5ca
2024-12-23 22:17:19 +08:00
hiyouga
353259f03f
update readme
...
Former-commit-id: 8fd38d273e
2024-12-23 14:08:59 +00:00
hoshi-hiyouga
8265d6a228
Merge pull request #5922 from Tuyohai/main
...
support granite3 models
Former-commit-id: c23a4d0658
2024-12-23 16:46:02 +08:00
hoshi-hiyouga
c0418062c0
Merge pull request #6418 from hiyouga/hiyouga/add_report
...
[trainer] add custom args to experimental logger
Former-commit-id: d58746eca2
2024-12-22 05:47:55 +08:00
hiyouga
47c2d91933
support report custom args
...
Former-commit-id: 5111cac6f8
2024-12-21 21:42:45 +00:00
hiyouga
f07bad7144
fix paligemma infer
...
Former-commit-id: 84cd1188ac
2024-12-21 20:24:32 +00:00
hoshi-hiyouga
9d437a5f4f
Merge pull request #6416 from Zeyi-Lin/main
...
docs: use swanlab
Former-commit-id: a2ad0738a2
2024-12-22 04:08:26 +08:00
ZeYi Lin
1c1d6bea43
docs: use swanlab
...
Former-commit-id: 744ef8c268
2024-12-21 20:59:25 +08:00
hoshi-hiyouga
547f76e56e
Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
...
feat: add swanlab for experiment tracking and visualization.
Former-commit-id: 947e22a4a3
2024-12-21 14:09:33 +08:00
ZeYi Lin
67d4757c35
fix: project blank
...
Former-commit-id: 82e5d75014
2024-12-20 18:26:02 +08:00
ZeYi Lin
cc703b58f5
fix: by hiyouga suggestion
...
Former-commit-id: 3a7ea2048a
2024-12-20 16:43:03 +08:00
ZeYi Lin
8f786ee938
feat: ui improve
...
Former-commit-id: 5f6dafd70e
2024-12-20 11:03:02 +08:00
ZeYi Lin
03dba638e6
fix: text
...
Former-commit-id: 0a52962db3
2024-12-19 21:26:02 +08:00
ZeYi Lin
dd22454fc5
fix: bugs
...
Former-commit-id: d0eb64d5e3
2024-12-19 21:08:16 +08:00
hoshi-hiyouga
904f18b4a2
Merge pull request #6395 from hiyouga/hiyouga/fix_genkwargs
...
[generate] fix generate kwargs
Former-commit-id: c6e3c14a93
2024-12-19 20:24:17 +08:00
ZeYi Lin
b512a06c3d
docs: config framework
...
Former-commit-id: 7eb49e5ffa
2024-12-19 20:22:36 +08:00
ZeYi Lin
c31933ef9e
fix: string
...
Former-commit-id: 3306919629
2024-12-19 20:18:59 +08:00
hiyouga
8524dcaa4a
fix #6391
...
Former-commit-id: d4c1fda1ad
2024-12-19 12:16:38 +00:00
ZeYi Lin
53103f55b6
feat: optimize frontend
...
Former-commit-id: 8c2df41b93
2024-12-19 19:04:19 +08:00
ZeYi Lin
cc5cde734b
feat: swanlab params
...
Former-commit-id: d5cf87990e
2024-12-19 18:47:27 +08:00
hoshi-hiyouga
af9ef037dd
Merge pull request #6388 from hiyouga/hiyouga/shuffle_control
...
[trainer] support disable shuffling
Former-commit-id: ffbb4dbdb0
2024-12-19 17:00:12 +08:00
hiyouga
95d3c2620b
support disable shuffling
...
Former-commit-id: c7cedc7569
2024-12-19 08:53:21 +00:00
hiyouga
1a48340680
add swanlab
...
Former-commit-id: 96f8f103e5
2024-12-19 07:12:31 +00:00
hoshi-hiyouga
d6ce1045f7
Merge pull request #6384 from hiyouga/hiyouga/fix_webui
...
[webui] fix webui args
Former-commit-id: 6ccd64ecd9
2024-12-19 14:57:52 +08:00
hiyouga
92a0d08e27
fix webui
...
Former-commit-id: 369cca8110
2024-12-19 06:48:03 +00:00
hoshi-hiyouga
910884065e
Merge pull request #6379 from hiyouga/hiyouga/add_paligemma2
...
[model] add paligemma2
Former-commit-id: 933647e680
2024-12-18 17:03:11 +08:00
hiyouga
433d116080
add paligemma2
...
Former-commit-id: d3509050dc
2024-12-18 08:57:26 +00:00
hoshi-hiyouga
d43080b534
Merge pull request #6313 from ge-xing/main
...
support telechat2 model
Former-commit-id: 015f213788
2024-12-18 16:16:17 +08:00
hoshi-hiyouga
5f0dd86c15
Merge pull request #6369 from hiyouga/hiyouga/template
...
[template] support qwen2 tool template
Former-commit-id: af33627502
2024-12-18 04:23:49 +08:00
hiyouga
a421113466
support qwen tool format
...
Former-commit-id: 98795854e3
2024-12-17 20:12:06 +00:00
hiyouga
acd62fddb8
change default replace jinja to false
...
Former-commit-id: bcc413cf64
2024-12-17 19:27:10 +00:00
hoshi-hiyouga
d8f6569be1
Merge pull request #5473 from AlongWY/mistral
...
Support Mistral format tools
Former-commit-id: 2fad3792d9
2024-12-18 03:23:24 +08:00
ylfeng
857d23b324
Support Mistral format tools
...
Former-commit-id: 115924af47
2024-12-17 19:13:26 +00:00
hoshi-hiyouga
ad00c793ce
Merge pull request #6368 from hiyouga/hiyouga/fix_llama_template
...
[template] fix llama3 tool template
Former-commit-id: 8974a0a185
2024-12-18 01:10:48 +08:00
hiyouga
f6a2bfc0e8
fix llama3 tool template
...
Former-commit-id: df5655f61c
2024-12-17 17:05:10 +00:00
hoshi-hiyouga
1cc24ed206
Merge pull request #6367 from hiyouga/hiyouga/add_model
...
[model&template] add llama3.3 & support llama3 tool prompt
Former-commit-id: e12c80ace8
2024-12-18 00:13:28 +08:00
hiyouga
a935933bed
support llama3 tool prompt
...
Former-commit-id: b24ae55ebf
2024-12-17 15:52:37 +00:00
hoshi-hiyouga
09419dfbab
Merge pull request #5819 from yafshar/remote_code
...
Add trust_remote_code Parameter and Set Default to False
Former-commit-id: 2a832e489b
2024-12-17 21:10:24 +08:00
Yaser Afshar
76ebd62ac1
Add missing key to init_kwargs
...
Former-commit-id: 1c8ad22a5f
2024-12-17 12:34:05 +00:00
Yaser Afshar
fe4546a7bb
Add trust_remote_code parameter and remove True
...
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
to enhance security
Former-commit-id: 0943776326
2024-12-17 12:25:12 +00:00
zhaohu xing
cfb4c42ae4
support telechat2 model
...
Former-commit-id: 04f19ed0f3
2024-12-17 12:15:33 +00:00
hoshi-hiyouga
fc18db6290
Merge pull request #6364 from hiyouga/hiyouga/control_reenterent_gc
...
[model] support non-reenterent-gc
Former-commit-id: a665ad6178
2024-12-17 19:58:36 +08:00
hiyouga
64bac4bc7e
support non-reenterent-gc & fix #6358
...
Former-commit-id: f319da6937
2024-12-17 11:41:59 +00:00
hoshi-hiyouga
002c7d2867
Merge pull request #6363 from hiyouga/hiyouga/control_skip_eos
...
[infer] support control eos
Former-commit-id: 6973828307
2024-12-17 19:35:40 +08:00
hiyouga
a94a1eac67
support control eos, fix #6345
...
Former-commit-id: eda76de32b
2024-12-17 10:42:05 +00:00
hoshi-hiyouga
a8a990a9a7
Merge pull request #6362 from hiyouga/hiyouga/mllm_packing
...
[model] generalized packing
Former-commit-id: 9708a39179
2024-12-17 18:41:48 +08:00
hiyouga
bff1b94583
generalized packing & fix #6343
...
Former-commit-id: 2d107d3aef
2024-12-17 10:26:19 +00:00