hoshi-hiyouga
1cc24ed206
Merge pull request #6367 from hiyouga/hiyouga/add_model
...
[model&template] add llama3.3 & support llama3 tool prompt
Former-commit-id: e12c80ace8
2024-12-18 00:13:28 +08:00
hiyouga
a935933bed
support llama3 tool prompt
...
Former-commit-id: b24ae55ebf
2024-12-17 15:52:37 +00:00
hoshi-hiyouga
09419dfbab
Merge pull request #5819 from yafshar/remote_code
...
Add trust_remote_code Parameter and Set Default to False
Former-commit-id: 2a832e489b
2024-12-17 21:10:24 +08:00
Yaser Afshar
76ebd62ac1
Add missing key to init_kwargs
...
Former-commit-id: 1c8ad22a5f
2024-12-17 12:34:05 +00:00
Yaser Afshar
fe4546a7bb
Add trust_remote_code parameter and remove True
...
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
to enhance security
Former-commit-id: 0943776326
2024-12-17 12:25:12 +00:00
zhaohu xing
cfb4c42ae4
support telechat2 model
...
Former-commit-id: 04f19ed0f3
2024-12-17 12:15:33 +00:00
hoshi-hiyouga
fc18db6290
Merge pull request #6364 from hiyouga/hiyouga/control_reenterent_gc
...
[model] support non-reenterent-gc
Former-commit-id: a665ad6178
2024-12-17 19:58:36 +08:00
hiyouga
64bac4bc7e
support non-reenterent-gc & fix #6358
...
Former-commit-id: f319da6937
2024-12-17 11:41:59 +00:00
hoshi-hiyouga
002c7d2867
Merge pull request #6363 from hiyouga/hiyouga/control_skip_eos
...
[infer] support control eos
Former-commit-id: 6973828307
2024-12-17 19:35:40 +08:00
hiyouga
a94a1eac67
support control eos, fix #6345
...
Former-commit-id: eda76de32b
2024-12-17 10:42:05 +00:00
hoshi-hiyouga
a8a990a9a7
Merge pull request #6362 from hiyouga/hiyouga/mllm_packing
...
[model] generalized packing
Former-commit-id: 9708a39179
2024-12-17 18:41:48 +08:00
hiyouga
bff1b94583
generalized packing & fix #6343
...
Former-commit-id: 2d107d3aef
2024-12-17 10:26:19 +00:00
hoshi-hiyouga
4caf043cf8
Merge pull request #6359 from hiyouga/hiyouga/fix_qwen2vl_infer
...
[model] fix qwen2vl infern
Former-commit-id: 81815f053f
2024-12-17 18:15:23 +08:00
hiyouga
50ca43c3fb
fix #6348
...
Former-commit-id: 142191e466
2024-12-17 10:06:46 +00:00
hoshi-hiyouga
0f49e9cb07
Merge pull request #6334 from hiyouga/hiyouga/add_examples
...
[assets] update wechat and examples
Former-commit-id: e2fbd07096
2024-12-15 01:37:01 +08:00
hiyouga
ba901bc000
update assets
...
Former-commit-id: 7059055e89
2024-12-14 17:36:03 +00:00
hiyouga
6f1e450739
fix mrope
...
Former-commit-id: 2811814fc4
2024-12-12 15:08:17 +00:00
hoshi-hiyouga
93d1cba06e
Merge pull request #6253 from hiyouga/hiyouga/qwen2vl_mm_proj
...
[model] support qwen2vl train proj only
Former-commit-id: bcb4fb353e
2024-12-05 20:25:33 +08:00
hiyouga
cf8cad8e7e
support qwen2vl train proj only
...
Former-commit-id: 99c62660c6
2024-12-05 10:37:42 +00:00
hoshi-hiyouga
255260cfcb
Merge pull request #6251 from hiyouga/hiyouga/vllm_qwen2vl_infer
...
[infer] support qwen2vl vllm infer
Former-commit-id: 561a8e56d9
2024-12-05 18:26:19 +08:00
hiyouga
88b06a0c7f
support qwen2vl vllm infer
...
Former-commit-id: 207f8b069c
2024-12-05 10:17:26 +00:00
hoshi-hiyouga
7f8c59144e
Merge pull request #6246 from hiyouga/hiyouga/update_examples
...
[examples] update examples
Former-commit-id: 967a6c12a7
2024-12-05 16:49:30 +08:00
hiyouga
90fb5605c1
update examples
...
Former-commit-id: e5584dc7ba
2024-12-05 08:48:25 +00:00
hoshi-hiyouga
9f9ad6435d
Merge pull request #6242 from hiyouga/hiyouga/fix_script
...
[script] fix scripts
Former-commit-id: c42890bb2b
2024-12-05 11:54:46 +08:00
hiyouga
819f487c8f
fix scripts
...
Former-commit-id: eb3e147d19
2024-12-05 03:47:32 +00:00
hoshi-hiyouga
9bbeba6323
Merge pull request #6160 from village-way/pr_dataloader
...
fix:tokenized_path not None and load_from_disk return Dataset Trigger…
Former-commit-id: cf29846830
2024-12-04 22:18:19 +08:00
hoshi-hiyouga
92940817e7
lint
...
Former-commit-id: 6a5074e466
2024-12-04 22:08:27 +08:00
hoshi-hiyouga
68614f6bc1
Merge pull request #6238 from hiyouga/hiyouga/vllm_batchinfer
...
[infer] feat: support batch infer in vllm
Former-commit-id: 8328bd8fbb
2024-12-04 21:59:13 +08:00
hiyouga
235cdcacee
support batch infer in vllm
...
Former-commit-id: 1324d158f9
2024-12-04 13:50:00 +00:00
hoshi-hiyouga
b2c67a989a
Merge pull request #6190 from JieShenAI/main
...
add vllm_infer script
Former-commit-id: dc78355002
2024-12-04 21:19:23 +08:00
hoshi-hiyouga
ed4c4bab49
Merge pull request #6170 from hykilpikonna/main
...
[+] Show the hostname in webui title
Former-commit-id: 263cb82bdb
2024-12-04 18:07:29 +08:00
hoshi-hiyouga
1804e8a491
Merge pull request #6233 from hiyouga/hiyouga/vlm_zero3
...
[data] fix vlm zero3 training
Former-commit-id: 187402203b
2024-12-04 17:51:10 +08:00
hiyouga
0ef1dc4dd5
fix vlm zero3 training
...
Former-commit-id: dbb9e5b70e
2024-12-04 09:40:39 +00:00
hoshi-hiyouga
b34c3bb796
Merge pull request #6224 from hiyouga/hiyouga-patch-1
...
[assets] chore: update wechat
Former-commit-id: 7965e9840c
2024-12-03 21:25:38 +08:00
hoshi-hiyouga
aa5535c622
update wechat
...
Former-commit-id: 722a396b69
2024-12-03 20:48:48 +08:00
JieShen
d4bf81b36a
add async call api
...
Former-commit-id: 4c61368600
2024-12-01 22:18:05 +08:00
JieShen
99265c7d2f
add vllm_infer script
...
Former-commit-id: 961e8c2d2e
2024-11-29 14:22:20 +08:00
Azalea
0efa34c9ef
[U] Compute hostname differently
...
Former-commit-id: 6554cdeedb
2024-11-28 22:23:41 -05:00
hoshi-hiyouga
f4729904f2
Merge pull request #6175 from hiyouga/hiyouga/add_qwq
...
[model] add QwQ
Former-commit-id: f2b2a37f08
2024-11-28 17:01:53 +08:00
hiyouga
1c3d86cd65
add qwq
...
Former-commit-id: 68a612115a
2024-11-28 08:50:57 +00:00
Azalea
f5e6e25a1b
[+] Show the hostname
...
Former-commit-id: dfb953b1ad
2024-11-28 12:25:02 +08:00
wangdepeng
ae09c6c214
fix:tokenized_path not None and load_from_disk return Dataset Trigger stuck
...
Former-commit-id: 4424d4de8a
2024-11-27 16:44:42 +08:00
hoshi-hiyouga
265a5821de
Merge pull request #6156 from hiyouga/hiyouga/add_o1
...
[data&model] add marco-o1, skywork-o1 and openo1
Former-commit-id: 86f41513c0
2024-11-27 14:36:01 +08:00
hiyouga
9822cb7bac
fix dataset
...
Former-commit-id: 046b6fb118
2024-11-27 06:27:44 +00:00
hiyouga
d51d96d594
add skywork o1
...
Former-commit-id: ec9ff8caa2
2024-11-27 05:51:59 +00:00
hiyouga
09a3a59c88
Merge remote-tracking branch 'origin/main' into hiyouga/add_o1
...
Former-commit-id: b7c7f3066f
2024-11-27 05:36:41 +00:00
hoshi-hiyouga
dfa4e927dd
Merge pull request #6157 from hiyouga/hiyouga/fix_ci
...
[ci] pin tokenizers version
Former-commit-id: 14d0d92bf3
2024-11-27 13:33:04 +08:00
hiyouga
61320965aa
pin tokenizers version
...
Former-commit-id: b7d4cf2caf
2024-11-27 05:24:58 +00:00
hiyouga
ab3782b0fa
add marco-o1 and openo1 dataset
...
Former-commit-id: 17afb7d410
2024-11-27 04:20:23 +00:00
hoshi-hiyouga
6cd90efb82
Merge pull request #6152 from hiyouga/hiyouga/add_num_proc_in_data_load
...
[data] add num_proc in load_dataset
Former-commit-id: b26c490ac3
2024-11-27 00:16:15 +08:00