hoshi-hiyouga
|
e12c80ace8
|
Merge pull request #6367 from hiyouga/hiyouga/add_model
[model&template] add llama3.3 & support llama3 tool prompt
|
2024-12-18 00:13:28 +08:00 |
|
hiyouga
|
b24ae55ebf
|
support llama3 tool prompt
|
2024-12-17 15:52:37 +00:00 |
|
hoshi-hiyouga
|
2a832e489b
|
Merge pull request #5819 from yafshar/remote_code
Add trust_remote_code Parameter and Set Default to False
|
2024-12-17 21:10:24 +08:00 |
|
Yaser Afshar
|
1c8ad22a5f
|
Add missing key to init_kwargs
|
2024-12-17 12:34:05 +00:00 |
|
Yaser Afshar
|
0943776326
|
Add trust_remote_code parameter and remove True
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False` to enhance security
|
2024-12-17 12:25:12 +00:00 |
|
hoshi-hiyouga
|
a665ad6178
|
Merge pull request #6364 from hiyouga/hiyouga/control_reenterent_gc
[model] support non-reentrant-gc
|
2024-12-17 19:58:36 +08:00 |
|
hiyouga
|
f319da6937
|
support non-reentrant-gc & fix #6358
|
2024-12-17 11:41:59 +00:00 |
|
hoshi-hiyouga
|
6973828307
|
Merge pull request #6363 from hiyouga/hiyouga/control_skip_eos
[infer] support control eos
|
2024-12-17 19:35:40 +08:00 |
|
hiyouga
|
eda76de32b
|
support control eos, fix #6345
|
2024-12-17 10:42:05 +00:00 |
|
hoshi-hiyouga
|
9708a39179
|
Merge pull request #6362 from hiyouga/hiyouga/mllm_packing
[model] generalized packing
|
2024-12-17 18:41:48 +08:00 |
|
hiyouga
|
2d107d3aef
|
generalized packing & fix #6343
|
2024-12-17 10:26:19 +00:00 |
|
hoshi-hiyouga
|
81815f053f
|
Merge pull request #6359 from hiyouga/hiyouga/fix_qwen2vl_infer
[model] fix qwen2vl infer
|
2024-12-17 18:15:23 +08:00 |
|
hiyouga
|
142191e466
|
fix #6348
|
2024-12-17 10:06:46 +00:00 |
|
hoshi-hiyouga
|
e2fbd07096
|
Merge pull request #6334 from hiyouga/hiyouga/add_examples
[assets] update wechat and examples
|
2024-12-15 01:37:01 +08:00 |
|
hiyouga
|
7059055e89
|
update assets
|
2024-12-14 17:36:03 +00:00 |
|
hiyouga
|
2811814fc4
|
fix mrope
|
2024-12-12 15:08:17 +00:00 |
|
hoshi-hiyouga
|
bcb4fb353e
|
Merge pull request #6253 from hiyouga/hiyouga/qwen2vl_mm_proj
[model] support qwen2vl train proj only
|
2024-12-05 20:25:33 +08:00 |
|
hiyouga
|
99c62660c6
|
support qwen2vl train proj only
|
2024-12-05 10:37:42 +00:00 |
|
hoshi-hiyouga
|
561a8e56d9
|
Merge pull request #6251 from hiyouga/hiyouga/vllm_qwen2vl_infer
[infer] support qwen2vl vllm infer
|
2024-12-05 18:26:19 +08:00 |
|
hiyouga
|
207f8b069c
|
support qwen2vl vllm infer
|
2024-12-05 10:17:26 +00:00 |
|
hoshi-hiyouga
|
967a6c12a7
|
Merge pull request #6246 from hiyouga/hiyouga/update_examples
[examples] update examples
|
2024-12-05 16:49:30 +08:00 |
|
hiyouga
|
e5584dc7ba
|
update examples
|
2024-12-05 08:48:25 +00:00 |
|
hoshi-hiyouga
|
c42890bb2b
|
Merge pull request #6242 from hiyouga/hiyouga/fix_script
[script] fix scripts
|
2024-12-05 11:54:46 +08:00 |
|
hiyouga
|
eb3e147d19
|
fix scripts
|
2024-12-05 03:47:32 +00:00 |
|
hoshi-hiyouga
|
cf29846830
|
Merge pull request #6160 from village-way/pr_dataloader
fix:tokenized_path not None and load_from_disk return Dataset Trigger…
|
2024-12-04 22:18:19 +08:00 |
|
hoshi-hiyouga
|
6a5074e466
|
lint
|
2024-12-04 22:08:27 +08:00 |
|
hoshi-hiyouga
|
8328bd8fbb
|
Merge pull request #6238 from hiyouga/hiyouga/vllm_batchinfer
[infer] feat: support batch infer in vllm
|
2024-12-04 21:59:13 +08:00 |
|
hiyouga
|
1324d158f9
|
support batch infer in vllm
|
2024-12-04 13:50:00 +00:00 |
|
hoshi-hiyouga
|
dc78355002
|
Merge pull request #6190 from JieShenAI/main
add vllm_infer script
|
2024-12-04 21:19:23 +08:00 |
|
hoshi-hiyouga
|
263cb82bdb
|
Merge pull request #6170 from hykilpikonna/main
[+] Show the hostname in webui title
|
2024-12-04 18:07:29 +08:00 |
|
hoshi-hiyouga
|
187402203b
|
Merge pull request #6233 from hiyouga/hiyouga/vlm_zero3
[data] fix vlm zero3 training
|
2024-12-04 17:51:10 +08:00 |
|
hiyouga
|
dbb9e5b70e
|
fix vlm zero3 training
|
2024-12-04 09:40:39 +00:00 |
|
hoshi-hiyouga
|
7965e9840c
|
Merge pull request #6224 from hiyouga/hiyouga-patch-1
[assets] chore: update wechat
|
2024-12-03 21:25:38 +08:00 |
|
hoshi-hiyouga
|
722a396b69
|
update wechat
|
2024-12-03 20:48:48 +08:00 |
|
JieShen
|
4c61368600
|
add async call api
|
2024-12-01 22:18:05 +08:00 |
|
JieShen
|
961e8c2d2e
|
add vllm_infer script
|
2024-11-29 14:22:20 +08:00 |
|
Azalea
|
6554cdeedb
|
[U] Compute hostname differently
|
2024-11-28 22:23:41 -05:00 |
|
hoshi-hiyouga
|
f2b2a37f08
|
Merge pull request #6175 from hiyouga/hiyouga/add_qwq
[model] add QwQ
|
2024-11-28 17:01:53 +08:00 |
|
hiyouga
|
68a612115a
|
add qwq
|
2024-11-28 08:50:57 +00:00 |
|
Azalea
|
dfb953b1ad
|
[+] Show the hostname
|
2024-11-28 12:25:02 +08:00 |
|
wangdepeng
|
4424d4de8a
|
fix:tokenized_path not None and load_from_disk return Dataset Trigger stuck
|
2024-11-27 16:44:42 +08:00 |
|
hoshi-hiyouga
|
86f41513c0
|
Merge pull request #6156 from hiyouga/hiyouga/add_o1
[data&model] add marco-o1, skywork-o1 and openo1
|
2024-11-27 14:36:01 +08:00 |
|
hiyouga
|
046b6fb118
|
fix dataset
|
2024-11-27 06:27:44 +00:00 |
|
hiyouga
|
ec9ff8caa2
|
add skywork o1
|
2024-11-27 05:51:59 +00:00 |
|
hiyouga
|
b7c7f3066f
|
Merge remote-tracking branch 'origin/main' into hiyouga/add_o1
|
2024-11-27 05:36:41 +00:00 |
|
hoshi-hiyouga
|
14d0d92bf3
|
Merge pull request #6157 from hiyouga/hiyouga/fix_ci
[ci] pin tokenizers version
|
2024-11-27 13:33:04 +08:00 |
|
hiyouga
|
b7d4cf2caf
|
pin tokenizers version
|
2024-11-27 05:24:58 +00:00 |
|
hiyouga
|
17afb7d410
|
add marco-o1 and openo1 dataset
|
2024-11-27 04:20:23 +00:00 |
|
hoshi-hiyouga
|
b26c490ac3
|
Merge pull request #6152 from hiyouga/hiyouga/add_num_proc_in_data_load
[data] add num_proc in load_dataset
|
2024-11-27 00:16:15 +08:00 |
|
hoshi-hiyouga
|
88f087c8b9
|
Merge pull request #6151 from hiyouga/hiyouga/fix_mllama
[model] fix mllama cross mask
|
2024-11-27 00:07:54 +08:00 |
|