|
+| ---- | ---- |
----
@@ -106,7 +101,7 @@ Choose your path:
## Blogs
-- [Fine-tune Llama3.1-70B for Medical Diagnosis using LLaMA-Factory](https://docs.alayanew.com/docs/documents/bestPractice/bigModel/llama70B/) (Chinese)
+- [Fine-tune Llama3.1-70B for Medical Diagnosis using LLaMA-Factory](https://docs.alayanew.com/docs/documents/bestPractice/bigModel/llama70B/?utm_source=LLaMA-Factory) (Chinese)
- [A One-Stop Code-Free Model Reinforcement Learning and Deployment Platform based on LLaMA-Factory and EasyR1](https://aws.amazon.com/cn/blogs/china/building-llm-model-hub-based-on-llamafactory-and-easyr1/) (Chinese)
- [How Apoidea Group enhances visual information extraction from banking documents with multimodal models using LLaMA-Factory on Amazon SageMaker HyperPod](https://aws.amazon.com/cn/blogs/machine-learning/how-apoidea-group-enhances-visual-information-extraction-from-banking-documents-with-multimodal-models-using-llama-factory-on-amazon-sagemaker-hyperpod/) (English)
- [Easy Dataset × LLaMA Factory: Enabling LLMs to Efficiently Learn Domain Knowledge](https://buaa-act.feishu.cn/wiki/GVzlwYcRFiR8OLkHbL6cQpYin7g) (English)
diff --git a/README_zh.md b/README_zh.md
index f759a168..acf6d6e5 100644
--- a/README_zh.md
+++ b/README_zh.md
@@ -5,7 +5,7 @@
[](https://github.com/hiyouga/LLaMA-Factory/graphs/contributors)
[](https://github.com/hiyouga/LLaMA-Factory/actions/workflows/tests.yml)
[](https://pypi.org/project/llamafactory/)
-[](https://scholar.google.com/scholar?cites=12620864006390196564)
+[](https://scholar.google.com/scholar?cites=12620864006390196564)
[](https://hub.docker.com/r/hiyouga/llamafactory/tags)
[](https://twitter.com/llamafactory_ai)
@@ -25,13 +25,8 @@
### 赞助商 ❤️
-
-
-#### [Warp,面向开发者的智能终端](https://warp.dev/llama-factory)
-
-[适用于 MacOS、Linux 和 Windows](https://warp.dev/llama-factory)
+| Warp,面向开发者的智能终端 适用于 MacOS、Linux 和 Windows | |
+| ---- | ---- |
----
@@ -108,7 +103,7 @@ https://github.com/user-attachments/assets/43b700c6-a178-41db-b1f8-8190a5d3fcfc
## 官方博客
-- [使用 LLaMA-Factory 微调 Llama3.1-70B 医学诊断模型](https://docs.alayanew.com/docs/documents/bestPractice/bigModel/llama70B/)(中文)
+- [使用 LLaMA-Factory 微调 Llama3.1-70B 医学诊断模型](https://docs.alayanew.com/docs/documents/bestPractice/bigModel/llama70B/?utm_source=LLaMA-Factory)(中文)
- [基于 LLaMA-Factory 和 EasyR1 打造一站式无代码大模型强化学习和部署平台 LLM Model Hub](https://aws.amazon.com/cn/blogs/china/building-llm-model-hub-based-on-llamafactory-and-easyr1/)(中文)
- [通过亚马逊 SageMaker HyperPod 上的 LLaMA-Factory 增强多模态模型银行文档的视觉信息提取](https://aws.amazon.com/cn/blogs/machine-learning/how-apoidea-group-enhances-visual-information-extraction-from-banking-documents-with-multimodal-models-using-llama-factory-on-amazon-sagemaker-hyperpod/)(英文)
- [Easy Dataset × LLaMA Factory: 让大模型高效学习领域知识](https://buaa-act.feishu.cn/wiki/KY9xwTGs1iqHrRkjXBwcZP9WnL9)(中文)
diff --git a/assets/serpapi.svg b/assets/serpapi.svg
new file mode 100644
index 00000000..79bdf400
--- /dev/null
+++ b/assets/serpapi.svg
@@ -0,0 +1 @@
+
diff --git a/assets/warp.jpg b/assets/warp.jpg
new file mode 100644
index 00000000..64bc01be
Binary files /dev/null and b/assets/warp.jpg differ
diff --git a/assets/wechat.jpg b/assets/wechat.jpg
index 72ea605a..3a9b4663 100644
Binary files a/assets/wechat.jpg and b/assets/wechat.jpg differ
diff --git a/assets/wechat_npu.jpg b/assets/wechat_npu.jpg
index e13da8ec..9eb6d305 100644
Binary files a/assets/wechat_npu.jpg and b/assets/wechat_npu.jpg differ
diff --git a/src/llamafactory/data/template.py b/src/llamafactory/data/template.py
index 7e9cb3bf..51fc2b02 100644
--- a/src/llamafactory/data/template.py
+++ b/src/llamafactory/data/template.py
@@ -1014,6 +1014,7 @@ register_template(
)
+# copied from glm4 template
register_template(
name="glm4_moe",
format_user=StringFormatter(slots=["<|user|>\n{{content}}<|assistant|>"]),
@@ -1101,7 +1102,7 @@ register_template(
),
format_tools=ToolFormatter(tool_format="default"),
stop_words=["<|end_of_text|>"],
- default_system=("You are Granite, developed by IBM. You are a helpful AI assistant."),
+ default_system="You are Granite, developed by IBM. You are a helpful AI assistant.",
)
diff --git a/src/llamafactory/extras/constants.py b/src/llamafactory/extras/constants.py
index 7e83d93c..6b1a8f91 100644
--- a/src/llamafactory/extras/constants.py
+++ b/src/llamafactory/extras/constants.py
@@ -1714,6 +1714,7 @@ register_model_group(
multimodal=True,
)
+
register_model_group(
models={
"MiniCPM-V-4": {
@@ -1725,6 +1726,7 @@ register_model_group(
multimodal=True,
)
+
register_model_group(
models={
"Ministral-8B-Instruct-2410": {