diff --git a/README.md b/README.md
index 5f7bb4e1..2f82b21d 100644
--- a/README.md
+++ b/README.md
@@ -5,7 +5,7 @@
 [![GitHub contributors](https://img.shields.io/github/contributors/hiyouga/LLaMA-Factory?color=orange)](https://github.com/hiyouga/LLaMA-Factory/graphs/contributors)
 [![GitHub workflow](https://github.com/hiyouga/LLaMA-Factory/actions/workflows/tests.yml/badge.svg)](https://github.com/hiyouga/LLaMA-Factory/actions/workflows/tests.yml)
 [![PyPI](https://img.shields.io/pypi/v/llamafactory)](https://pypi.org/project/llamafactory/)
-[![Citation](https://img.shields.io/badge/citation-730-green)](https://scholar.google.com/scholar?cites=12620864006390196564)
+[![Citation](https://img.shields.io/badge/citation-760-green)](https://scholar.google.com/scholar?cites=12620864006390196564)
 [![Docker Pulls](https://img.shields.io/docker/pulls/hiyouga/llamafactory)](https://hub.docker.com/r/hiyouga/llamafactory/tags)
 [![Twitter](https://img.shields.io/twitter/follow/llamafactory_ai)](https://twitter.com/llamafactory_ai)

@@ -25,13 +25,8 @@

 ### Supporters ❤️

-
- Warp sponsorship
-
-
-#### [Warp, the agentic terminal for developers](https://warp.dev/llama-factory)
-
-[Available for MacOS, Linux, & Windows](https://warp.dev/llama-factory)
+|
Warp sponsorship
Warp, the agentic terminal for developers
Available for MacOS, Linux, & Windows | SerpAPI sponsorship |
+| ---- | ---- |

 ----

@@ -106,7 +101,7 @@ Choose your path:

 ## Blogs

-- [Fine-tune Llama3.1-70B for Medical Diagnosis using LLaMA-Factory](https://docs.alayanew.com/docs/documents/bestPractice/bigModel/llama70B/) (Chinese)
+- [Fine-tune Llama3.1-70B for Medical Diagnosis using LLaMA-Factory](https://docs.alayanew.com/docs/documents/bestPractice/bigModel/llama70B/?utm_source=LLaMA-Factory) (Chinese)
 - [A One-Stop Code-Free Model Reinforcement Learning and Deployment Platform based on LLaMA-Factory and EasyR1](https://aws.amazon.com/cn/blogs/china/building-llm-model-hub-based-on-llamafactory-and-easyr1/) (Chinese)
 - [How Apoidea Group enhances visual information extraction from banking documents with multimodal models using LLaMA-Factory on Amazon SageMaker HyperPod](https://aws.amazon.com/cn/blogs/machine-learning/how-apoidea-group-enhances-visual-information-extraction-from-banking-documents-with-multimodal-models-using-llama-factory-on-amazon-sagemaker-hyperpod/) (English)
 - [Easy Dataset × LLaMA Factory: Enabling LLMs to Efficiently Learn Domain Knowledge](https://buaa-act.feishu.cn/wiki/GVzlwYcRFiR8OLkHbL6cQpYin7g) (English)
diff --git a/README_zh.md b/README_zh.md
index f759a168..acf6d6e5 100644
--- a/README_zh.md
+++ b/README_zh.md
@@ -5,7 +5,7 @@
 [![GitHub contributors](https://img.shields.io/github/contributors/hiyouga/LLaMA-Factory?color=orange)](https://github.com/hiyouga/LLaMA-Factory/graphs/contributors)
 [![GitHub workflow](https://github.com/hiyouga/LLaMA-Factory/actions/workflows/tests.yml/badge.svg)](https://github.com/hiyouga/LLaMA-Factory/actions/workflows/tests.yml)
 [![PyPI](https://img.shields.io/pypi/v/llamafactory)](https://pypi.org/project/llamafactory/)
-[![Citation](https://img.shields.io/badge/citation-730-green)](https://scholar.google.com/scholar?cites=12620864006390196564)
+[![Citation](https://img.shields.io/badge/citation-760-green)](https://scholar.google.com/scholar?cites=12620864006390196564)
 [![Docker Pulls](https://img.shields.io/docker/pulls/hiyouga/llamafactory)](https://hub.docker.com/r/hiyouga/llamafactory/tags)
 [![Twitter](https://img.shields.io/twitter/follow/llamafactory_ai)](https://twitter.com/llamafactory_ai)

@@ -25,13 +25,8 @@

 ### 赞助商 ❤️

-
- Warp sponsorship
-
-
-#### [Warp,面向开发者的智能终端](https://warp.dev/llama-factory)
-
-[适用于 MacOS、Linux 和 Windows](https://warp.dev/llama-factory)
+|
Warp sponsorship
Warp,面向开发者的智能终端
适用于 MacOS、Linux 和 Windows | SerpAPI sponsorship |
+| ---- | ---- |

 ----

@@ -108,7 +103,7 @@ https://github.com/user-attachments/assets/43b700c6-a178-41db-b1f8-8190a5d3fcfc

 ## 官方博客

-- [使用 LLaMA-Factory 微调 Llama3.1-70B 医学诊断模型](https://docs.alayanew.com/docs/documents/bestPractice/bigModel/llama70B/)(中文)
+- [使用 LLaMA-Factory 微调 Llama3.1-70B 医学诊断模型](https://docs.alayanew.com/docs/documents/bestPractice/bigModel/llama70B/?utm_source=LLaMA-Factory)(中文)
 - [基于 LLaMA-Factory 和 EasyR1 打造一站式无代码大模型强化学习和部署平台 LLM Model Hub](https://aws.amazon.com/cn/blogs/china/building-llm-model-hub-based-on-llamafactory-and-easyr1/)(中文)
 - [通过亚马逊 SageMaker HyperPod 上的 LLaMA-Factory 增强多模态模型银行文档的视觉信息提取](https://aws.amazon.com/cn/blogs/machine-learning/how-apoidea-group-enhances-visual-information-extraction-from-banking-documents-with-multimodal-models-using-llama-factory-on-amazon-sagemaker-hyperpod/)(英文)
 - [Easy Dataset × LLaMA Factory: 让大模型高效学习领域知识](https://buaa-act.feishu.cn/wiki/KY9xwTGs1iqHrRkjXBwcZP9WnL9)(中文)
diff --git a/assets/serpapi.svg b/assets/serpapi.svg
new file mode 100644
index 00000000..79bdf400
--- /dev/null
+++ b/assets/serpapi.svg
@@ -0,0 +1 @@
+
diff --git a/assets/warp.jpg b/assets/warp.jpg
new file mode 100644
index 00000000..64bc01be
Binary files /dev/null and b/assets/warp.jpg differ
diff --git a/assets/wechat.jpg b/assets/wechat.jpg
index 72ea605a..3a9b4663 100644
Binary files a/assets/wechat.jpg and b/assets/wechat.jpg differ
diff --git a/assets/wechat_npu.jpg b/assets/wechat_npu.jpg
index e13da8ec..9eb6d305 100644
Binary files a/assets/wechat_npu.jpg and b/assets/wechat_npu.jpg differ
diff --git a/src/llamafactory/data/template.py b/src/llamafactory/data/template.py
index 7e9cb3bf..51fc2b02 100644
--- a/src/llamafactory/data/template.py
+++ b/src/llamafactory/data/template.py
@@ -1014,6 +1014,7 @@ register_template(
 )


+# copied from glm4 template
 register_template(
     name="glm4_moe",
     format_user=StringFormatter(slots=["<|user|>\n{{content}}<|assistant|>"]),
@@ -1101,7 +1102,7 @@ register_template(
     ),
     format_tools=ToolFormatter(tool_format="default"),
     stop_words=["<|end_of_text|>"],
-    default_system=("You are Granite, developed by IBM. You are a helpful AI assistant."),
+    default_system="You are Granite, developed by IBM. You are a helpful AI assistant.",
 )


diff --git a/src/llamafactory/extras/constants.py b/src/llamafactory/extras/constants.py
index 7e83d93c..6b1a8f91 100644
--- a/src/llamafactory/extras/constants.py
+++ b/src/llamafactory/extras/constants.py
@@ -1714,6 +1714,7 @@ register_model_group(
     multimodal=True,
 )

+
 register_model_group(
     models={
         "MiniCPM-V-4": {
@@ -1725,6 +1726,7 @@ register_model_group(
     multimodal=True,
 )

+
 register_model_group(
     models={
         "Ministral-8B-Instruct-2410": {