Commit Graph

403 Commits

Author SHA1 Message Date
hoshi-hiyouga
7a04021d04 [optim] clean apollo (#6645)
* clean apollo code

* update readme
2025-01-15 01:42:50 +08:00
Zhangchi Feng
9b7ba093c7 update readme of MiniCPM-o (#6642)
* fix template name

* tiny fix

* support minicpm-o-2.6

* support inference of minicpmv

* update readme
2025-01-14 21:22:35 +08:00
Haian Huang(深度眸)
deacc00b12 Support InternLM3 Dense 8B Model (#6640)
* support internlm3

* update

* update

* update

* add hint
2025-01-14 18:07:27 +08:00
Zhangchi Feng
c3fda5046d Support new features of MiniCPM-V (#6626)
* fix template name

* tiny fix

* support minicpm-o-2.6
2025-01-14 00:26:19 +08:00
hoshi-hiyouga
e3e2c8c689 [inference] fix stop token for object detection (#6624)
* fix stop token

* update minicpm data pipeline

* fix npu qlora examples
2025-01-13 21:34:20 +08:00
codingma
03de5ac912 add nf4 qlora support on Ascend NPU (#6601)
* add nf4 qlora support on Ascend NPU

* add transformers version check

* add python>=3.10 requirement description for npu

* tiny fix

---------

Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-01-13 19:43:36 +08:00
Zhangchi Feng
3077f20339 Fix template name of MiniCPM-V (#6620)
* fix template name

* tiny fix
2025-01-13 16:46:48 +08:00
fzc8578
e45329e745 add minicpmv2.6 2025-01-10 23:45:44 +08:00
hiyouga
ae16ea755d improve template, add phi4 model 2025-01-09 18:27:54 +00:00
hiyouga
4b8add7287 update model name 2025-01-02 12:19:21 +00:00
hoshi-hiyouga
a766cad5d4 Merge pull request #6514 from hiyouga/hiyouga/add_project
[readme] add project
2025-01-02 20:16:15 +08:00
hiyouga
b3e1137fbb add project 2025-01-02 12:15:41 +00:00
hiyouga
67442bd497 add gpt2 model 2025-01-02 12:07:38 +00:00
hiyouga
e67b9dcc3a add deepseek3 model 2024-12-30 13:39:20 +00:00
hiyouga
ee0e400f41 add qvq #6439 2024-12-25 07:52:41 +00:00
hiyouga
8fd38d273e update readme 2024-12-23 14:08:59 +00:00
hoshi-hiyouga
c23a4d0658 Merge pull request #5922 from Tuyohai/main
support granite3 models
2024-12-23 16:46:02 +08:00
hiyouga
5111cac6f8 support report custom args 2024-12-21 21:42:45 +00:00
ZeYi Lin
744ef8c268 docs: use swanlab 2024-12-21 20:59:25 +08:00
hiyouga
d3509050dc add paligemma2 2024-12-18 08:57:26 +00:00
hoshi-hiyouga
015f213788 Merge pull request #6313 from ge-xing/main
support telechat2 model
2024-12-18 16:16:17 +08:00
hiyouga
df5655f61c fix llama3 tool template 2024-12-17 17:05:10 +00:00
zhaohu xing
04f19ed0f3 support telechat2 model 2024-12-17 12:15:33 +00:00
hiyouga
1324d158f9 support batch infer in vllm 2024-12-04 13:50:00 +00:00
hiyouga
68a612115a add qwq 2024-11-28 08:50:57 +00:00
hiyouga
ec9ff8caa2 add skywork o1 2024-11-27 05:51:59 +00:00
hiyouga
17afb7d410 add marco-o1 and openo1 dataset 2024-11-27 04:20:23 +00:00
hiyouga
a89ad72d03 update readme 2024-11-23 19:27:18 +00:00
hiyouga
446441fdb0 fix inputs 2024-11-23 18:26:02 +00:00
steven
6eefb4d7d2 support granite3 models 2024-11-04 10:35:03 +08:00
hiyouga
e7ed5091e1 update readme 2024-11-02 21:28:04 +08:00
Cuiyn
a15a69ab44 Add support for Index 2024-11-02 13:45:27 +08:00
hiyouga
e824b715ad add examples 2024-11-01 08:41:54 +00:00
hiyouga
2417b70a62 update readme 2024-10-30 09:14:01 +00:00
hoshi-hiyouga
9009a467e6 Merge pull request #5581 from Kuangdd01/pixtral-patch
[WIP] Support Pixtral-12B
2024-10-29 22:29:10 +08:00
hoshi-hiyouga
8fa20bf427 Update README_zh.md 2024-10-29 21:58:03 +08:00
hoshi-hiyouga
08d9a03c30 Update README_zh.md 2024-10-29 21:19:17 +08:00
grok
6fcabb3349 Update README_zh.md 2024-10-23 23:50:56 +08:00
grok
18a7f3ff76 Update README_zh.md 2024-10-23 23:36:14 +08:00
hoshi-hiyouga
110e4c548d Update README_zh.md 2024-10-17 19:47:33 +08:00
Kingsley
95330893c5 Merge branch 'hiyouga:main' into pixtral-patch 2024-10-13 17:42:02 +08:00
hiyouga
3af57795dd tiny fix 2024-10-11 23:51:54 +08:00
huniu20
0f669f221a 1. add model and dataset info to support webui 2024-10-10 16:46:34 +08:00
huniu20
24ebe187e3 1. add modelers hub support 2024-10-09 17:21:37 +08:00
Kingsley
93a441a6b7 Merge branch 'hiyouga:main' into pixtral-patch 2024-10-08 21:04:08 +08:00
hiyouga
1a7483c1a5 update readme 2024-10-07 11:31:18 +08:00
Kingsley
c668568bc7 unfactor md 2024-09-30 23:36:16 +08:00
Kingsley
2166b9bc6b fix some errors due to inconsistency of model cards 2024-09-30 19:58:34 +08:00
Zhangchi Feng
26f45829b4 Merge branch 'main' into pixtral-patch 2024-09-30 12:37:03 +08:00
hoshi-hiyouga
e472f355f2 Update README_zh.md 2024-09-29 23:56:32 +08:00