502 Commits

Author SHA1 Message Date
Zhangchi Feng
57043fb4e6 update readme of MiniCPM-o (#6642)
* fix template name

* tiny fix

* support minicpm-o-2.6

* support inference of minicpmv

* update readme

Former-commit-id: 9b7ba093c7
2025-01-14 21:22:35 +08:00
Haian Huang(深度眸)
864ee06243 Support InternLM3 Dense 8B Model (#6640)
* support internlm3

* update

* update

* update

* add hint

Former-commit-id: deacc00b12
2025-01-14 18:07:27 +08:00
Zhangchi Feng
201a495154 Support new features of MiniCPM-V (#6626)
* fix template name

* tiny fix

* support minicpm-o-2.6

Former-commit-id: c3fda5046d
2025-01-14 00:26:19 +08:00
hoshi-hiyouga
d8cba9464f [inference] fix stop token for object detection (#6624)
* fix stop token

* update minicpm data pipeline

* fix npu qlora examples

Former-commit-id: e3e2c8c689
2025-01-13 21:34:20 +08:00
codingma
089c7d5e51 add nf4 qlora support on Ascend NPU (#6601)
* add nf4 qlora support on Ascend NPU

* add transformers version check

* add python>=3.10 requirement description for npu

* tiny fix

---------

Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 03de5ac912
2025-01-13 19:43:36 +08:00
Zhangchi Feng
15bba15725 Fix template name of MiniCPM-V (#6620)
* fix template name

* tiny fix

Former-commit-id: 3077f20339
2025-01-13 16:46:48 +08:00
fzc8578
9213e48fa2 add minicpmv2.6
Former-commit-id: e45329e745
2025-01-10 23:45:44 +08:00
hiyouga
b471def13d improve template, add phi4 model
Former-commit-id: ae16ea755d
2025-01-09 18:27:54 +00:00
hiyouga
528fb4f799 update model name
Former-commit-id: 4b8add7287
2025-01-02 12:19:21 +00:00
hoshi-hiyouga
aa7ec44367 Merge pull request #6514 from hiyouga/hiyouga/add_project
[readme] add project

Former-commit-id: a766cad5d4
2025-01-02 20:16:15 +08:00
hiyouga
9a3afbd5d1 add project
Former-commit-id: b3e1137fbb
2025-01-02 12:15:41 +00:00
hiyouga
37c60c7d14 add gpt2 model
Former-commit-id: 67442bd497
2025-01-02 12:07:38 +00:00
hiyouga
d0e729cd33 add deepseek3 model
Former-commit-id: e67b9dcc3a
2024-12-30 13:39:20 +00:00
hiyouga
c83b74ab9e add qvq #6439
Former-commit-id: ee0e400f41
2024-12-25 07:52:41 +00:00
hiyouga
353259f03f update readme
Former-commit-id: 8fd38d273e
2024-12-23 14:08:59 +00:00
hoshi-hiyouga
8265d6a228 Merge pull request #5922 from Tuyohai/main
support granite3 models

Former-commit-id: c23a4d0658
2024-12-23 16:46:02 +08:00
hiyouga
47c2d91933 support report custom args
Former-commit-id: 5111cac6f8
2024-12-21 21:42:45 +00:00
ZeYi Lin
1c1d6bea43 docs: use swanlab
Former-commit-id: 744ef8c268
2024-12-21 20:59:25 +08:00
hiyouga
433d116080 add paligemma2
Former-commit-id: d3509050dc
2024-12-18 08:57:26 +00:00
hoshi-hiyouga
d43080b534 Merge pull request #6313 from ge-xing/main
support telechat2 model

Former-commit-id: 015f213788
2024-12-18 16:16:17 +08:00
hiyouga
f6a2bfc0e8 fix llama3 tool template
Former-commit-id: df5655f61c
2024-12-17 17:05:10 +00:00
zhaohu xing
cfb4c42ae4 support telechat2 model
Former-commit-id: 04f19ed0f3
2024-12-17 12:15:33 +00:00
hiyouga
235cdcacee support batch infer in vllm
Former-commit-id: 1324d158f9
2024-12-04 13:50:00 +00:00
hiyouga
1c3d86cd65 add qwq
Former-commit-id: 68a612115a
2024-11-28 08:50:57 +00:00
hiyouga
d51d96d594 add skywork o1
Former-commit-id: ec9ff8caa2
2024-11-27 05:51:59 +00:00
hiyouga
ab3782b0fa add marco-o1 and openo1 dataset
Former-commit-id: 17afb7d410
2024-11-27 04:20:23 +00:00
hiyouga
7eaafe08bc update readme
Former-commit-id: a89ad72d03
2024-11-23 19:27:18 +00:00
hiyouga
e99031daa4 fix inputs
Former-commit-id: 446441fdb0
2024-11-23 18:26:02 +00:00
steven
7f7ee0a660 support granite3 models
Former-commit-id: 6eefb4d7d2
2024-11-04 10:35:03 +08:00
hiyouga
f05685c7cf update readme
Former-commit-id: e7ed5091e1
2024-11-02 21:28:04 +08:00
Cuiyn
7806bde8ad Add support for Index
Former-commit-id: a15a69ab44
2024-11-02 13:45:27 +08:00
hiyouga
2eba98e152 add examples
Former-commit-id: e824b715ad
2024-11-01 08:41:54 +00:00
hiyouga
7487bd7b1f update readme
Former-commit-id: 2417b70a62
2024-10-30 09:14:01 +00:00
hoshi-hiyouga
5142faca8f Merge pull request #5581 from Kuangdd01/pixtral-patch
[WIP] Support Pixtral-12B

Former-commit-id: 9009a467e6
2024-10-29 22:29:10 +08:00
hoshi-hiyouga
eca50b89a2 Update README_zh.md
Former-commit-id: 8fa20bf427
2024-10-29 21:58:03 +08:00
hoshi-hiyouga
b86b869187 Update README_zh.md
Former-commit-id: 08d9a03c30
2024-10-29 21:19:17 +08:00
grok
c24d477bdb Update README_zh.md
Former-commit-id: 6fcabb3349
2024-10-23 23:50:56 +08:00
grok
823d7f5c81 Update README_zh.md
Former-commit-id: 18a7f3ff76
2024-10-23 23:36:14 +08:00
hoshi-hiyouga
6fbf77aa54 Update README_zh.md
Former-commit-id: 110e4c548d
2024-10-17 19:47:33 +08:00
Kingsley
8ea1c5c69e Merge branch 'hiyouga:main' into pixtral-patch
Former-commit-id: 95330893c5
2024-10-13 17:42:02 +08:00
hiyouga
e90a1199da tiny fix
Former-commit-id: 3af57795dd
2024-10-11 23:51:54 +08:00
huniu20
132c1f1b0f 1. add model and dataset info to support webui
Former-commit-id: 0f669f221a
2024-10-10 16:46:34 +08:00
huniu20
26e897e861 1. add modelers hub support
Former-commit-id: 24ebe187e3
2024-10-09 17:21:37 +08:00
Kingsley
5523a6fd2c Merge branch 'hiyouga:main' into pixtral-patch
Former-commit-id: 93a441a6b7
2024-10-08 21:04:08 +08:00
hiyouga
74653597f1 update readme
Former-commit-id: 1a7483c1a5
2024-10-07 11:31:18 +08:00
Kingsley
dd2d1c3154 unfactor md
Former-commit-id: c668568bc7
2024-09-30 23:36:16 +08:00
Kingsley
94ce8f561f fix some errors due to inconsistency of model cards
Former-commit-id: 2166b9bc6b
2024-09-30 19:58:34 +08:00
Zhangchi Feng
69e801d456 Merge branch 'main' into pixtral-patch
Former-commit-id: 26f45829b4
2024-09-30 12:37:03 +08:00
hoshi-hiyouga
f051bff1e6 Update README_zh.md
Former-commit-id: e472f355f2
2024-09-29 23:56:32 +08:00
Zhangchi Feng
8e164f3594 Merge branch 'main' into main
Former-commit-id: 83abf86657
2024-09-29 21:32:54 +08:00