562 Commits

Author SHA1 Message Date
qvlehao
f5350b103b [model] add deepseek-R1 & show think process (#6767)
Former-commit-id: 28417f862a
2025-01-29 12:16:26 +08:00
hoshi-hiyouga
ee0b3b1e1a [assets] update wechat (#6692)
Former-commit-id: 17b470630d
2025-01-18 12:35:03 +08:00
hoshi-hiyouga
320e40d873 update readme (#6648)
Former-commit-id: 563be2286a
2025-01-15 11:06:19 +08:00
hoshi-hiyouga
9ef85f8fc4 [optim] clean apollo (#6645)
* clean apollo code

* update readme

Former-commit-id: 7a04021d04
2025-01-15 01:42:50 +08:00
Zhangchi Feng
57043fb4e6 update readme of MiniCPM-o (#6642)
* fix template name

* tiny fix

* support minicpm-o-2.6

* support inference of minicpmv

* update readme

Former-commit-id: 9b7ba093c7
2025-01-14 21:22:35 +08:00
hoshi-hiyouga
91433d639c lint (#6641)
Former-commit-id: 1278c3e92e
2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)
864ee06243 Support InternLM3 Dense 8B Model (#6640)
* support internlm3

* update

* update

* update

* add hint

Former-commit-id: deacc00b12
2025-01-14 18:07:27 +08:00
Zhangchi Feng
201a495154 Support new features of MiniCPM-V (#6626)
* fix template name

* tiny fix

* support minicpm-o-2.6

Former-commit-id: c3fda5046d
2025-01-14 00:26:19 +08:00
hoshi-hiyouga
d8cba9464f [inference] fix stop token for object detection (#6624)
* fix stop token

* update minicpm data pipeline

* fix npu qlora examples

Former-commit-id: e3e2c8c689
2025-01-13 21:34:20 +08:00
codingma
089c7d5e51 add nf4 qlora support on Ascend NPU (#6601)
* add nf4 qlora support on Ascend NPU

* add transformers version check

* add python>=3.10 requirement description for npu

* tiny fix

---------

Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 03de5ac912
2025-01-13 19:43:36 +08:00
Zhangchi Feng
15bba15725 Fix template name of MiniCPM-V (#6620)
* fix template name

* tiny fix

Former-commit-id: 3077f20339
2025-01-13 16:46:48 +08:00
fzc8578
9213e48fa2 add minicpmv2.6
Former-commit-id: e45329e745
2025-01-10 23:45:44 +08:00
hiyouga
b471def13d improve template, add phi4 model
Former-commit-id: ae16ea755d
2025-01-09 18:27:54 +00:00
hiyouga
528fb4f799 update model name
Former-commit-id: 4b8add7287
2025-01-02 12:19:21 +00:00
hoshi-hiyouga
aa7ec44367 Merge pull request #6514 from hiyouga/hiyouga/add_project
[readme] add project

Former-commit-id: a766cad5d4
2025-01-02 20:16:15 +08:00
hiyouga
9a3afbd5d1 add project
Former-commit-id: b3e1137fbb
2025-01-02 12:15:41 +00:00
hiyouga
37c60c7d14 add gpt2 model
Former-commit-id: 67442bd497
2025-01-02 12:07:38 +00:00
hiyouga
d0e729cd33 add deepseek3 model
Former-commit-id: e67b9dcc3a
2024-12-30 13:39:20 +00:00
hiyouga
c83b74ab9e add qvq #6439
Former-commit-id: ee0e400f41
2024-12-25 07:52:41 +00:00
hiyouga
353259f03f update readme
Former-commit-id: 8fd38d273e
2024-12-23 14:08:59 +00:00
hoshi-hiyouga
8265d6a228 Merge pull request #5922 from Tuyohai/main
support granite3 models

Former-commit-id: c23a4d0658
2024-12-23 16:46:02 +08:00
hiyouga
47c2d91933 support report custom args
Former-commit-id: 5111cac6f8
2024-12-21 21:42:45 +00:00
ZeYi Lin
1c1d6bea43 docs: use swanlab
Former-commit-id: 744ef8c268
2024-12-21 20:59:25 +08:00
hiyouga
433d116080 add paligemma2
Former-commit-id: d3509050dc
2024-12-18 08:57:26 +00:00
hoshi-hiyouga
d43080b534 Merge pull request #6313 from ge-xing/main
support telechat2 model

Former-commit-id: 015f213788
2024-12-18 16:16:17 +08:00
hiyouga
f6a2bfc0e8 fix llama3 tool template
Former-commit-id: df5655f61c
2024-12-17 17:05:10 +00:00
zhaohu xing
cfb4c42ae4 support telechat2 model
Former-commit-id: 04f19ed0f3
2024-12-17 12:15:33 +00:00
hiyouga
235cdcacee support batch infer in vllm
Former-commit-id: 1324d158f9
2024-12-04 13:50:00 +00:00
hiyouga
1c3d86cd65 add qwq
Former-commit-id: 68a612115a
2024-11-28 08:50:57 +00:00
hiyouga
d51d96d594 add skywork o1
Former-commit-id: ec9ff8caa2
2024-11-27 05:51:59 +00:00
hiyouga
ab3782b0fa add marco-o1 and openo1 dataset
Former-commit-id: 17afb7d410
2024-11-27 04:20:23 +00:00
hiyouga
7eaafe08bc update readme
Former-commit-id: a89ad72d03
2024-11-23 19:27:18 +00:00
hiyouga
e99031daa4 fix inputs
Former-commit-id: 446441fdb0
2024-11-23 18:26:02 +00:00
steven
7f7ee0a660 support granite3 models
Former-commit-id: 6eefb4d7d2
2024-11-04 10:35:03 +08:00
hoshi-hiyouga
e3fb3c313c Merge pull request #5914 from hiyouga/hiyouga/dev_read
[misc] update readme

Former-commit-id: 04c10d2e80
2024-11-02 21:44:10 +08:00
hiyouga
f05685c7cf update readme
Former-commit-id: e7ed5091e1
2024-11-02 21:28:04 +08:00
hoshi-hiyouga
d99e164cad Merge branch 'main' into main
Former-commit-id: 5f14910910
2024-11-02 21:20:27 +08:00
Cuiyn
7806bde8ad Add support for Index
Former-commit-id: a15a69ab44
2024-11-02 13:45:27 +08:00
hiyouga
2eba98e152 add examples
Former-commit-id: e824b715ad
2024-11-01 08:41:54 +00:00
hiyouga
7487bd7b1f update readme
Former-commit-id: 2417b70a62
2024-10-30 09:14:01 +00:00
hoshi-hiyouga
5142faca8f Merge pull request #5581 from Kuangdd01/pixtral-patch
[WIP] Support Pixtral-12B

Former-commit-id: 9009a467e6
2024-10-29 22:29:10 +08:00
hoshi-hiyouga
2876b429bc Update README.md
Former-commit-id: 1b57df074a
2024-10-29 21:57:28 +08:00
hoshi-hiyouga
233556d1c7 Update README.md
Former-commit-id: a76478c127
2024-10-29 21:18:15 +08:00
grok
3e3969784f Update README.md
update english readme

Former-commit-id: 7627ef0908
2024-10-23 23:49:47 +08:00
hoshi-hiyouga
79433fb6a6 Update README.md
Former-commit-id: 1fea871835
2024-10-17 19:46:36 +08:00
Kingsley
8ea1c5c69e Merge branch 'hiyouga:main' into pixtral-patch
Former-commit-id: 95330893c5
2024-10-13 17:42:02 +08:00
hiyouga
e90a1199da tiny fix
Former-commit-id: 3af57795dd
2024-10-11 23:51:54 +08:00
huniu20
132c1f1b0f 1. add model and dataset info to support webui
Former-commit-id: 0f669f221a
2024-10-10 16:46:34 +08:00
huniu20
26e897e861 1. add modelers hub support
Former-commit-id: 24ebe187e3
2024-10-09 17:21:37 +08:00
Kingsley
5523a6fd2c Merge branch 'hiyouga:main' into pixtral-patch
Former-commit-id: 93a441a6b7
2024-10-08 21:04:08 +08:00