hoshi-hiyouga
9ef85f8fc4
[optim] clean apollo ( #6645 )
* clean apollo code
* update readme
Former-commit-id: 7a04021d0461caea2c7b82169839340b7f51f463
2025-01-15 01:42:50 +08:00
Zhangchi Feng
57043fb4e6
update readme of MiniCPM-o ( #6642 )
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
Former-commit-id: 9b7ba093c7e017ea18a4562550d5d2e82c4a0161
2025-01-14 21:22:35 +08:00
hoshi-hiyouga
91433d639c
lint ( #6641 )
Former-commit-id: 1278c3e92eeb297e883aab89e2384c1df1d0e910
2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)
864ee06243
Support InternLM3 Dense 8B Model ( #6640 )
* support internlm3
* update
* update
* update
* add hint
Former-commit-id: deacc00b1226ca3d53bf7bb1231cf276eaa8296b
2025-01-14 18:07:27 +08:00
Zhangchi Feng
201a495154
Support new features of MiniCPM-V ( #6626 )
* fix template name
* tiny fix
* support minicpm-o-2.6
Former-commit-id: c3fda5046d835ba4542d525b8d89cd12838e9f4c
2025-01-14 00:26:19 +08:00
hoshi-hiyouga
d8cba9464f
[inference] fix stop token for object detection ( #6624 )
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689c54ebb2af264de808502e5a8ba0f2b
2025-01-13 21:34:20 +08:00
codingma
089c7d5e51
add nf4 qlora support on Ascend NPU ( #6601 )
* add nf4 qlora support on Ascend NPU
* add transformers version check
* add python>=3.10 requirement description for npu
* tiny fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 03de5ac912336190d6b3583f70b6340ab9cf9cdf
2025-01-13 19:43:36 +08:00
Zhangchi Feng
15bba15725
Fix template name of MiniCPM-V ( #6620 )
* fix template name
* tiny fix
Former-commit-id: 3077f20339158564009270edf79c8ef1b10e8b4a
2025-01-13 16:46:48 +08:00
fzc8578
9213e48fa2
add minicpmv2.6
Former-commit-id: e45329e7456b647d5684b1f9428641ad18af92d1
2025-01-10 23:45:44 +08:00
hiyouga
b471def13d
improve template, add phi4 model
Former-commit-id: ae16ea755d581a5a288fb55f12481215f369b255
2025-01-09 18:27:54 +00:00
hiyouga
528fb4f799
update model name
Former-commit-id: 4b8add728729d8e2ce4c9a3dc6748357291d8e8b
2025-01-02 12:19:21 +00:00
hoshi-hiyouga
aa7ec44367
Merge pull request #6514 from hiyouga/hiyouga/add_project
[readme] add project
Former-commit-id: a766cad5d49f226eb61a550bc3d157870c1068cc
2025-01-02 20:16:15 +08:00
hiyouga
9a3afbd5d1
add project
Former-commit-id: b3e1137fbbdfa4cc081903983fea36acff7afd75
2025-01-02 12:15:41 +00:00
hiyouga
37c60c7d14
add gpt2 model
Former-commit-id: 67442bd497c75b0c5990d94a880e0e25474ae2fa
2025-01-02 12:07:38 +00:00
hiyouga
d0e729cd33
add deepseek3 model
Former-commit-id: e67b9dcc3ad0c003bc3afd7601ecd2adfbf9666b
2024-12-30 13:39:20 +00:00
hiyouga
c83b74ab9e
add qvq #6439
Former-commit-id: ee0e400f417f648cd15cf48144df76e4809cc615
2024-12-25 07:52:41 +00:00
hiyouga
353259f03f
update readme
Former-commit-id: 8fd38d273e5bc3b28a4741b230010fece87e7070
2024-12-23 14:08:59 +00:00
hoshi-hiyouga
8265d6a228
Merge pull request #5922 from Tuyohai/main
support granite3 models
Former-commit-id: c23a4d0658323434c386716c25855711202e37a9
2024-12-23 16:46:02 +08:00
hiyouga
47c2d91933
support report custom args
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
2024-12-21 21:42:45 +00:00
ZeYi Lin
1c1d6bea43
docs: use swanlab
Former-commit-id: 744ef8c2688efad82028e22683e6c9d874af6823
2024-12-21 20:59:25 +08:00
hiyouga
433d116080
add paligemma2
Former-commit-id: d3509050dc4d3105a6e62acc9a1ba481269279a2
2024-12-18 08:57:26 +00:00
hoshi-hiyouga
d43080b534
Merge pull request #6313 from ge-xing/main
support telechat2 model
Former-commit-id: 015f2137887bb9f27fcb0d6cc67ef729aad4031e
2024-12-18 16:16:17 +08:00
hiyouga
f6a2bfc0e8
fix llama3 tool template
Former-commit-id: df5655f61cb847dc2d9eb7b34266b20343ff90d6
2024-12-17 17:05:10 +00:00
zhaohu xing
cfb4c42ae4
support telechat2 model
Former-commit-id: 04f19ed0f36e691d89ccb7ac19bae70c59640aaa
2024-12-17 12:15:33 +00:00
hiyouga
235cdcacee
support batch infer in vllm
Former-commit-id: 1324d158f954d777f1fbf09f46149c372704b388
2024-12-04 13:50:00 +00:00
hiyouga
1c3d86cd65
add qwq
Former-commit-id: 68a612115aebba51695d22be4397c16c86f3b40a
2024-11-28 08:50:57 +00:00
hiyouga
d51d96d594
add skywork o1
Former-commit-id: ec9ff8caa2637965d41937cce7de4e4d51d054eb
2024-11-27 05:51:59 +00:00
hiyouga
ab3782b0fa
add marco-o1 and openo1 dataset
Former-commit-id: 17afb7d4103499a9a090a6624896cfa123e9e1d6
2024-11-27 04:20:23 +00:00
hiyouga
7eaafe08bc
update readme
Former-commit-id: a89ad72d039d03836f98625eaf438f332368a823
2024-11-23 19:27:18 +00:00
hiyouga
e99031daa4
fix inputs
Former-commit-id: 446441fdb020b5a102480251cb8536dd8b3f8f99
2024-11-23 18:26:02 +00:00
steven
7f7ee0a660
support granite3 models
Former-commit-id: 6eefb4d7d25879db42cefae8332ca9db88bff851
2024-11-04 10:35:03 +08:00
hoshi-hiyouga
e3fb3c313c
Merge pull request #5914 from hiyouga/hiyouga/dev_read
[misc] update readme
Former-commit-id: 04c10d2e80b7f7e516eba67fea420498a1238bb5
2024-11-02 21:44:10 +08:00
hiyouga
f05685c7cf
update readme
Former-commit-id: e7ed5091e1f8fb35e458f368558ceac71c6983b4
2024-11-02 21:28:04 +08:00
hoshi-hiyouga
d99e164cad
Merge branch 'main' into main
Former-commit-id: 5f14910910154ba569435e7e68acbd6c30f79e80
2024-11-02 21:20:27 +08:00
Cuiyn
7806bde8ad
Add support for Index
Former-commit-id: a15a69ab4417c6f3273c874cf7ee2c34a5a64141
2024-11-02 13:45:27 +08:00
hiyouga
2eba98e152
add examples
Former-commit-id: e824b715ad4bf885241b245b12d75563adab2e26
2024-11-01 08:41:54 +00:00
hiyouga
7487bd7b1f
update readme
Former-commit-id: 2417b70a620ec3bba7581c1a444e09c2440a58a0
2024-10-30 09:14:01 +00:00
hoshi-hiyouga
5142faca8f
Merge pull request #5581 from Kuangdd01/pixtral-patch
[WIP] Support Pixtral-12B
Former-commit-id: 9009a467e621a17ad9fa25bb30fb9ac9ee15df97
2024-10-29 22:29:10 +08:00
hoshi-hiyouga
2876b429bc
Update README.md
Former-commit-id: 1b57df074ab4deb29749086ccb10b459eebf5143
2024-10-29 21:57:28 +08:00
hoshi-hiyouga
233556d1c7
Update README.md
Former-commit-id: a76478c127bc98749079fbc7e5aacd6e60648f37
2024-10-29 21:18:15 +08:00
grok
3e3969784f
Update README.md
update english readme
Former-commit-id: 7627ef09088ecbc234c08c0cb4743cbaee576b76
2024-10-23 23:49:47 +08:00
hoshi-hiyouga
79433fb6a6
Update README.md
Former-commit-id: 1fea87183561559f140f8de9b869e893ff8a3378
2024-10-17 19:46:36 +08:00
Kingsley
8ea1c5c69e
Merge branch 'hiyouga:main' into pixtral-patch
Former-commit-id: 95330893c5cd290430a0a2a4e4afa87afab2eb88
2024-10-13 17:42:02 +08:00
hiyouga
e90a1199da
tiny fix
Former-commit-id: 3af57795dda5d236200bad4aa3f2e29ae8930fe2
2024-10-11 23:51:54 +08:00
huniu20
132c1f1b0f
1. add model and dataset info to support webui
Former-commit-id: 0f669f221a31622ec7a53d0baab5da6a7891f9b6
2024-10-10 16:46:34 +08:00
huniu20
26e897e861
1. add modelers hub support
Former-commit-id: 24ebe187e360753666b768685a0dcc78054bb702
2024-10-09 17:21:37 +08:00
Kingsley
5523a6fd2c
Merge branch 'hiyouga:main' into pixtral-patch
Former-commit-id: 93a441a6b746e9a933dad8c45553fb5b68bf2b34
2024-10-08 21:04:08 +08:00
hiyouga
74653597f1
update readme
Former-commit-id: 1a7483c1a5fb49dba660f21beb45784ebd829c92
2024-10-07 11:31:18 +08:00
Kingsley
dd2d1c3154
unfactor md
Former-commit-id: c668568bc73914ba071a4121c4fec1ee7f2ab76c
2024-09-30 23:36:16 +08:00
Kingsley
94ce8f561f
fix some errors due to inconsistency of model cards
Former-commit-id: 2166b9bc6ba35760ff85b63620af9fa0213a4c78
2024-09-30 19:58:34 +08:00