1841 Commits

Author SHA1 Message Date
Zhangchi Feng
f51ac40f0a Merge branch 'main' into minicpmv
Former-commit-id: fc045d7dd871985d621430b5662cba882188a59c
2025-01-10 20:12:07 +08:00
fzc8578
165fe8e219 add some
Former-commit-id: 096a6cb67a7dfd14a6e339d96baab78c12d36a87
2025-01-10 20:01:22 +08:00
hiyouga
b471def13d improve template, add phi4 model
Former-commit-id: ae16ea755d581a5a288fb55f12481215f369b255
2025-01-09 18:27:54 +00:00
hoshi-hiyouga
b777fed171 Merge pull request #6564 from stephen-nju/fix_ray
Fix ray

Former-commit-id: 6b34b69fa688c4622489d3d5f33d847fb6b95528
2025-01-08 18:14:18 +08:00
zhubin
014a7ea042 fix –get ray args when args not a dict
Former-commit-id: 9c4c84828b77acf48caf60726e4e7ef3e972118d
2025-01-08 10:06:02 +00:00
hiyouga
da542fad18 imporve log
Former-commit-id: 47e17dd689840ca9b3c5f34448e5f80265336cca
2025-01-08 09:56:10 +00:00
hiyouga
0c1ad5f3fb fix llamaboard with ray
Former-commit-id: c46675d5e56d175c27d705ef0068fb47dc89a872
2025-01-07 09:59:24 +00:00
hiyouga
b4174021d6 refactor ray integration, support save ckpt
Former-commit-id: d8cac6f54663e6cffeddf2c65e3da454e7b86a75
2025-01-07 09:39:10 +00:00
Eric Tang
bba52e258e run style check
Former-commit-id: 1e8e7be0a535e55888f58bbe2c38bc1c382e9012
2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
1217240918 drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

Former-commit-id: 163ddb680b6f84a4424a887a3b8a5d668044e87c
2025-01-07 08:55:44 +00:00
hiyouga
8c57169eb7 fix #6546
Former-commit-id: 870f23d7eaff1e32a73fee4eb972163c85ba7b67
2025-01-07 06:30:44 +00:00
fzc8578
b9eeaa9706 add some
Former-commit-id: 785cc70ff205f5962c3ca67f453589e4a471ba8c
2025-01-06 19:32:39 +08:00
Zhangchi Feng
a0188a430f Merge branch 'hiyouga:main' into minicpmv
Former-commit-id: ab87bd6b1398b379b1a7a95f01a6539743b9db2d
2025-01-04 11:20:33 +08:00
fzc8578
b5ef5059ee add some
Former-commit-id: 79c2d7090cbf364063ea3608814ab18aa27fdc87
2025-01-04 11:11:15 +08:00
hiyouga
528fb4f799 update model name
Former-commit-id: 4b8add728729d8e2ce4c9a3dc6748357291d8e8b
2025-01-02 12:19:21 +00:00
hiyouga
37c60c7d14 add gpt2 model
Former-commit-id: 67442bd497c75b0c5990d94a880e0e25474ae2fa
2025-01-02 12:07:38 +00:00
hiyouga
da8721a70e fix #6499
Former-commit-id: 1800f8c72dfa618c71c84a3a18ecdef4d82754f7
2025-01-02 11:28:54 +00:00
hiyouga
d0e729cd33 add deepseek3 model
Former-commit-id: e67b9dcc3ad0c003bc3afd7601ecd2adfbf9666b
2024-12-30 13:39:20 +00:00
hoshi-hiyouga
1178cb0e33 Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template

Former-commit-id: 91467ed313802ac3950c2e11a7d0997a36bcbddd
2024-12-30 21:08:25 +08:00
hiyouga
813f5919a3 fix #6482
Former-commit-id: 6f5bb3b8e5b6eb7fdfd7b0ca8eba789ab741a7b6
2024-12-30 06:03:07 +00:00
hiyouga
3bcb4633ca fix #6448
Former-commit-id: 27198679829fb766c7eef468ae4311fdced695a2
2024-12-27 16:54:39 +00:00
youkaichao
f6d5dd6f10 Update cli.py
Former-commit-id: c39d81cd1d108d832746e100ac890b2d4ecaa60e
2024-12-26 23:22:09 +08:00
hiyouga
c83b74ab9e add qvq #6439
Former-commit-id: ee0e400f417f648cd15cf48144df76e4809cc615
2024-12-25 07:52:41 +00:00
hiyouga
353259f03f update readme
Former-commit-id: 8fd38d273e5bc3b28a4741b230010fece87e7070
2024-12-23 14:08:59 +00:00
hoshi-hiyouga
8265d6a228 Merge pull request #5922 from Tuyohai/main
support granite3 models

Former-commit-id: c23a4d0658323434c386716c25855711202e37a9
2024-12-23 16:46:02 +08:00
hiyouga
47c2d91933 support report custom args
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
2024-12-21 21:42:45 +00:00
hiyouga
f07bad7144 fix paligemma infer
Former-commit-id: 84cd1188ac03c165e1a626db297936c2458627d6
2024-12-21 20:24:32 +00:00
hoshi-hiyouga
547f76e56e Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
Former-commit-id: 947e22a4a30d8eb7b612da53bbf538ead7dd27b7
2024-12-21 14:09:33 +08:00
ZeYi Lin
67d4757c35 fix: project blank
Former-commit-id: 82e5d75014ffe5fbe762711adecf59c94ab29f59
2024-12-20 18:26:02 +08:00
ZeYi Lin
cc703b58f5 fix: by hiyouga suggestion
Former-commit-id: 3a7ea2048a41eafc41fdca944e142f5a0f35a5b3
2024-12-20 16:43:03 +08:00
ZeYi Lin
8f786ee938 feat: ui improve
Former-commit-id: 5f6dafd70e962b8fe9a294d555133002135f80df
2024-12-20 11:03:02 +08:00
ZeYi Lin
03dba638e6 fix: text
Former-commit-id: 0a52962db365e7456c858a8e58c19313f19d1e09
2024-12-19 21:26:02 +08:00
ZeYi Lin
dd22454fc5 fix: bugs
Former-commit-id: d0eb64d5e3472a166c9adac4cb4ba06bdd663e46
2024-12-19 21:08:16 +08:00
ZeYi Lin
b512a06c3d docs: config framework
Former-commit-id: 7eb49e5ffaea59d8a2756ae7ff55bd57b9077f4b
2024-12-19 20:22:36 +08:00
ZeYi Lin
c31933ef9e fix: string
Former-commit-id: 330691962960fdd2053236e43a919e8f15e2bf27
2024-12-19 20:18:59 +08:00
hiyouga
8524dcaa4a fix #6391
Former-commit-id: d4c1fda1ad19e73484d8d51d81e490cdb8781955
2024-12-19 12:16:38 +00:00
ZeYi Lin
53103f55b6 feat: optimize frontend
Former-commit-id: 8c2df41b937f491f7ebf593b20c65a19738c7642
2024-12-19 19:04:19 +08:00
ZeYi Lin
cc5cde734b feat: swanlab params
Former-commit-id: d5cf87990e5bea920ecd1561def09fa17cf328b1
2024-12-19 18:47:27 +08:00
hiyouga
95d3c2620b support disable shuffling
Former-commit-id: c7cedc7569973a2879c689637b2923e8b26f1a81
2024-12-19 08:53:21 +00:00
hiyouga
1a48340680 add swanlab
Former-commit-id: 96f8f103e58a8ff307b0ce36c967de04f452434a
2024-12-19 07:12:31 +00:00
hiyouga
92a0d08e27 fix webui
Former-commit-id: 369cca8110e6923ad9978b6b93928a3bcb5c6f30
2024-12-19 06:48:03 +00:00
hiyouga
433d116080 add paligemma2
Former-commit-id: d3509050dc4d3105a6e62acc9a1ba481269279a2
2024-12-18 08:57:26 +00:00
hoshi-hiyouga
d43080b534 Merge pull request #6313 from ge-xing/main
support telechat2 model

Former-commit-id: 015f2137887bb9f27fcb0d6cc67ef729aad4031e
2024-12-18 16:16:17 +08:00
hiyouga
a421113466 support qwen tool format
Former-commit-id: 98795854e3fda7b0c0bc209b3e2496b0036e154e
2024-12-17 20:12:06 +00:00
hiyouga
acd62fddb8 change default replace jinja to false
Former-commit-id: bcc413cf64cbee068e2f19475ce7919c65284489
2024-12-17 19:27:10 +00:00
ylfeng
857d23b324 Support Mistral format tools
Former-commit-id: 115924af47496daa747a018952b6a32ccbd9cecb
2024-12-17 19:13:26 +00:00
hiyouga
f6a2bfc0e8 fix llama3 tool template
Former-commit-id: df5655f61cb847dc2d9eb7b34266b20343ff90d6
2024-12-17 17:05:10 +00:00
hoshi-hiyouga
1cc24ed206 Merge pull request #6367 from hiyouga/hiyouga/add_model
[model&template] add llama3.3 & support llama3 tool prompt

Former-commit-id: e12c80ace8b59a9556ee40f5b810f233f9b8174a
2024-12-18 00:13:28 +08:00
hiyouga
a935933bed support llama3 tool prompt
Former-commit-id: b24ae55ebf548db904a9fe1876192024d8a96108
2024-12-17 15:52:37 +00:00
Yaser Afshar
76ebd62ac1 Add missing key to init_kwargs
Former-commit-id: 1c8ad22a5f167bf4e1c845e273583e5cb3a0214e
2024-12-17 12:34:05 +00:00