2768 Commits

Author SHA1 Message Date
hoshi-hiyouga
bb664d2fc5 Merge pull request #6492 from hiyouga/hiyouga/add_deepseek3
[model] add deepseek3 model

Former-commit-id: 2382a5f0317d768ba8f4931977f5caed6057b3c0
2024-12-30 21:50:13 +08:00
hiyouga
d0e729cd33 add deepseek3 model
Former-commit-id: e67b9dcc3ad0c003bc3afd7601ecd2adfbf9666b
2024-12-30 13:39:20 +00:00
hoshi-hiyouga
1178cb0e33 Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template

Former-commit-id: 91467ed313802ac3950c2e11a7d0997a36bcbddd
2024-12-30 21:08:25 +08:00
hoshi-hiyouga
089f824cd1 Merge pull request #6483 from hiyouga/hiyouga/fix_paligemma_infer
[model] update vllm & fix paligemma dtype

Former-commit-id: 40805b0cc0cff478703f68067a330ba307bb5809
2024-12-30 16:34:32 +08:00
hiyouga
813f5919a3 fix #6482
Former-commit-id: 6f5bb3b8e5b6eb7fdfd7b0ca8eba789ab741a7b6
2024-12-30 06:03:07 +00:00
hoshi-hiyouga
951d845af2 Merge pull request #6465 from hiyouga/hiyouga/fix_eval_loss
[trainer] fix eval loss

Former-commit-id: b55890291b0049dd90ef4d1d0bf0ba1efb1e4f0a
2024-12-28 01:02:56 +08:00
hiyouga
3bcb4633ca fix #6448
Former-commit-id: 27198679829fb766c7eef468ae4311fdced695a2
2024-12-27 16:54:39 +00:00
shibingli@yeah.net
c76c33ddb1 Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building.
Former-commit-id: f1d76786e094562f6f095a0b56c9c6cd32e2fa5e
2024-12-27 18:31:14 +08:00
shibingli@yeah.net
a37ef0eaae Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building. This commit introduces an ARG parameter named HTTP_PROXY in the Dockerfile. This addition allows for the configuration of an HTTP proxy, facilitating image building in environments with network restrictions.
Former-commit-id: a3a49b1ea477313c979a1649ee6a7f843fe36469
2024-12-27 18:17:17 +08:00
hoshi-hiyouga
377dfe5665 Merge pull request #6457 from youkaichao/module-run
[misc] enable module run

Former-commit-id: f68074d87bcc915a49a8765b3ebb32d935aa5445
2024-12-26 23:41:37 +08:00
youkaichao
f6d5dd6f10 Update cli.py
Former-commit-id: c39d81cd1d108d832746e100ac890b2d4ecaa60e
2024-12-26 23:22:09 +08:00
hoshi-hiyouga
a36f9d923e Merge pull request #6443 from hiyouga/hiyouga/add_qvq
[modle] add qvq

Former-commit-id: cd56f88ff2c5c3edc381f3807f466621cee86b67
2024-12-25 15:53:19 +08:00
hiyouga
c83b74ab9e add qvq #6439
Former-commit-id: ee0e400f417f648cd15cf48144df76e4809cc615
2024-12-25 07:52:41 +00:00
hoshi-hiyouga
c5780f5eaa Merge pull request #6430 from hiyouga/hiyouga/upd_wechat
[assets] update wechat

Former-commit-id: cbd494ddaf692faf83d4825fe4b4595430b111f5
2024-12-24 16:13:20 +08:00
hiyouga
4cd1d05429 update wechat
Former-commit-id: 83202c9027222b83c949d1fe1bff1317f5715015
2024-12-24 08:12:53 +00:00
hoshi-hiyouga
459219a260 Merge pull request #6426 from hiyouga/hiyouga/update_readme
[assets] update readme

Former-commit-id: b9f73fc5caf5753bd5b96de5383eaf80cd958e3d
2024-12-23 22:17:19 +08:00
hiyouga
353259f03f update readme
Former-commit-id: 8fd38d273e5bc3b28a4741b230010fece87e7070
2024-12-23 14:08:59 +00:00
hoshi-hiyouga
8265d6a228 Merge pull request #5922 from Tuyohai/main
support granite3 models

Former-commit-id: c23a4d0658323434c386716c25855711202e37a9
2024-12-23 16:46:02 +08:00
hoshi-hiyouga
c0418062c0 Merge pull request #6418 from hiyouga/hiyouga/add_report
[trainer] add custom args to experimental logger

Former-commit-id: d58746eca203d97ec57abbc312ecf4c00b5d5535
2024-12-22 05:47:55 +08:00
hiyouga
47c2d91933 support report custom args
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
2024-12-21 21:42:45 +00:00
hiyouga
f07bad7144 fix paligemma infer
Former-commit-id: 84cd1188ac03c165e1a626db297936c2458627d6
2024-12-21 20:24:32 +00:00
hoshi-hiyouga
9d437a5f4f Merge pull request #6416 from Zeyi-Lin/main
docs: use swanlab
Former-commit-id: a2ad0738a22f71af453a7f266c350ff7662bf67c
2024-12-22 04:08:26 +08:00
ZeYi Lin
1c1d6bea43 docs: use swanlab
Former-commit-id: 744ef8c2688efad82028e22683e6c9d874af6823
2024-12-21 20:59:25 +08:00
hoshi-hiyouga
547f76e56e Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
Former-commit-id: 947e22a4a30d8eb7b612da53bbf538ead7dd27b7
2024-12-21 14:09:33 +08:00
ZeYi Lin
67d4757c35 fix: project blank
Former-commit-id: 82e5d75014ffe5fbe762711adecf59c94ab29f59
2024-12-20 18:26:02 +08:00
ZeYi Lin
cc703b58f5 fix: by hiyouga suggestion
Former-commit-id: 3a7ea2048a41eafc41fdca944e142f5a0f35a5b3
2024-12-20 16:43:03 +08:00
ZeYi Lin
8f786ee938 feat: ui improve
Former-commit-id: 5f6dafd70e962b8fe9a294d555133002135f80df
2024-12-20 11:03:02 +08:00
ZeYi Lin
03dba638e6 fix: text
Former-commit-id: 0a52962db365e7456c858a8e58c19313f19d1e09
2024-12-19 21:26:02 +08:00
ZeYi Lin
dd22454fc5 fix: bugs
Former-commit-id: d0eb64d5e3472a166c9adac4cb4ba06bdd663e46
2024-12-19 21:08:16 +08:00
hoshi-hiyouga
904f18b4a2 Merge pull request #6395 from hiyouga/hiyouga/fix_genkwargs
[generate] fix generate kwargs

Former-commit-id: c6e3c14a93490df6e7f8099e09d62d602a667542
2024-12-19 20:24:17 +08:00
ZeYi Lin
b512a06c3d docs: config framework
Former-commit-id: 7eb49e5ffaea59d8a2756ae7ff55bd57b9077f4b
2024-12-19 20:22:36 +08:00
ZeYi Lin
c31933ef9e fix: string
Former-commit-id: 330691962960fdd2053236e43a919e8f15e2bf27
2024-12-19 20:18:59 +08:00
hiyouga
8524dcaa4a fix #6391
Former-commit-id: d4c1fda1ad19e73484d8d51d81e490cdb8781955
2024-12-19 12:16:38 +00:00
ZeYi Lin
53103f55b6 feat: optimize frontend
Former-commit-id: 8c2df41b937f491f7ebf593b20c65a19738c7642
2024-12-19 19:04:19 +08:00
ZeYi Lin
cc5cde734b feat: swanlab params
Former-commit-id: d5cf87990e5bea920ecd1561def09fa17cf328b1
2024-12-19 18:47:27 +08:00
hoshi-hiyouga
af9ef037dd Merge pull request #6388 from hiyouga/hiyouga/shuffle_control
[trainer] support disable shuffling

Former-commit-id: ffbb4dbdb09ba799af1800c78b2e9d669bccd24b
2024-12-19 17:00:12 +08:00
hiyouga
95d3c2620b support disable shuffling
Former-commit-id: c7cedc7569973a2879c689637b2923e8b26f1a81
2024-12-19 08:53:21 +00:00
hiyouga
1a48340680 add swanlab
Former-commit-id: 96f8f103e58a8ff307b0ce36c967de04f452434a
2024-12-19 07:12:31 +00:00
hoshi-hiyouga
d6ce1045f7 Merge pull request #6384 from hiyouga/hiyouga/fix_webui
[webui] fix webui args

Former-commit-id: 6ccd64ecd9089d8694decde3186836bd8e5df511
2024-12-19 14:57:52 +08:00
hiyouga
92a0d08e27 fix webui
Former-commit-id: 369cca8110e6923ad9978b6b93928a3bcb5c6f30
2024-12-19 06:48:03 +00:00
hoshi-hiyouga
910884065e Merge pull request #6379 from hiyouga/hiyouga/add_paligemma2
[model] add paligemma2

Former-commit-id: 933647e6806428a608c2f0fd90b8ea1ea84cdd89
2024-12-18 17:03:11 +08:00
hiyouga
433d116080 add paligemma2
Former-commit-id: d3509050dc4d3105a6e62acc9a1ba481269279a2
2024-12-18 08:57:26 +00:00
hoshi-hiyouga
d43080b534 Merge pull request #6313 from ge-xing/main
support telechat2 model

Former-commit-id: 015f2137887bb9f27fcb0d6cc67ef729aad4031e
2024-12-18 16:16:17 +08:00
hoshi-hiyouga
5f0dd86c15 Merge pull request #6369 from hiyouga/hiyouga/template
[template] support qwen2 tool template

Former-commit-id: af336275021cd6aee3fe9f67b9ac9bcd1276de7c
2024-12-18 04:23:49 +08:00
hiyouga
a421113466 support qwen tool format
Former-commit-id: 98795854e3fda7b0c0bc209b3e2496b0036e154e
2024-12-17 20:12:06 +00:00
hiyouga
acd62fddb8 change default replace jinja to false
Former-commit-id: bcc413cf64cbee068e2f19475ce7919c65284489
2024-12-17 19:27:10 +00:00
hoshi-hiyouga
d8f6569be1 Merge pull request #5473 from AlongWY/mistral
Support Mistral format tools

Former-commit-id: 2fad3792d98f4181ae23e861c3d050fe1bcd8e4e
2024-12-18 03:23:24 +08:00
ylfeng
857d23b324 Support Mistral format tools
Former-commit-id: 115924af47496daa747a018952b6a32ccbd9cecb
2024-12-17 19:13:26 +00:00
hoshi-hiyouga
ad00c793ce Merge pull request #6368 from hiyouga/hiyouga/fix_llama_template
[template] fix llama3 tool template

Former-commit-id: 8974a0a185daf7744b4d3a0b2776f9bd72e24426
2024-12-18 01:10:48 +08:00
hiyouga
f6a2bfc0e8 fix llama3 tool template
Former-commit-id: df5655f61cb847dc2d9eb7b34266b20343ff90d6
2024-12-17 17:05:10 +00:00