2530 Commits

Author SHA1 Message Date
hiyouga
20a9565e36 update scripts
Former-commit-id: dd44c65d7f60cb6f5d0e0d8ee5f4e7643defb89b
2025-01-03 10:50:32 +00:00
hoshi-hiyouga
85317bcbaf Merge pull request #6515 from hiyouga/hiyouga/misc
[misc] update model name

Former-commit-id: 51ef90ce0ace4a45f9c01ba7e674adf5e3c92baa
2025-01-02 20:20:02 +08:00
hiyouga
528fb4f799 update model name
Former-commit-id: 4b8add728729d8e2ce4c9a3dc6748357291d8e8b
2025-01-02 12:19:21 +00:00
hoshi-hiyouga
aa7ec44367 Merge pull request #6514 from hiyouga/hiyouga/add_project
[readme] add project

Former-commit-id: a766cad5d49f226eb61a550bc3d157870c1068cc
2025-01-02 20:16:15 +08:00
hoshi-hiyouga
b2ecb80729 Merge pull request #6513 from hiyouga/hiyouga/add_gpt2
[model] add gpt2 model

Former-commit-id: 29ddc6b77862f740570a00d3b8ea548ee1a2ce03
2025-01-02 20:15:55 +08:00
hiyouga
9a3afbd5d1 add project
Former-commit-id: b3e1137fbbdfa4cc081903983fea36acff7afd75
2025-01-02 12:15:41 +00:00
hiyouga
37c60c7d14 add gpt2 model
Former-commit-id: 67442bd497c75b0c5990d94a880e0e25474ae2fa
2025-01-02 12:07:38 +00:00
hoshi-hiyouga
b921dde749 Merge pull request #6512 from hiyouga/hiyouga/fix_gen_logic
[trainer] fix generate logic

Former-commit-id: 72d86ecc9e327933a0a2c893b8ffd2740c99be6b
2025-01-02 19:36:54 +08:00
hoshi-hiyouga
d195329185 Merge pull request #6462 from shibingli/main
Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building

Former-commit-id: 8741e5b3e87a392a3c9d50455e4916c3a938fb24
2025-01-02 19:34:17 +08:00
hiyouga
da8721a70e fix #6499
Former-commit-id: 1800f8c72dfa618c71c84a3a18ecdef4d82754f7
2025-01-02 11:28:54 +00:00
hoshi-hiyouga
f318dc9464 Merge pull request #6493 from hiyouga/hiyouga/upd_wechat
[assets] update wechat

Former-commit-id: f8e80d566f7666b6af00360df97065698a1d3a9f
2024-12-30 21:55:03 +08:00
hiyouga
01bbe66f41 update wechat
Former-commit-id: a400d896a18e317acdbd3c79282c81b50cc2c54d
2024-12-30 13:54:22 +00:00
hoshi-hiyouga
bb664d2fc5 Merge pull request #6492 from hiyouga/hiyouga/add_deepseek3
[model] add deepseek3 model

Former-commit-id: 2382a5f0317d768ba8f4931977f5caed6057b3c0
2024-12-30 21:50:13 +08:00
hiyouga
d0e729cd33 add deepseek3 model
Former-commit-id: e67b9dcc3ad0c003bc3afd7601ecd2adfbf9666b
2024-12-30 13:39:20 +00:00
hoshi-hiyouga
1178cb0e33 Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template

Former-commit-id: 91467ed313802ac3950c2e11a7d0997a36bcbddd
2024-12-30 21:08:25 +08:00
hoshi-hiyouga
089f824cd1 Merge pull request #6483 from hiyouga/hiyouga/fix_paligemma_infer
[model] update vllm & fix paligemma dtype

Former-commit-id: 40805b0cc0cff478703f68067a330ba307bb5809
2024-12-30 16:34:32 +08:00
hiyouga
813f5919a3 fix #6482
Former-commit-id: 6f5bb3b8e5b6eb7fdfd7b0ca8eba789ab741a7b6
2024-12-30 06:03:07 +00:00
hoshi-hiyouga
951d845af2 Merge pull request #6465 from hiyouga/hiyouga/fix_eval_loss
[trainer] fix eval loss

Former-commit-id: b55890291b0049dd90ef4d1d0bf0ba1efb1e4f0a
2024-12-28 01:02:56 +08:00
hiyouga
3bcb4633ca fix #6448
Former-commit-id: 27198679829fb766c7eef468ae4311fdced695a2
2024-12-27 16:54:39 +00:00
shibingli@yeah.net
c76c33ddb1 Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building.
Former-commit-id: f1d76786e094562f6f095a0b56c9c6cd32e2fa5e
2024-12-27 18:31:14 +08:00
shibingli@yeah.net
a37ef0eaae Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building.This commit introduces an ARG parameter named HTTP_PROXY in the Dockerfile. This addition allows for the configuration of an HTTP proxy, facilitating image building in environments with network restrictions.
Former-commit-id: a3a49b1ea477313c979a1649ee6a7f843fe36469
2024-12-27 18:17:17 +08:00
hoshi-hiyouga
377dfe5665 Merge pull request #6457 from youkaichao/module-run
[misc] enable module run

Former-commit-id: f68074d87bcc915a49a8765b3ebb32d935aa5445
2024-12-26 23:41:37 +08:00
youkaichao
f6d5dd6f10 Update cli.py
Former-commit-id: c39d81cd1d108d832746e100ac890b2d4ecaa60e
2024-12-26 23:22:09 +08:00
hoshi-hiyouga
a36f9d923e Merge pull request #6443 from hiyouga/hiyouga/add_qvq
[modle] add qvq

Former-commit-id: cd56f88ff2c5c3edc381f3807f466621cee86b67
2024-12-25 15:53:19 +08:00
hiyouga
c83b74ab9e add qvq #6439
Former-commit-id: ee0e400f417f648cd15cf48144df76e4809cc615
2024-12-25 07:52:41 +00:00
hoshi-hiyouga
c5780f5eaa Merge pull request #6430 from hiyouga/hiyouga/upd_wechat
[assets] update wechat

Former-commit-id: cbd494ddaf692faf83d4825fe4b4595430b111f5
2024-12-24 16:13:20 +08:00
hiyouga
4cd1d05429 update wechat
Former-commit-id: 83202c9027222b83c949d1fe1bff1317f5715015
2024-12-24 08:12:53 +00:00
hoshi-hiyouga
459219a260 Merge pull request #6426 from hiyouga/hiyouga/update_readme
[assets] update readme

Former-commit-id: b9f73fc5caf5753bd5b96de5383eaf80cd958e3d
2024-12-23 22:17:19 +08:00
hiyouga
353259f03f update readme
Former-commit-id: 8fd38d273e5bc3b28a4741b230010fece87e7070
2024-12-23 14:08:59 +00:00
hoshi-hiyouga
8265d6a228 Merge pull request #5922 from Tuyohai/main
support granite3 models

Former-commit-id: c23a4d0658323434c386716c25855711202e37a9
2024-12-23 16:46:02 +08:00
hoshi-hiyouga
c0418062c0 Merge pull request #6418 from hiyouga/hiyouga/add_report
[trainer] add custom args to experimental logger

Former-commit-id: d58746eca203d97ec57abbc312ecf4c00b5d5535
2024-12-22 05:47:55 +08:00
hiyouga
47c2d91933 support report custom args
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
2024-12-21 21:42:45 +00:00
hiyouga
f07bad7144 fix paligemma infer
Former-commit-id: 84cd1188ac03c165e1a626db297936c2458627d6
2024-12-21 20:24:32 +00:00
hoshi-hiyouga
9d437a5f4f Merge pull request #6416 from Zeyi-Lin/main
docs: use swanlab
Former-commit-id: a2ad0738a22f71af453a7f266c350ff7662bf67c
2024-12-22 04:08:26 +08:00
ZeYi Lin
1c1d6bea43 docs: use swanlab
Former-commit-id: 744ef8c2688efad82028e22683e6c9d874af6823
2024-12-21 20:59:25 +08:00
hoshi-hiyouga
547f76e56e Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
Former-commit-id: 947e22a4a30d8eb7b612da53bbf538ead7dd27b7
2024-12-21 14:09:33 +08:00
ZeYi Lin
67d4757c35 fix: project blank
Former-commit-id: 82e5d75014ffe5fbe762711adecf59c94ab29f59
2024-12-20 18:26:02 +08:00
ZeYi Lin
cc703b58f5 fix: by hiyouga suggestion
Former-commit-id: 3a7ea2048a41eafc41fdca944e142f5a0f35a5b3
2024-12-20 16:43:03 +08:00
ZeYi Lin
8f786ee938 feat: ui improve
Former-commit-id: 5f6dafd70e962b8fe9a294d555133002135f80df
2024-12-20 11:03:02 +08:00
ZeYi Lin
03dba638e6 fix: text
Former-commit-id: 0a52962db365e7456c858a8e58c19313f19d1e09
2024-12-19 21:26:02 +08:00
ZeYi Lin
dd22454fc5 fix: bugs
Former-commit-id: d0eb64d5e3472a166c9adac4cb4ba06bdd663e46
2024-12-19 21:08:16 +08:00
hoshi-hiyouga
904f18b4a2 Merge pull request #6395 from hiyouga/hiyouga/fix_genkwargs
[generate] fix generate kwargs

Former-commit-id: c6e3c14a93490df6e7f8099e09d62d602a667542
2024-12-19 20:24:17 +08:00
ZeYi Lin
b512a06c3d docs: config framework
Former-commit-id: 7eb49e5ffaea59d8a2756ae7ff55bd57b9077f4b
2024-12-19 20:22:36 +08:00
ZeYi Lin
c31933ef9e fix: string
Former-commit-id: 330691962960fdd2053236e43a919e8f15e2bf27
2024-12-19 20:18:59 +08:00
hiyouga
8524dcaa4a fix #6391
Former-commit-id: d4c1fda1ad19e73484d8d51d81e490cdb8781955
2024-12-19 12:16:38 +00:00
ZeYi Lin
53103f55b6 feat: optimize frontend
Former-commit-id: 8c2df41b937f491f7ebf593b20c65a19738c7642
2024-12-19 19:04:19 +08:00
ZeYi Lin
cc5cde734b feat: swanlab params
Former-commit-id: d5cf87990e5bea920ecd1561def09fa17cf328b1
2024-12-19 18:47:27 +08:00
hoshi-hiyouga
af9ef037dd Merge pull request #6388 from hiyouga/hiyouga/shuffle_control
[trainer] support disable shuffling

Former-commit-id: ffbb4dbdb09ba799af1800c78b2e9d669bccd24b
2024-12-19 17:00:12 +08:00
hiyouga
95d3c2620b support disable shuffling
Former-commit-id: c7cedc7569973a2879c689637b2923e8b26f1a81
2024-12-19 08:53:21 +00:00
hiyouga
1a48340680 add swanlab
Former-commit-id: 96f8f103e58a8ff307b0ce36c967de04f452434a
2024-12-19 07:12:31 +00:00