60 Commits

Author SHA1 Message Date
hoshi-hiyouga
dc8cca11b3
[deps] upgrade transformers (#8159) 2025-05-26 22:03:58 +08:00
hoshi-hiyouga
b0c8ba73e0
[deps] update to transformers 4.52 (#8125) 2025-05-21 05:16:18 +08:00
hoshi-hiyouga
0a0cfeb782
[breaking] bump transformers to 4.45.0 & improve ci (#7746)
* update ci

* fix

* fix

* fix

* fix

* fix
2025-04-17 02:36:48 +08:00
hoshi-hiyouga
3ef36d0057
[misc] upgrade cli (#7714) 2025-04-14 15:41:22 +08:00
hoshi-hiyouga
7c1640ed5f
[misc] upgrade format to py39 (#7256) 2025-03-12 00:08:41 +08:00
Ze-Yi LIN
210cdb9557 [webui] display swanlab exp link (#7089)
* webui add swanlab link

* change callback name

* update

---------

Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 891c4875039e8e3b7d0de025ee61c4ff003ff0c4
2025-02-27 19:40:54 +08:00
hoshi-hiyouga
1fee69f874 [misc] update license year & fix llama pro (#6814)
* fix llamapro script

* change year

Former-commit-id: e2dc5b952aa22835d5220ba624f44676138b65ac
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
f6779b0e0c [breaking] support transformers 4.48 (#6628)
Former-commit-id: 15357cdad953bba1f2d294819f56b9746ed1b891
2025-01-31 01:36:33 +08:00
hoshi-hiyouga
245de012ca [webui] improve webui & reasoning mode (#6778)
Former-commit-id: 45e68b9f092879dda55023ebbcd8cf4660e3045a
2025-01-31 00:09:21 +08:00
hoshi-hiyouga
1efe525df7 [model] support yarn (#6693)
Former-commit-id: 1f47b6186c267de86cbdbd47ba2adbf1f9db7f39
2025-01-18 13:56:09 +08:00
hoshi-hiyouga
770433fa33 [webui] upgrade to gradio 5 (#6688)
Former-commit-id: 4d0f662dbe227ab0da11a1e109f7a2c5ab8f70b9
2025-01-17 20:15:42 +08:00
hoshi-hiyouga
9ef85f8fc4 [optim] clean apollo (#6645)
* clean apollo code

* update readme

Former-commit-id: 7a04021d0461caea2c7b82169839340b7f51f463
2025-01-15 01:42:50 +08:00
zhuHQ
763f9b9df0 [optim] add support to APOLLO (#6617)
Former-commit-id: d9189f9f0b23ff6929044919208e0e813ca95b1c
2025-01-15 00:24:56 +08:00
hoshi-hiyouga
d8cba9464f [inference] fix stop token for object detection (#6624)
* fix stop token

* update minicpm data pipeline

* fix npu qlora examples

Former-commit-id: e3e2c8c689c54ebb2af264de808502e5a8ba0f2b
2025-01-13 21:34:20 +08:00
hiyouga
0c1ad5f3fb fix llamaboard with ray
Former-commit-id: c46675d5e56d175c27d705ef0068fb47dc89a872
2025-01-07 09:59:24 +00:00
hiyouga
47c2d91933 support report custom args
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
2024-12-21 21:42:45 +00:00
ZeYi Lin
8f786ee938 feat: ui improve
Former-commit-id: 5f6dafd70e962b8fe9a294d555133002135f80df
2024-12-20 11:03:02 +08:00
ZeYi Lin
53103f55b6 feat: optimize frontend
Former-commit-id: 8c2df41b937f491f7ebf593b20c65a19738c7642
2024-12-19 19:04:19 +08:00
hiyouga
1a48340680 add swanlab
Former-commit-id: 96f8f103e58a8ff307b0ce36c967de04f452434a
2024-12-19 07:12:31 +00:00
Yaser Afshar
76ebd62ac1 Add missing key to init_kwargs
Former-commit-id: 1c8ad22a5f167bf4e1c845e273583e5cb3a0214e
2024-12-17 12:34:05 +00:00
hoshi-hiyouga
1eaaa85980 do not split save_cmd ret value
Former-commit-id: d20b97e7e9514e68c33d0428512fd21b548c1131
2024-11-21 22:30:23 +08:00
superboy-zjc
e272f87849 [patch] Patch remote OS command injection vulnerability
Former-commit-id: aa6a174d6822340022433c5ba38182b4932adecb
2024-11-21 01:52:12 -05:00
hiyouga
d7e35ddcaa fix extra args
Former-commit-id: eb294fed1319e97657b54b9aa479d87734c711dc
2024-11-09 00:24:27 +08:00
hiyouga
c2766af6f4 fix dpo metrics
Former-commit-id: 4270f7dfb9a12471c91f6c03dce7ca6fd88566e1
2024-11-02 20:59:01 +08:00
hiyouga
e83cb17f97 support rank0 logger
Former-commit-id: c38aa29336f286266553da4909a7267d7ef21f37
2024-11-02 18:31:04 +08:00
hiyouga
aba4268607 support extra args in llamaboard
Former-commit-id: 86e5affc0d5f5682a20b28d27f80e0015b36fdd9
2024-10-30 08:55:54 +00:00
hiyouga
0d8aa6e6ef use pre-commit
Former-commit-id: 21db8ed2f4a0eba203754a92ce0741538e8ee709
2024-10-29 09:07:46 +00:00
hiyouga
f31e7e0dfc remove visual_inputs, fix qlora
Former-commit-id: a025c3df61db154bef13033518903bbf846f4fc8
2024-08-31 00:24:51 +08:00
hiyouga
a83756b5e9 refactor mm training
Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a
2024-08-30 02:14:31 +08:00
hiyouga
c765292093 support liger kernel
Former-commit-id: 72bc8f01111ad69b92a647b54b4af988515d9c34
2024-08-27 11:20:14 +08:00
hiyouga
0aad78694a tiny fix
Former-commit-id: ace1d44857368dbbc868e4bba2d7ff17329921f5
2024-07-24 18:33:39 +08:00
hiyouga
34f16cc635 follow #4878 fix #4684
Former-commit-id: 779aae83d253de0a86201ff87543b5d695e28d23
2024-07-18 22:06:12 +08:00
Shiyu Zhang
249adacc4d 仅仅训练最后一轮对话
Former-commit-id: 1e7b396ff2489055574fd3365425d26360d73897
2024-07-18 15:30:25 +08:00
hiyouga
14bc7b0551 fix up
Former-commit-id: 29ebcd75d55f70f2891632eba187b643cc3a9e51
2024-07-15 01:04:56 +08:00
hiyouga
f3c105f088 fix #4705
Former-commit-id: a15782cb9f3ee64ba1f5fc2a3da20ac6c6ef0aa0
2024-07-07 13:10:06 +08:00
hiyouga
b254df2d34 update ui
Former-commit-id: 7f770f6895f1e2e0b8e4f0b49088bfae096f6d3c
2024-07-03 23:13:49 +08:00
hoshi-hiyouga
2452f57cd7 Merge branch 'main' into main
Former-commit-id: e8e6af26514272e29a50649b38182beb4db4ebfa
2024-07-01 21:01:09 +08:00
hiyouga
835f0578c2 refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0fb9624807d809832f097301982d192
2024-06-28 01:04:24 +08:00
hiyouga
d2d9fa4abb support HQQ/EETQ #4113
Former-commit-id: ad144c2265cdee0d23014dbb3d017ea257cb26ed
2024-06-27 00:29:42 +08:00
hiyouga
d519c2fde5 tiny fix
Former-commit-id: 41086059b12ecb7827eb390294e315068ff9c2e6
2024-06-25 01:15:19 +08:00
hiyouga
029c343537 fix llamaboard abort
Former-commit-id: 3f84411b5d904dbbc5cddeda44884417fe68f689
2024-06-19 23:22:28 +08:00
hiyouga
f25b8626bf support pissa
Former-commit-id: 8c1046d78ac6c8f9429b73617e35e1eccb35138f
2024-06-16 01:08:12 +08:00
hiyouga
2946153cea add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
2024-06-15 17:54:33 +08:00
ancv
045eb155a2 implement efficient packing without cross-contamination attention
Former-commit-id: b2c367bc61c2778dc359613dca496d9e134c2743
2024-06-12 11:56:01 +07:00
hiyouga
a8318723a4 add resume args in webui
Former-commit-id: 06e5d136a4916413d1c116e341ba7d5136d7748a
2024-06-08 00:22:16 +08:00
hiyouga
d79222894c fix #4137
Former-commit-id: 8bf9da659cc789dbdd909930f79ac59539b51127
2024-06-07 19:16:06 +08:00
hiyouga
cceff9f520 lora modules: all by default
Former-commit-id: cae47379079ff811aa385c297481a27020a8da6b
2024-06-06 03:53:28 +08:00
hoshi-hiyouga
229794a148 Merge pull request #4066 from injet-zhou/main
add throughput entry to training log

Former-commit-id: f2580ad403cd0ae91aa0954c0a15363c46452438
2024-06-06 03:32:04 +08:00
hiyouga
00b3fb4d14 update train hparams
Former-commit-id: dc4a00dd63769dc02d898c8bad2c158e4e5c0447
2024-06-06 01:49:20 +08:00
faddddeout
a2931b813b add throughput entry to log
Former-commit-id: b2f04595423b8e84b3763d169e402a0cd34f3175
2024-06-04 11:04:29 +00:00