hoshi-hiyouga
|
ba032828e2
|
[deps] upgrade transformers (#8159)
|
2025-05-26 22:03:58 +08:00 |
|
hoshi-hiyouga
|
9ae17cd173
|
[deps] update to transformers 4.52 (#8125)
|
2025-05-21 05:16:18 +08:00 |
|
hoshi-hiyouga
|
86ebb219d6
|
[breaking] bump transformers to 4.45.0 & improve ci (#7746)
* update ci
* fix
* fix
* fix
* fix
* fix
|
2025-04-17 02:36:48 +08:00 |
|
hoshi-hiyouga
|
7c61b35106
|
[misc] upgrade cli (#7714)
|
2025-04-14 15:41:22 +08:00 |
|
hoshi-hiyouga
|
264538cb26
|
[misc] upgrade format to py39 (#7256)
|
2025-03-12 00:08:41 +08:00 |
|
Ze-Yi LIN
|
11672f760d
|
[webui] display swanlab exp link (#7089)
* webui add swanlab link
* change callback name
* update
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 27a4b93871c63b839c92940766bd7e0177972c9b
|
2025-02-27 19:40:54 +08:00 |
|
hoshi-hiyouga
|
c2022431aa
|
[misc] update license year & fix llama pro (#6814)
* fix llamapro script
* change year
Former-commit-id: d9ae594178796994d400a5f207d6499712816f89
|
2025-02-05 01:53:33 +08:00 |
|
hoshi-hiyouga
|
222423bcef
|
[breaking] support transformers 4.48 (#6628)
Former-commit-id: f154ab175c513a4d7bb866bf2cffc34b77b50508
|
2025-01-31 01:36:33 +08:00 |
|
hoshi-hiyouga
|
e71737351f
|
[webui] improve webui & reasoning mode (#6778)
Former-commit-id: 3f17fc0d7163372e0446f1a38792ff761e99b739
|
2025-01-31 00:09:21 +08:00 |
|
hoshi-hiyouga
|
87d685b59f
|
[model] support yarn (#6693)
Former-commit-id: 8c412abc44a4c61b683465e36c6288580d980250
|
2025-01-18 13:56:09 +08:00 |
|
hoshi-hiyouga
|
31daa6570b
|
[webui] upgrade to gradio 5 (#6688)
Former-commit-id: 9df7721264ddef0008d7648e6ed173adef99bd74
|
2025-01-17 20:15:42 +08:00 |
|
hoshi-hiyouga
|
7638f1070e
|
[optim] clean apollo (#6645)
* clean apollo code
* update readme
Former-commit-id: 38b8ec4a99189483124b54df9d6bc6b0d318855a
|
2025-01-15 01:42:50 +08:00 |
|
zhuHQ
|
c2120432db
|
[optim] add support to APOLLO (#6617)
Former-commit-id: 5a252e5a458457adbd19da3b68a3897ad2962824
|
2025-01-15 00:24:56 +08:00 |
|
hoshi-hiyouga
|
2a05941b14
|
[inference] fix stop token for object detection (#6624)
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: 844919fadaa8a61dfae47020971ea80730b2346f
|
2025-01-13 21:34:20 +08:00 |
|
hiyouga
|
0ef1f981da
|
fix llamaboard with ray
Former-commit-id: bd8a432d6a980b1b24a551626304fe3d394b1baf
|
2025-01-07 09:59:24 +00:00 |
|
hiyouga
|
a897d46049
|
support report custom args
Former-commit-id: d41254c40a1c5cacf9377096adb27efa9bdb79ea
|
2024-12-21 21:42:45 +00:00 |
|
ZeYi Lin
|
e5d9d8c55d
|
feat: ui improve
Former-commit-id: 6a1effb1741a13ae5238b0e9b429b4cbe3b6534f
|
2024-12-20 11:03:02 +08:00 |
|
ZeYi Lin
|
44895ebe36
|
feat: optimize frontend
Former-commit-id: 4a78603c141d9bd78bcaf81261b443cf082bf51f
|
2024-12-19 19:04:19 +08:00 |
|
hiyouga
|
7eeeffdb8a
|
add swanlab
Former-commit-id: c85a77c8a8824a56a67d56b97b4877fcd6edeb3d
|
2024-12-19 07:12:31 +00:00 |
|
Yaser Afshar
|
6f1c8dacea
|
Add missing key to init_kwargs
Former-commit-id: 03fc4621dad132164596a58d3e8693787b7e1aca
|
2024-12-17 12:34:05 +00:00 |
|
hoshi-hiyouga
|
b852c895cf
|
do not split save_cmd ret value
Former-commit-id: 1e312072fb4a9f472e2d3fa7e6b4fb0aec00b566
|
2024-11-21 22:30:23 +08:00 |
|
superboy-zjc
|
aaa7ed8712
|
[patch] Patch remote OS command injection vulnerability
Former-commit-id: 4678ceea4ce334a8289caf87d86047e67c67c603
|
2024-11-21 01:52:12 -05:00 |
|
hiyouga
|
0027f46ccc
|
fix extra args
Former-commit-id: 2c98a1bc3d885170f8298872c2ea2e24427fb447
|
2024-11-09 00:24:27 +08:00 |
|
hiyouga
|
2bb3255e74
|
fix dpo metrics
Former-commit-id: 57029280da825a39fbf5a05097921b861f126669
|
2024-11-02 20:59:01 +08:00 |
|
hiyouga
|
093eda2ad6
|
support rank0 logger
Former-commit-id: 84528eabe560091bfd866b6a0ca864085af7529b
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
aeeee9d4b5
|
support extra args in llamaboard
Former-commit-id: da0a5fd612e2214cc4bcb72516efd768fbe18a20
|
2024-10-30 08:55:54 +00:00 |
|
hiyouga
|
248d5daaff
|
use pre-commit
Former-commit-id: 7cfede95df22a9ff236788f04159b6b16b8d04bb
|
2024-10-29 09:07:46 +00:00 |
|
hiyouga
|
2f6fc27c8b
|
remove visual_inputs, fix qlora
Former-commit-id: be30c01c4f1482520ece770bd54c6a4837c26f0a
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
c62a6ca59d
|
refactor mm training
Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
206a8364d4
|
support liger kernel
Former-commit-id: 0f4e54abf6c5feb2329855a4047597ad5147720a
|
2024-08-27 11:20:14 +08:00 |
|
hiyouga
|
211038584a
|
tiny fix
Former-commit-id: 28cac0e325bfd7a6c0c344ad2d46511613190cd7
|
2024-07-24 18:33:39 +08:00 |
|
hiyouga
|
4c1513a845
|
follow #4878 fix #4684
Former-commit-id: 4715e5c5b8040b21e5f401f7e969b9fd2757d520
|
2024-07-18 22:06:12 +08:00 |
|
Shiyu Zhang
|
c1e1918db1
|
train only on the last turn of the conversation
Former-commit-id: ab6198e4c099edeb1a400f58729cd617e8cd8e50
|
2024-07-18 15:30:25 +08:00 |
|
hiyouga
|
e4d11a117b
|
fix up
Former-commit-id: 43a56cb331fae899ca35b0c312730d4ab79d0c42
|
2024-07-15 01:04:56 +08:00 |
|
hiyouga
|
cb10050cb9
|
fix #4705
Former-commit-id: cfd25c6463bcc263c8672d1de365dd81a028b66a
|
2024-07-07 13:10:06 +08:00 |
|
hiyouga
|
8ac4f87c91
|
update ui
Former-commit-id: b1522a3c0951e2e57f873dc6c758aaed33ca374e
|
2024-07-03 23:13:49 +08:00 |
|
hoshi-hiyouga
|
a715490c2a
|
Merge branch 'main' into main
Former-commit-id: 7be442f37d53a0c6324728fa1fa8e2c84d7f0fa5
|
2024-07-01 21:01:09 +08:00 |
|
hiyouga
|
46f0189e88
|
refactor pissa, improve llamaboard
Former-commit-id: 619556e46c19718f702c97df5d570a2a4c5fb13a
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
8aaf1185a5
|
support HQQ/EETQ #4113
Former-commit-id: b7cb51ddb394f04fe4646b2c297fc8d918c9979e
|
2024-06-27 00:29:42 +08:00 |
|
hiyouga
|
135bfbf7c1
|
tiny fix
Former-commit-id: bb57478366a70a0871af30ab31c890f471e27ff4
|
2024-06-25 01:15:19 +08:00 |
|
hiyouga
|
b2f5c0e0db
|
fix llamaboard abort
Former-commit-id: 9ef609a2c0185040e531dea3829a6f481539cdea
|
2024-06-19 23:22:28 +08:00 |
|
hiyouga
|
32f45c9e91
|
support pissa
Former-commit-id: ef8e45f2eaf466c54e9a671512a2974575677b08
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
bb88536166
|
add license
Former-commit-id: 69cfc98d7c81756a5ab6bf962240e393e449fef0
|
2024-06-15 17:54:33 +08:00 |
|
ancv
|
c7ab302c69
|
implement efficient packing without cross-contamination attention
Former-commit-id: a64a5305c0da5ef092d4cc26faf829bb44de65d1
|
2024-06-12 11:56:01 +07:00 |
|
hiyouga
|
3f6b3eed98
|
add resume args in webui
Former-commit-id: 1d86ad768b1f36e54b4c2a9f18f6ea5a7df04c90
|
2024-06-08 00:22:16 +08:00 |
|
hiyouga
|
f45e81e186
|
fix #4137
Former-commit-id: cdc0d6f5a2e5040e145c82c4801f37bd76529047
|
2024-06-07 19:16:06 +08:00 |
|
hiyouga
|
937f49ec3d
|
lora modules: all by default
Former-commit-id: 52c4ae87c7f4312704c31ef26b079b2c5b95ea5f
|
2024-06-06 03:53:28 +08:00 |
|
hoshi-hiyouga
|
8fdb32d0a3
|
Merge pull request #4066 from injet-zhou/main
add throughput entry to training log
Former-commit-id: d2816f343f405f3fab09f2a8eade774b886e8f92
|
2024-06-06 03:32:04 +08:00 |
|
hiyouga
|
35379c7c0e
|
update train hparams
Former-commit-id: 1ca9fce55b55bf209f4b76152b586731932a3f39
|
2024-06-06 01:49:20 +08:00 |
|
faddddeout
|
99ecb0daaf
|
add throughput entry to log
Former-commit-id: 691f999f64c7bac78761e4354f89816d2f0d46fc
|
2024-06-04 11:04:29 +00:00 |
|