hoshi-hiyouga
|
0a0cfeb782
|
[breaking] bump transformers to 4.45.0 & improve ci (#7746)
* update ci
* fix
* fix
* fix
* fix
* fix
|
2025-04-17 02:36:48 +08:00 |
|
hoshi-hiyouga
|
ac8c6fdd3a
|
[assets] update model readme (#7724)
|
2025-04-15 00:41:09 +08:00 |
|
hoshi-hiyouga
|
3ef36d0057
|
[misc] upgrade cli (#7714)
|
2025-04-14 15:41:22 +08:00 |
|
hoshi-hiyouga
|
1fd4d14fbb
|
[deps] upgrade transformers (#7704)
|
2025-04-13 18:11:34 +08:00 |
|
hoshi-hiyouga
|
34fdabe005
|
[data] add coig-p dataset (#7657)
|
2025-04-09 21:18:25 +08:00 |
|
hoshi-hiyouga
|
39876b85fc
|
[assets] update readme (#7644)
|
2025-04-09 01:06:06 +08:00 |
|
hoshi-hiyouga
|
6c200fd218
|
[model] add llama4 (#7611)
|
2025-04-06 13:42:31 +08:00 |
|
hoshi-hiyouga
|
468eea6f6d
|
[deps] pin pydantic to 2.10.6 (#7546)
|
2025-03-31 14:42:28 +08:00 |
|
hoshi-hiyouga
|
59e12bffe8
|
[model] add qwen2vl 32b & upgrade peft (#7469)
* add qwen2vl 32b
* fix ci
* upgrade peft to 0.15
* fix ci
* fix ci
|
2025-03-25 12:15:58 +08:00 |
|
GuoCoder
|
b6d8749bf3
|
[model] fix lora on quant models (#7456)
Co-authored-by: root <root@ai>
|
2025-03-25 11:59:46 +08:00 |
|
hoshi-hiyouga
|
c841e92116
|
[misc] fix ci (#7441)
* fix ci
* improve ci
|
2025-03-23 21:09:35 +08:00 |
|
hoshi-hiyouga
|
b1b78daf06
|
[deps] upgrade transformers to 4.50.0 (#7437)
* upgrade transformers
* fix hf cache
* fix dpo trainer
|
2025-03-23 17:44:27 +08:00 |
|
hoshi-hiyouga
|
142fd7e755
|
[misc] upgrade deps (#7257)
|
2025-03-12 00:33:47 +08:00 |
|
hiyouga
|
f5810a6e47
|
release v0.9.2
Former-commit-id: aaad96359398c50bfe4a864859039a99b9f3a3a7
|
2025-03-11 14:49:13 +08:00 |
|
hoshi-hiyouga
|
f4aa0a146c
|
[misc] fix project toml (#7067)
Former-commit-id: 96fd510e6a03eae7a1f41772e1d6b784df6d5d2e
|
2025-02-25 23:22:48 +08:00 |
|
hoshi-hiyouga
|
3fbd4848e8
|
[version] support transformers 449 (#6982)
* support transformers 449
* fix mm plugin
Former-commit-id: b00b290c07beb560a5af857ce64f4ce424831a2c
|
2025-02-18 17:05:40 +08:00 |
|
hoshi-hiyouga
|
ff6658ad27
|
[deps] upgrade vllm (#6857)
Former-commit-id: 5f38bcaba921dbdee27b4be4709fcec06fa37c9e
|
2025-02-08 15:02:28 +08:00 |
|
Zhangchi Feng
|
01915eaf40
|
[model] support audio (#6701)
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 24c78429489809873a1269a735ea5421340b32a2
|
2025-02-05 04:59:09 +08:00 |
|
hoshi-hiyouga
|
445d643ef3
|
[model] add mistral small models (#6786)
Former-commit-id: 94803d8133fbbadff6d224cb6695feb5434fd4fd
|
2025-02-01 04:31:38 +08:00 |
|
hoshi-hiyouga
|
f6779b0e0c
|
[breaking] support transformers 4.48 (#6628)
Former-commit-id: 15357cdad953bba1f2d294819f56b9746ed1b891
|
2025-01-31 01:36:33 +08:00 |
|
hoshi-hiyouga
|
bbf334f823
|
disable valset by default (#6690)
Former-commit-id: 77bbf659053e1b205974eb6df69998fee0305d26
|
2025-01-17 21:09:30 +08:00 |
|
hoshi-hiyouga
|
770433fa33
|
[webui] upgrade to gradio 5 (#6688)
Former-commit-id: 4d0f662dbe227ab0da11a1e109f7a2c5ab8f70b9
|
2025-01-17 20:15:42 +08:00 |
|
hiyouga
|
61320965aa
|
pin tokenizers version
Former-commit-id: b7d4cf2caf2d02f7d16bab3f0ec8bf0108b7be75
|
2024-11-27 05:24:58 +00:00 |
|
hoshi-hiyouga
|
618a8e6c9f
|
fix #6061
Former-commit-id: 4ac5b97011225b1fd5fa741c1335948d721489ac
|
2024-11-18 20:56:44 +08:00 |
|
hiyouga
|
3730fc046f
|
update datasets version
Former-commit-id: c5fae465ec8cbc30f9e91e6c32b88e74c805874a
|
2024-11-04 07:52:26 +00:00 |
|
hiyouga
|
584ce3a105
|
fix incorrect loss value for vlms
Former-commit-id: 30567a1487727473950104718e626ff660f10cbb
|
2024-10-30 08:56:46 +00:00 |
|
hiyouga
|
163cf2ba5c
|
update requires
Former-commit-id: 77666bd2278a3cfe5b567f4fe285b0f93871d166
|
2024-10-29 16:10:07 +08:00 |
|
hiyouga
|
92de726102
|
fix #5668
Former-commit-id: 40ceba500bab7452b8671a9fbcd14bbf4a8f6f37
|
2024-10-12 01:24:43 +08:00 |
|
hiyouga
|
4464a6ff5b
|
tiny fix
Former-commit-id: 451d271718a8026056d0f7d7b8ab333391d24ad4
|
2024-10-08 17:48:56 +08:00 |
|
hoshi-hiyouga
|
15dbd4893e
|
Update requirements.txt
Former-commit-id: 905b7c03ae074bd958afdab6d79e45b30cec5271
|
2024-09-29 21:51:23 +08:00 |
|
Zhangchi Feng
|
4b6606832c
|
Merge branch 'hiyouga:main' into main
Former-commit-id: 4643089a7dc6a88c391663131333f35b5da5015b
|
2024-09-10 13:20:24 +08:00 |
|
BUAADreamer
|
f00f4ae9b6
|
support llava-next(video)
Former-commit-id: 31259e7e0caa9ff6449b4abcee0554e211167178
|
2024-09-10 12:31:53 +08:00 |
|
hiyouga
|
38505ae9e1
|
update accelerate ver for schedule_free optimizers
Former-commit-id: bdde35fd2e4a919c1d63ebfc9a0ea8ba0c97e14c
|
2024-09-09 22:51:08 +08:00 |
|
hiyouga
|
a83756b5e9
|
refactor mm training
Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
21d3976eea
|
fix #5295
Former-commit-id: ad72f3e06593f124d661d61774def336511716e0
|
2024-08-29 20:30:18 +08:00 |
|
hiyouga
|
20013e130b
|
fix #5048
Former-commit-id: b7ca6c8dc14f689d0df16684a6121cc0ec24f8ba
|
2024-08-05 23:48:19 +08:00 |
|
MengqingCao
|
cd563116ca
|
update dependencies
Former-commit-id: 7d4a29303350711558566d10d02230ed85ee1b69
|
2024-06-20 02:09:47 +00:00 |
|
hoshi-hiyouga
|
09c34e5b6c
|
Update requirements.txt
Former-commit-id: e8c518c08a1235f83f66f83d6f8a6fcad8c598df
|
2024-06-18 22:27:24 +08:00 |
|
胡翀
|
8ab2d707e5
|
Update requirements.txt
add pandas version requirements
Former-commit-id: 12869c3ede9bf11bc0fbdfa7af559808551563be
|
2024-06-17 16:45:57 +08:00 |
|
hiyouga
|
d3196318be
|
fix #4120
Former-commit-id: f9e818d79cf686cb34789327add7ed1f749966c6
|
2024-06-07 04:18:05 +08:00 |
|
hiyouga
|
a16786d8ba
|
fix #4090
Former-commit-id: 67fe822324a9f830175e44f89acdd9d759b38852
|
2024-06-06 00:50:32 +08:00 |
|
hiyouga
|
ecd06d0110
|
fix #4079
Former-commit-id: 83a005e3d404f5a8ccb7b8ac17c50db75df4e8d4
|
2024-06-05 16:56:54 +08:00 |
|
hiyouga
|
af7748139a
|
bump versions
transformers 4.37.2->4.41.2
datasets 2.14.3->2.16.0
accelerate 0.27.2->0.30.1
peft 0.10.0->0.11.1
trl 0.8.1->0.8.6
Former-commit-id: 876bc92865605be872bc811a56a1d1e05490ec8a
|
2024-06-03 18:29:38 +08:00 |
|
hiyouga
|
9dc209b458
|
resolve python 3.8 package
Former-commit-id: 75aec4cf8e9089808a1731e2848b29191f26d51d
|
2024-05-09 16:52:27 +08:00 |
|
hiyouga
|
1d3fb90590
|
add deepseek moe 236B
Former-commit-id: 10ab83f4c4dc96013e916462f056d1497c6ddf6c
|
2024-05-08 16:37:54 +08:00 |
|
hiyouga
|
289d1f3679
|
update webui and add CLIs
Former-commit-id: 245fe47ece22a4b7822449b126715aaa8ec25aba
|
2024-05-03 02:58:23 +08:00 |
|
hiyouga
|
80c8586534
|
reenable sdpa and fast tok by default
Former-commit-id: 07737a3d2d026c973ab964f948953d6ce0e1f2a9
|
2024-04-24 02:18:44 +08:00 |
|
hiyouga
|
8beb7a9239
|
update readme and gradio version
Former-commit-id: 5d62a51c12061c59b509db8fe367817f4e48f737
|
2024-04-16 18:09:16 +08:00 |
|
hiyouga
|
f334b89616
|
back to gradio 4.21 and fix chat
Former-commit-id: 4b920f24d35c73814b83d56373dd5c913bb57e49
|
2024-04-04 02:07:20 +08:00 |
|
hiyouga
|
54a4a8217a
|
fix bug in latest gradio
Former-commit-id: 5ddcecda50ccff93d51bebc9ac72c2a0dd483e9b
|
2024-04-04 00:55:31 +08:00 |
|