hoshi-hiyouga
|
610f164c69
|
[trainer] fix pt loss (#7748)
* fix pt loss
* robust
* fix
* test
|
2025-04-17 03:15:35 +08:00 |
|
hoshi-hiyouga
|
0a0cfeb782
|
[breaking] bump transformers to 4.45.0 & improve ci (#7746)
* update ci
* fix
* fix
* fix
* fix
* fix
|
2025-04-17 02:36:48 +08:00 |
|
hoshi-hiyouga
|
ac8c6fdd3a
|
[assets] update model readme (#7724)
|
2025-04-15 00:41:09 +08:00 |
|
hoshi-hiyouga
|
1fd4d14fbb
|
[deps] upgrade transformers (#7704)
|
2025-04-13 18:11:34 +08:00 |
|
hoshi-hiyouga
|
34fdabe005
|
[data] add coig-p dataset (#7657)
|
2025-04-09 21:18:25 +08:00 |
|
hoshi-hiyouga
|
39876b85fc
|
[assets] update readme (#7644)
|
2025-04-09 01:06:06 +08:00 |
|
hoshi-hiyouga
|
6c200fd218
|
[model] add llama4 (#7611)
|
2025-04-06 13:42:31 +08:00 |
|
hoshi-hiyouga
|
59e12bffe8
|
[model] add qwen2vl 32b & upgrade peft (#7469)
* add qwen2vl 32b
* fix ci
* upgrade peft to 0.15
* fix ci
* fix ci
|
2025-03-25 12:15:58 +08:00 |
|
hoshi-hiyouga
|
b1b78daf06
|
[deps] upgrade transformers to 4.50.0 (#7437)
* upgrade transformers
* fix hf cache
* fix dpo trainer
|
2025-03-23 17:44:27 +08:00 |
|
hoshi-hiyouga
|
142fd7e755
|
[misc] upgrade deps (#7257)
|
2025-03-12 00:33:47 +08:00 |
|
hoshi-hiyouga
|
7c1640ed5f
|
[misc] upgrade format to py39 (#7256)
|
2025-03-12 00:08:41 +08:00 |
|
hoshi-hiyouga
|
3fbd4848e8
|
[version] support transformers 449 (#6982)
* support transformers 449
* fix mm plugin
Former-commit-id: b00b290c07beb560a5af857ce64f4ce424831a2c
|
2025-02-18 17:05:40 +08:00 |
|
hoshi-hiyouga
|
ff6658ad27
|
[deps] upgrade vllm (#6857)
Former-commit-id: 5f38bcaba921dbdee27b4be4709fcec06fa37c9e
|
2025-02-08 15:02:28 +08:00 |
|
hoshi-hiyouga
|
1fee69f874
|
[misc] update license year & fix llama pro (#6814)
* fix llamapro script
* change year
Former-commit-id: e2dc5b952aa22835d5220ba624f44676138b65ac
|
2025-02-05 01:53:33 +08:00 |
|
hoshi-hiyouga
|
445d643ef3
|
[model] add mistral small models (#6786)
Former-commit-id: 94803d8133fbbadff6d224cb6695feb5434fd4fd
|
2025-02-01 04:31:38 +08:00 |
|
hoshi-hiyouga
|
f6779b0e0c
|
[breaking] support transformers 4.48 (#6628)
Former-commit-id: 15357cdad953bba1f2d294819f56b9746ed1b891
|
2025-01-31 01:36:33 +08:00 |
|
hiyouga
|
bff1b94583
|
generalized packing & fix #6343
Former-commit-id: 2d107d3aefd5af61163056634c8b91fe3cb3e77c
|
2024-12-17 10:26:19 +00:00 |
|
hiyouga
|
3730fc046f
|
update datasets version
Former-commit-id: c5fae465ec8cbc30f9e91e6c32b88e74c805874a
|
2024-11-04 07:52:26 +00:00 |
|
hiyouga
|
584ce3a105
|
fix incorrect loss value for vlms
Former-commit-id: 30567a1487727473950104718e626ff660f10cbb
|
2024-10-30 08:56:46 +00:00 |
|
hiyouga
|
163cf2ba5c
|
update requires
Former-commit-id: 77666bd2278a3cfe5b567f4fe285b0f93871d166
|
2024-10-29 16:10:07 +08:00 |
|
huniu20
|
26e897e861
|
1. add modelers hub support
Former-commit-id: 24ebe187e360753666b768685a0dcc78054bb702
|
2024-10-09 17:21:37 +08:00 |
|
hiyouga
|
4464a6ff5b
|
tiny fix
Former-commit-id: 451d271718a8026056d0f7d7b8ab333391d24ad4
|
2024-10-08 17:48:56 +08:00 |
|
hiyouga
|
38505ae9e1
|
update accelerate ver for schedule_free optimizers
Former-commit-id: bdde35fd2e4a919c1d63ebfc9a0ea8ba0c97e14c
|
2024-09-09 22:51:08 +08:00 |
|
hiyouga
|
7ccb86b215
|
add docstrings, refactor logger
Former-commit-id: 54c69059379d77dc9046c144cbe2d0253de3a4da
|
2024-09-08 00:56:56 +08:00 |
|
hiyouga
|
995491594d
|
tiny fix
Former-commit-id: 76f2e5950483c669a15a961f0554442b6eb5c4a6
|
2024-09-05 23:41:16 +08:00 |
|
hiyouga
|
236f97b35c
|
tiny fix
Former-commit-id: 55027282cdaa59a470ac89bfb3860504ba9075ff
|
2024-09-01 21:15:44 +08:00 |
|
hiyouga
|
cb776752f6
|
fix mixed mm inputs and rlhf-v
Former-commit-id: 9967ccb3aef3ca557ad6eafb78c6c99866857008
|
2024-09-01 20:52:47 +08:00 |
|
hiyouga
|
92c398166d
|
tiny fix
Former-commit-id: bee1bd43b946501690d70e4980205f9d82404296
|
2024-08-30 03:21:50 +08:00 |
|
hiyouga
|
a83756b5e9
|
refactor mm training
Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
21d3976eea
|
fix #5295
Former-commit-id: ad72f3e06593f124d661d61774def336511716e0
|
2024-08-29 20:30:18 +08:00 |
|
hiyouga
|
20013e130b
|
fix #5048
Former-commit-id: b7ca6c8dc14f689d0df16684a6121cc0ec24f8ba
|
2024-08-05 23:48:19 +08:00 |
|
hiyouga
|
e1e01d7efd
|
add unittest
Former-commit-id: 608de799a21f37319bf31c04c0aa50c4542ec757
|
2024-07-19 01:06:27 +08:00 |
|
hiyouga
|
0b26011181
|
fix gemma2 attention
Former-commit-id: 2f6af73da28c4f8321b625fd09ddec8bd4977b08
|
2024-07-13 23:33:45 +08:00 |
|
hiyouga
|
2946153cea
|
add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
820404946e
|
better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui
Former-commit-id: 80708717329b4552920dd4ce8cebc683e65d54c5
|
2024-05-29 23:55:38 +08:00 |
|
hiyouga
|
cae823ddf0
|
rename package
Former-commit-id: 308edbc4260d45907b4a9d3a45ec21d83e48aacb
|
2024-05-16 18:39:08 +08:00 |
|