36 Commits

Author SHA1 Message Date
hoshi-hiyouga
610f164c69 [trainer] fix pt loss (#7748)
* fix pt loss

* robust

* fix

* test
2025-04-17 03:15:35 +08:00
hoshi-hiyouga
0a0cfeb782 [breaking] bump transformers to 4.45.0 & improve ci (#7746)
* update ci

* fix

* fix

* fix

* fix

* fix
2025-04-17 02:36:48 +08:00
hoshi-hiyouga
ac8c6fdd3a [assets] update model readme (#7724)
2025-04-15 00:41:09 +08:00
hoshi-hiyouga
1fd4d14fbb [deps] upgrade transformers (#7704)
2025-04-13 18:11:34 +08:00
hoshi-hiyouga
34fdabe005 [data] add coig-p dataset (#7657)
2025-04-09 21:18:25 +08:00
hoshi-hiyouga
39876b85fc [assets] update readme (#7644)
2025-04-09 01:06:06 +08:00
hoshi-hiyouga
6c200fd218 [model] add llama4 (#7611)
2025-04-06 13:42:31 +08:00
hoshi-hiyouga
59e12bffe8 [model] add qwen2vl 32b & upgrade peft (#7469)
* add qwen2vl 32b

* fix ci

* upgrade peft to 0.15

* fix ci

* fix ci
2025-03-25 12:15:58 +08:00
hoshi-hiyouga
b1b78daf06 [deps] upgrade transformers to 4.50.0 (#7437)
* upgrade transformers

* fix hf cache

* fix dpo trainer
2025-03-23 17:44:27 +08:00
hoshi-hiyouga
142fd7e755 [misc] upgrade deps (#7257)
2025-03-12 00:33:47 +08:00
hoshi-hiyouga
7c1640ed5f [misc] upgrade format to py39 (#7256)
2025-03-12 00:08:41 +08:00
hoshi-hiyouga
3fbd4848e8 [version] support transformers 449 (#6982)
* support transformers 449

* fix mm plugin

Former-commit-id: b00b290c07beb560a5af857ce64f4ce424831a2c
2025-02-18 17:05:40 +08:00
hoshi-hiyouga
ff6658ad27 [deps] upgrade vllm (#6857)
Former-commit-id: 5f38bcaba921dbdee27b4be4709fcec06fa37c9e
2025-02-08 15:02:28 +08:00
hoshi-hiyouga
1fee69f874 [misc] update license year & fix llama pro (#6814)
* fix llamapro script

* change year

Former-commit-id: e2dc5b952aa22835d5220ba624f44676138b65ac
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
445d643ef3 [model] add mistral small models (#6786)
Former-commit-id: 94803d8133fbbadff6d224cb6695feb5434fd4fd
2025-02-01 04:31:38 +08:00
hoshi-hiyouga
f6779b0e0c [breaking] support transformers 4.48 (#6628)
Former-commit-id: 15357cdad953bba1f2d294819f56b9746ed1b891
2025-01-31 01:36:33 +08:00
hiyouga
bff1b94583 generalized packing & fix #6343
Former-commit-id: 2d107d3aefd5af61163056634c8b91fe3cb3e77c
2024-12-17 10:26:19 +00:00
hiyouga
3730fc046f update datasets version
Former-commit-id: c5fae465ec8cbc30f9e91e6c32b88e74c805874a
2024-11-04 07:52:26 +00:00
hiyouga
584ce3a105 fix incorrect loss value for vlms
Former-commit-id: 30567a1487727473950104718e626ff660f10cbb
2024-10-30 08:56:46 +00:00
hiyouga
163cf2ba5c update requires
Former-commit-id: 77666bd2278a3cfe5b567f4fe285b0f93871d166
2024-10-29 16:10:07 +08:00
huniu20
26e897e861 1. add modelers hub support
Former-commit-id: 24ebe187e360753666b768685a0dcc78054bb702
2024-10-09 17:21:37 +08:00
hiyouga
4464a6ff5b tiny fix
Former-commit-id: 451d271718a8026056d0f7d7b8ab333391d24ad4
2024-10-08 17:48:56 +08:00
hiyouga
38505ae9e1 update accelerate ver for schedule_free optimizers
Former-commit-id: bdde35fd2e4a919c1d63ebfc9a0ea8ba0c97e14c
2024-09-09 22:51:08 +08:00
hiyouga
7ccb86b215 add docstrings, refactor logger
Former-commit-id: 54c69059379d77dc9046c144cbe2d0253de3a4da
2024-09-08 00:56:56 +08:00
hiyouga
995491594d tiny fix
Former-commit-id: 76f2e5950483c669a15a961f0554442b6eb5c4a6
2024-09-05 23:41:16 +08:00
hiyouga
236f97b35c tiny fix
Former-commit-id: 55027282cdaa59a470ac89bfb3860504ba9075ff
2024-09-01 21:15:44 +08:00
hiyouga
cb776752f6 fix mixed mm inputs and rlhf-v
Former-commit-id: 9967ccb3aef3ca557ad6eafb78c6c99866857008
2024-09-01 20:52:47 +08:00
hiyouga
92c398166d tiny fix
Former-commit-id: bee1bd43b946501690d70e4980205f9d82404296
2024-08-30 03:21:50 +08:00
hiyouga
a83756b5e9 refactor mm training
Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a
2024-08-30 02:14:31 +08:00
hiyouga
21d3976eea fix #5295
Former-commit-id: ad72f3e06593f124d661d61774def336511716e0
2024-08-29 20:30:18 +08:00
hiyouga
20013e130b fix #5048
Former-commit-id: b7ca6c8dc14f689d0df16684a6121cc0ec24f8ba
2024-08-05 23:48:19 +08:00
hiyouga
e1e01d7efd add unittest
Former-commit-id: 608de799a21f37319bf31c04c0aa50c4542ec757
2024-07-19 01:06:27 +08:00
hiyouga
0b26011181 fix gemma2 attention
Former-commit-id: 2f6af73da28c4f8321b625fd09ddec8bd4977b08
2024-07-13 23:33:45 +08:00
hiyouga
2946153cea add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
2024-06-15 17:54:33 +08:00
hiyouga
820404946e better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui

Former-commit-id: 80708717329b4552920dd4ce8cebc683e65d54c5
2024-05-29 23:55:38 +08:00
hiyouga
cae823ddf0 rename package
Former-commit-id: 308edbc4260d45907b4a9d3a45ec21d83e48aacb
2024-05-16 18:39:08 +08:00