hoshi-hiyouga
|
40fb24916f
|
[model] add llama4 (#7611)
|
2025-04-06 13:42:31 +08:00 |
|
hoshi-hiyouga
|
cb42e2c4de
|
[model] add qwen2vl 32b & upgrade peft (#7469)
* add qwen2vl 32b
* fix ci
* upgrade peft to 0.15
* fix ci
* fix ci
|
2025-03-25 12:15:58 +08:00 |
|
hoshi-hiyouga
|
1a7c872c14
|
[deps] upgrade transformers to 4.50.0 (#7437)
* upgrade transformers
* fix hf cache
* fix dpo trainer
|
2025-03-23 17:44:27 +08:00 |
|
hoshi-hiyouga
|
9e7e07b78f
|
[misc] upgrade deps (#7257)
|
2025-03-12 00:33:47 +08:00 |
|
hoshi-hiyouga
|
efa86e730c
|
[misc] upgrade format to py39 (#7256)
|
2025-03-12 00:08:41 +08:00 |
|
hoshi-hiyouga
|
865b2b8b87
|
[version] support transformers 449 (#6982)
* support transformers 449
* fix mm plugin
Former-commit-id: e9118a9df0839d24f6ddff5a0b55ef101a1d3d22
|
2025-02-18 17:05:40 +08:00 |
|
hoshi-hiyouga
|
c322512037
|
[deps] upgrade vllm (#6857)
Former-commit-id: 4bd50f65a3d62528768561019fda2723d045c7fd
|
2025-02-08 15:02:28 +08:00 |
|
hoshi-hiyouga
|
40b6e9045d
|
[misc] update license year & fix llama pro (#6814)
* fix llamapro script
* change year
Former-commit-id: d9ae594178796994d400a5f207d6499712816f89
|
2025-02-05 01:53:33 +08:00 |
|
hoshi-hiyouga
|
e335c548c1
|
[model] add mistral small models (#6786)
Former-commit-id: e5e95c39bc4199fa89c67e34f9adaaa987058744
|
2025-02-01 04:31:38 +08:00 |
|
hoshi-hiyouga
|
46068b3324
|
[breaking] support transformers 4.48 (#6628)
Former-commit-id: f154ab175c513a4d7bb866bf2cffc34b77b50508
|
2025-01-31 01:36:33 +08:00 |
|
hiyouga
|
85f185449e
|
generalized packing & fix #6343
Former-commit-id: 3b1e4194616cacd5c24f08b328e31a008bddcf29
|
2024-12-17 10:26:19 +00:00 |
|
hiyouga
|
2e0092ed48
|
update datasets version
Former-commit-id: feba2c6418a15715fee77a34428fa3cf47fcee5b
|
2024-11-04 07:52:26 +00:00 |
|
hiyouga
|
25f00034d5
|
fix incorrect loss value for vlms
Former-commit-id: 0aa29a71ce958343a2086090d647eb63b8f5f5be
|
2024-10-30 08:56:46 +00:00 |
|
hiyouga
|
625a884707
|
update requires
Former-commit-id: cae0e688ddcead370821e126c192bddc53ff6017
|
2024-10-29 16:10:07 +08:00 |
|
huniu20
|
c3a040b4a5
|
1. add modelers hub support
Former-commit-id: 14678eb444d8181176745d18d4a6865fd6860f58
|
2024-10-09 17:21:37 +08:00 |
|
hiyouga
|
aa22bf217f
|
tiny fix
Former-commit-id: d8ddd07c2ed14d871fb25743c20265fc99e3e221
|
2024-10-08 17:48:56 +08:00 |
|
hiyouga
|
2c61942632
|
update accelerate ver for schedule_free optimizers
Former-commit-id: 2de74e79049ce8e50f605f649275b1dbfb899c8c
|
2024-09-09 22:51:08 +08:00 |
|
hiyouga
|
fb90faf19a
|
add docstrings, refactor logger
Former-commit-id: c34e489d71f8f539028543ccf8ee92cecedd6276
|
2024-09-08 00:56:56 +08:00 |
|
hiyouga
|
2959f12c6e
|
tiny fix
Former-commit-id: c0e9c0484dae6db93cef5048bad827ff22b1986a
|
2024-09-05 23:41:16 +08:00 |
|
hiyouga
|
baf60f10c9
|
tiny fix
Former-commit-id: 8ccaae3871d8d1fe3ea4633d427aecb2ab3addec
|
2024-09-01 21:15:44 +08:00 |
|
hiyouga
|
ad6c42ff0a
|
fix mixed mm inputs and rlhf-v
Former-commit-id: 7c248fac20bf85d57a91132ce7a793c7f84e9218
|
2024-09-01 20:52:47 +08:00 |
|
hiyouga
|
f389b6a676
|
tiny fix
Former-commit-id: 830511a6d0216da99520aee8b3a753d347a71fa9
|
2024-08-30 03:21:50 +08:00 |
|
hiyouga
|
228f745235
|
refactor mm training
Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
6c373064c5
|
fix #5295
Former-commit-id: c76873b0eb8225f6e6bfc7223c6012387dceb8ed
|
2024-08-29 20:30:18 +08:00 |
|
hiyouga
|
019a932b2f
|
fix #5048
Former-commit-id: 71a6861667ae68c1fd6a69acf68e1359b858cf1b
|
2024-08-05 23:48:19 +08:00 |
|
hiyouga
|
a66ff6052b
|
add unittest
Former-commit-id: 8a1f0c5f922989e08a19c65de0b2c4afd2a5771f
|
2024-07-19 01:06:27 +08:00 |
|
hiyouga
|
9cd850c3b9
|
fix gemma2 attention
Former-commit-id: aeafc68e169ae0ea5939cc81cb0cf89f0ca044b6
|
2024-07-13 23:33:45 +08:00 |
|
hiyouga
|
acfae2e677
|
add license
Former-commit-id: 69cfc98d7c81756a5ab6bf962240e393e449fef0
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
219a16130a
|
better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui
Former-commit-id: 84cfb2452cc86b037ccddee6e833f8eb7c129fa4
|
2024-05-29 23:55:38 +08:00 |
|
hiyouga
|
ee759aa0d8
|
rename package
Former-commit-id: a07ff0c083558cfe6f474d13027642d3052fee08
|
2024-05-16 18:39:08 +08:00 |
|