hoshi-hiyouga
|
c322512037
|
[deps] upgrade vllm (#6857)
Former-commit-id: 4bd50f65a3d62528768561019fda2723d045c7fd
|
2025-02-08 15:02:28 +08:00 |
|
hoshi-hiyouga
|
e335c548c1
|
[model] add mistral small models (#6786)
Former-commit-id: e5e95c39bc4199fa89c67e34f9adaaa987058744
|
2025-02-01 04:31:38 +08:00 |
|
hoshi-hiyouga
|
46068b3324
|
[breaking] support transformers 4.48 (#6628)
Former-commit-id: f154ab175c513a4d7bb866bf2cffc34b77b50508
|
2025-01-31 01:36:33 +08:00 |
|
hiyouga
|
760dea0787
|
imporve log
Former-commit-id: a6abf375975ffea3d51e1b944c9855b5f62ffac8
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
85f185449e
|
generalized packing & fix #6343
Former-commit-id: 3b1e4194616cacd5c24f08b328e31a008bddcf29
|
2024-12-17 10:26:19 +00:00 |
|
hiyouga
|
cc22c045a7
|
fix inputs
Former-commit-id: 7d535bb8cdf7e81edda81152e63c8cfe6c9dcc9f
|
2024-11-23 18:26:02 +00:00 |
|
hiyouga
|
a117731ecb
|
support rank0 logger
Former-commit-id: 84528eabe560091bfd866b6a0ca864085af7529b
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
25f00034d5
|
fix incorrect loss value for vlms
Former-commit-id: 0aa29a71ce958343a2086090d647eb63b8f5f5be
|
2024-10-30 08:56:46 +00:00 |
|
hiyouga
|
625a884707
|
update requires
Former-commit-id: cae0e688ddcead370821e126c192bddc53ff6017
|
2024-10-29 16:10:07 +08:00 |
|
hiyouga
|
aa22bf217f
|
tiny fix
Former-commit-id: d8ddd07c2ed14d871fb25743c20265fc99e3e221
|
2024-10-08 17:48:56 +08:00 |
|
hiyouga
|
2959f12c6e
|
tiny fix
Former-commit-id: c0e9c0484dae6db93cef5048bad827ff22b1986a
|
2024-09-05 23:41:16 +08:00 |
|
hiyouga
|
228f745235
|
refactor mm training
Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
019a932b2f
|
fix #5048
Former-commit-id: 71a6861667ae68c1fd6a69acf68e1359b858cf1b
|
2024-08-05 23:48:19 +08:00 |
|
hiyouga
|
9cd850c3b9
|
fix gemma2 attention
Former-commit-id: aeafc68e169ae0ea5939cc81cb0cf89f0ca044b6
|
2024-07-13 23:33:45 +08:00 |
|
hiyouga
|
ab24bde597
|
tiny fix
Former-commit-id: 9b211861eba19ae9fc360bc96eeb8ad67ba40c49
|
2024-07-04 03:47:05 +08:00 |
|
hiyouga
|
a0df8be4e8
|
fix packing for eager/sdpa attn
Former-commit-id: 735a033ceb7f2da6da71d138ea091d8a665411a9
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
bd294e7cc3
|
update packing
Former-commit-id: f3d9c31efa0e64317bdd5b4ed6f78653cf3b5ba4
|
2024-07-04 01:10:55 +08:00 |
|
hoshi-hiyouga
|
d124ce001b
|
Update packing.py
Former-commit-id: 3cc11aa88839c5b99cfd83d9225770a33d0eb6fd
|
2024-07-03 23:36:01 +08:00 |
|
hiyouga
|
f849d03533
|
update func name
Former-commit-id: ed93ac0829fa656194fd32e1ac063843f475746f
|
2024-07-03 23:29:33 +08:00 |
|
hiyouga
|
7c08a4a82a
|
update arg name
Former-commit-id: 1509ed550b2060f946ce20e3c5a9e5c49e86e3ab
|
2024-07-03 23:23:24 +08:00 |
|
ancv
|
4d345f7901
|
move configure_packing to llamafactory.model.patcher and fix constants
Former-commit-id: 9c5e972c9c81957f2e9e30bf284ef1c076de9fd0
|
2024-06-21 00:45:06 +07:00 |
|
ancv
|
84e1f06e45
|
update packing with sdpa and eager attention mode
Former-commit-id: 285636ba3a57a1038b2f2fd4cf909a1ca07708d4
|
2024-06-16 02:25:47 +07:00 |
|
ancv
|
c5e1dfb3a0
|
remove some unused params
Former-commit-id: fef8132c50505a5fb6a246bd024491bd31798a3c
|
2024-06-15 23:00:55 +07:00 |
|
ancv
|
4463a5227a
|
implement efficient packing without cross-contamination attention
Former-commit-id: a64a5305c0da5ef092d4cc26faf829bb44de65d1
|
2024-06-12 11:56:01 +07:00 |
|