hiyouga
|
20ee1d2e19
|
fix #5542
Former-commit-id: cf28e7418c2eb07e86923a53ef832ef218e45af1
|
2024-09-30 23:28:55 +08:00 |
|
BUAADreamer
|
a769d0e3d4
|
fix constants
Former-commit-id: 69309a23598995aa1937fd8d80732a018c18db87
|
2024-09-29 22:00:01 +08:00 |
|
BUAADreamer
|
3cc5408da7
|
fix style
Former-commit-id: dc1bdcb69e6f2c605a2c533dab15613affc902f4
|
2024-09-29 21:39:37 +08:00 |
|
Zhangchi Feng
|
689f5c4554
|
Merge branch 'main' into main
Former-commit-id: 7566589b820e6030269523e9d08c312594f893ae
|
2024-09-29 21:32:54 +08:00 |
|
BUAADreamer
|
ab5d042cd3
|
add more llava-next series template
Former-commit-id: 93f64f2aebf41582d39aa8a2c6059e562ca694b0
|
2024-09-29 21:29:29 +08:00 |
|
BUAADreamer
|
85a919b6f7
|
fix readme
Former-commit-id: 867e7e70dbff207dbd78668af09a638654937f71
|
2024-09-29 20:45:02 +08:00 |
|
BUAADreamer
|
722e01c8ab
|
fix some
Former-commit-id: aeca8c0f978cb9754e0526b40cd431aaf867044f
|
2024-09-29 17:55:40 +08:00 |
|
hoshi-hiyouga
|
1ded3abdf1
|
Update attention.py
Former-commit-id: 2adf79c195053bb4541e0317573a2c89da28b5bc
|
2024-09-29 10:47:41 +08:00 |
|
Amirreza A
|
ca736bcab7
|
made a small change to a warning about fa2 for gemma2 models.
Former-commit-id: e0695a026d822c896cb4f5b33e0c4f88441d75e9
|
2024-09-28 19:03:36 +03:30 |
|
BUAADreamer
|
6de82ca843
|
fix some
Former-commit-id: 12e509da85af76ccf1e9a879a78e450a7b70cc4b
|
2024-09-28 01:15:33 +08:00 |
|
BUAADreamer
|
b6fb00e046
|
add llava-next/llava-next-video/video-llava
Former-commit-id: a4e4239931b0b0e3fd12c9f9bbfd2c201cbc78ca
|
2024-09-28 00:57:03 +08:00 |
|
Zhangchi Feng
|
86c84972c8
|
Merge branch 'hiyouga:main' into main
Former-commit-id: 2695dcdf468f9e39e3aeec7892eb3dad399736ee
|
2024-09-27 18:14:39 +08:00 |
|
Billy Cao
|
2b4da8baf6
|
Add qwen_vl to liger kernel supported list
Former-commit-id: 053b2d832450cb6cd6af673b9fc51404f1fb1e41
|
2024-09-14 19:28:20 +08:00 |
|
BUAADreamer
|
514f976cc1
|
try to past test
Former-commit-id: 3b6bfae0e5fe795a70d530b2765f27d95c5862f8
|
2024-09-10 13:12:51 +08:00 |
|
BUAADreamer
|
484128b641
|
support llava-next(video)
Former-commit-id: 27e94593ac467e56e3a7f5c64f4ff6cee81f4b47
|
2024-09-10 12:31:53 +08:00 |
|
hiyouga
|
3cbc9109ea
|
tiny fix
Former-commit-id: 76177039c8f9ef5a63724a339dae6195d89fa215
|
2024-09-08 23:18:08 +08:00 |
|
hiyouga
|
b6810b209a
|
fix test case
Former-commit-id: b075b2971c6acb2c6039b36420a296f1f4e1b91b
|
2024-09-08 01:50:51 +08:00 |
|
hiyouga
|
158e0e1f63
|
add test case
Former-commit-id: c452d65e1551074dddd1d87517c0d44dc014c6aa
|
2024-09-08 01:40:49 +08:00 |
|
hiyouga
|
294a103ead
|
support activation offloading via unsloth gc
Former-commit-id: d3d0dd0feba3ca6f0ae970d5856bec989d26ef67
|
2024-09-08 01:22:19 +08:00 |
|
hiyouga
|
27547355e6
|
tiny fix
Former-commit-id: c0e9c0484dae6db93cef5048bad827ff22b1986a
|
2024-09-05 23:41:16 +08:00 |
|
hiyouga
|
b5e9df5df8
|
fix #5324
Former-commit-id: f7aa06c9c0b18c28419ea5792410915d3f322cbf
|
2024-09-02 23:56:21 +08:00 |
|
hiyouga
|
7e4c5d4bb3
|
fix mixed mm inputs and rlhf-v
Former-commit-id: 7c248fac20bf85d57a91132ce7a793c7f84e9218
|
2024-09-01 20:52:47 +08:00 |
|
hiyouga
|
2f6fc27c8b
|
remove visual_inputs, fix qlora
Former-commit-id: be30c01c4f1482520ece770bd54c6a4837c26f0a
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
c62a6ca59d
|
refactor mm training
Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a
|
2024-08-30 02:14:31 +08:00 |
|
hoshi-hiyouga
|
77c2c7076b
|
Merge pull request #5290 from simonJJJ/qwen2_vl
support qwen2-vl
Former-commit-id: 7156f832af8505b26371559d340c0e69eb962bbc
|
2024-08-30 02:10:36 +08:00 |
|
hiyouga
|
c1369a1ec9
|
update liger kernel
Former-commit-id: d6bf6ca2161c99dd5d644e31d2b1df451017b68c
|
2024-08-29 20:46:08 +08:00 |
|
simonJJJ
|
0f3d54d8a0
|
initial-commit
Former-commit-id: b6a39847a10b417b09db4b5512dd835e9e4ce928
|
2024-08-28 16:51:35 +08:00 |
|
hiyouga
|
206a8364d4
|
support liger kernel
Former-commit-id: 0f4e54abf6c5feb2329855a4047597ad5147720a
|
2024-08-27 11:20:14 +08:00 |
|
hiyouga
|
13093963b1
|
fix #5048
Former-commit-id: 71a6861667ae68c1fd6a69acf68e1359b858cf1b
|
2024-08-05 23:48:19 +08:00 |
|
hiyouga
|
e4d11a117b
|
fix up
Former-commit-id: 43a56cb331fae899ca35b0c312730d4ab79d0c42
|
2024-07-15 01:04:56 +08:00 |
|
hiyouga
|
71e4404c0d
|
tiny fix
Former-commit-id: 220d7c1ce15e8013a900e59fe0c7937e38b5c3b5
|
2024-07-14 10:56:45 +08:00 |
|
hiyouga
|
5ab997d484
|
fix gemma2 attention
Former-commit-id: aeafc68e169ae0ea5939cc81cb0cf89f0ca044b6
|
2024-07-13 23:33:45 +08:00 |
|
hiyouga
|
8567dab167
|
tiny fix
Former-commit-id: 9b211861eba19ae9fc360bc96eeb8ad67ba40c49
|
2024-07-04 03:47:05 +08:00 |
|
hiyouga
|
3d219b91b9
|
fix packing for eager/sdpa attn
Former-commit-id: 735a033ceb7f2da6da71d138ea091d8a665411a9
|
2024-07-04 01:52:43 +08:00 |
|
hoshi-hiyouga
|
a90c6306f8
|
Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention
Former-commit-id: ac382cc9fe4ec483658fd54f07f9a123788ce1b1
|
2024-07-04 01:18:54 +08:00 |
|
hiyouga
|
60558388ec
|
update packing
Former-commit-id: f3d9c31efa0e64317bdd5b4ed6f78653cf3b5ba4
|
2024-07-04 01:10:55 +08:00 |
|
hoshi-hiyouga
|
b29a7f8cd6
|
Update packing.py
Former-commit-id: 3cc11aa88839c5b99cfd83d9225770a33d0eb6fd
|
2024-07-03 23:36:01 +08:00 |
|
hiyouga
|
a1501591e8
|
update func name
Former-commit-id: ed93ac0829fa656194fd32e1ac063843f475746f
|
2024-07-03 23:29:33 +08:00 |
|
hiyouga
|
1408aa078d
|
update arg name
Former-commit-id: 1509ed550b2060f946ce20e3c5a9e5c49e86e3ab
|
2024-07-03 23:23:24 +08:00 |
|
hiyouga
|
a42671c2d7
|
tiny fix
Former-commit-id: d944020257f363f38e62de6279b337e399b7c65e
|
2024-07-03 02:31:50 +08:00 |
|
hoshi-hiyouga
|
a715490c2a
|
Merge branch 'main' into main
Former-commit-id: 7be442f37d53a0c6324728fa1fa8e2c84d7f0fa5
|
2024-07-01 21:01:09 +08:00 |
|
hiyouga
|
4357e42391
|
tiny fix
Former-commit-id: 19e43c3a9ed771e991cb273d394ab28fb923f868
|
2024-07-01 03:55:20 +08:00 |
|
hiyouga
|
3c4f8eaa55
|
loose gemma2 attention
Former-commit-id: a0b645017a2de3d58b6cbc71bd91ec96fc7a818b
|
2024-06-29 01:42:14 +08:00 |
|
hiyouga
|
fda2cf677b
|
bf16 by default, gemma2 attns
Gemma2 finetuning cannot work until merging https://github.com/huggingface/transformers/pull/31674
Former-commit-id: da66c32c7be0adc28d2185b23e9f62d56acb961c
|
2024-06-28 06:00:26 +08:00 |
|
hiyouga
|
d1aad72826
|
add quant checks
Former-commit-id: 15bb053e3549739b1a2134640a659b0f35df7de7
|
2024-06-27 01:12:25 +08:00 |
|
hiyouga
|
8aaf1185a5
|
support HQQ/EETQ #4113
Former-commit-id: b7cb51ddb394f04fe4646b2c297fc8d918c9979e
|
2024-06-27 00:29:42 +08:00 |
|
hiyouga
|
08fa707085
|
improve autogptq integration
Former-commit-id: d68408c7b123b8ff92014db35cac0b24b414a6f4
|
2024-06-26 22:11:44 +08:00 |
|
stceum
|
16e950454e
|
Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
Former-commit-id: 171289d8e4c111fdca2b100282b64c74a04a4726
|
2024-06-24 20:39:31 +08:00 |
|
ancv
|
6c185a2c57
|
move configure_packing to llamafactory.model.patcher and fix constants
Former-commit-id: 9c5e972c9c81957f2e9e30bf284ef1c076de9fd0
|
2024-06-21 00:45:06 +07:00 |
|
hiyouga
|
5f5d4c1923
|
update patcher
Former-commit-id: afb365e515d615dd62f791622450debab60ce5cc
|
2024-06-19 21:27:00 +08:00 |
|