hiyouga
|
a1ab66416a
|
fix #5324
Former-commit-id: f7aa06c9c0b18c28419ea5792410915d3f322cbf
|
2024-09-02 23:56:21 +08:00 |
|
hiyouga
|
04db03bdfd
|
add rlhf-v dataset
Former-commit-id: 3fd18fc34a0c994a738504746abfd5548e002437
|
2024-09-01 22:57:41 +08:00 |
|
hiyouga
|
ad6c42ff0a
|
fix mixed mm inputs and rlhf-v
Former-commit-id: 7c248fac20bf85d57a91132ce7a793c7f84e9218
|
2024-09-01 20:52:47 +08:00 |
|
hiyouga
|
a73e7e022d
|
add test mm plugin
Former-commit-id: ddea5cca5a3174de1dcc7fdee8ec69e77700b6bf
|
2024-08-31 01:53:38 +08:00 |
|
hiyouga
|
ec2da8b06a
|
remove visual_inputs, fix qlora
Former-commit-id: be30c01c4f1482520ece770bd54c6a4837c26f0a
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
228f745235
|
refactor mm training
Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a
|
2024-08-30 02:14:31 +08:00 |
|
hoshi-hiyouga
|
618e357740
|
Merge pull request #5290 from simonJJJ/qwen2_vl
support qwen2-vl
Former-commit-id: 7156f832af8505b26371559d340c0e69eb962bbc
|
2024-08-30 02:10:36 +08:00 |
|
hiyouga
|
4096d52752
|
update liger kernel
Former-commit-id: d6bf6ca2161c99dd5d644e31d2b1df451017b68c
|
2024-08-29 20:46:08 +08:00 |
|
simonJJJ
|
5e728ec221
|
initial-commit
Former-commit-id: b6a39847a10b417b09db4b5512dd835e9e4ce928
|
2024-08-28 16:51:35 +08:00 |
|
hiyouga
|
dd6c96b96d
|
support liger kernel
Former-commit-id: 0f4e54abf6c5feb2329855a4047597ad5147720a
|
2024-08-27 11:20:14 +08:00 |
|
hiyouga
|
019a932b2f
|
fix #5048
Former-commit-id: 71a6861667ae68c1fd6a69acf68e1359b858cf1b
|
2024-08-05 23:48:19 +08:00 |
|
hiyouga
|
3d32ca59e5
|
tiny fix
Former-commit-id: bf6a2f032c598f969708c1c3db4875d6239c41a9
|
2024-07-22 21:10:15 +08:00 |
|
hoshi-hiyouga
|
490c0dd3c0
|
fix #4917
Former-commit-id: e26919aafd8436489d065789c9c25d72c8d05a6d
|
2024-07-22 11:28:31 +08:00 |
|
hiyouga
|
8ce43766c6
|
fix up
Former-commit-id: 43a56cb331fae899ca35b0c312730d4ab79d0c42
|
2024-07-15 01:04:56 +08:00 |
|
hiyouga
|
71275c49f8
|
tiny fix
Former-commit-id: 220d7c1ce15e8013a900e59fe0c7937e38b5c3b5
|
2024-07-14 10:56:45 +08:00 |
|
hiyouga
|
9cd850c3b9
|
fix gemma2 attention
Former-commit-id: aeafc68e169ae0ea5939cc81cb0cf89f0ca044b6
|
2024-07-13 23:33:45 +08:00 |
|
hiyouga
|
ab24bde597
|
tiny fix
Former-commit-id: 9b211861eba19ae9fc360bc96eeb8ad67ba40c49
|
2024-07-04 03:47:05 +08:00 |
|
hiyouga
|
a0df8be4e8
|
fix packing for eager/sdpa attn
Former-commit-id: 735a033ceb7f2da6da71d138ea091d8a665411a9
|
2024-07-04 01:52:43 +08:00 |
|
hoshi-hiyouga
|
9dcdaee09c
|
Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention
Former-commit-id: ac382cc9fe4ec483658fd54f07f9a123788ce1b1
|
2024-07-04 01:18:54 +08:00 |
|
hiyouga
|
bd294e7cc3
|
update packing
Former-commit-id: f3d9c31efa0e64317bdd5b4ed6f78653cf3b5ba4
|
2024-07-04 01:10:55 +08:00 |
|
hoshi-hiyouga
|
d124ce001b
|
Update packing.py
Former-commit-id: 3cc11aa88839c5b99cfd83d9225770a33d0eb6fd
|
2024-07-03 23:36:01 +08:00 |
|
hiyouga
|
f849d03533
|
update func name
Former-commit-id: ed93ac0829fa656194fd32e1ac063843f475746f
|
2024-07-03 23:29:33 +08:00 |
|
hiyouga
|
7c08a4a82a
|
update arg name
Former-commit-id: 1509ed550b2060f946ce20e3c5a9e5c49e86e3ab
|
2024-07-03 23:23:24 +08:00 |
|
hiyouga
|
e8a1dc2785
|
tiny fix
Former-commit-id: d944020257f363f38e62de6279b337e399b7c65e
|
2024-07-03 02:31:50 +08:00 |
|
ancv
|
260f55ea47
|
move efficient_packing from data_args to model_args
Former-commit-id: 7b61659c707480bcf8c802c73e10d12ad5b9b965
|
2024-07-02 18:37:55 +07:00 |
|
hoshi-hiyouga
|
9174675ba9
|
Merge branch 'main' into main
Former-commit-id: 7be442f37d53a0c6324728fa1fa8e2c84d7f0fa5
|
2024-07-01 21:01:09 +08:00 |
|
hiyouga
|
711ffd0aaf
|
tiny fix
Former-commit-id: 19e43c3a9ed771e991cb273d394ab28fb923f868
|
2024-07-01 03:55:20 +08:00 |
|
hiyouga
|
35c65ddf8c
|
fix #4398 #4592
Former-commit-id: 8c92d268903c00392c8bd75a731daa1f107d6202
|
2024-06-30 21:28:51 +08:00 |
|
hiyouga
|
f7a4f3d9c0
|
loose gemma2 attention
Former-commit-id: a0b645017a2de3d58b6cbc71bd91ec96fc7a818b
|
2024-06-29 01:42:14 +08:00 |
|
hiyouga
|
6ce0b5891b
|
bf16 by default, gemma2 attns
Gemma2 finetuning cannot work until merging https://github.com/huggingface/transformers/pull/31674
Former-commit-id: da66c32c7be0adc28d2185b23e9f62d56acb961c
|
2024-06-28 06:00:26 +08:00 |
|
hiyouga
|
2381fb68a4
|
add quant checks
Former-commit-id: 15bb053e3549739b1a2134640a659b0f35df7de7
|
2024-06-27 01:12:25 +08:00 |
|
hiyouga
|
28c2c7fba5
|
support HQQ/EETQ #4113
Former-commit-id: b7cb51ddb394f04fe4646b2c297fc8d918c9979e
|
2024-06-27 00:29:42 +08:00 |
|
hiyouga
|
4041aa024b
|
improve autogptq integration
Former-commit-id: d68408c7b123b8ff92014db35cac0b24b414a6f4
|
2024-06-26 22:11:44 +08:00 |
|
hiyouga
|
3d1d42030f
|
fix #4432
Former-commit-id: 972a3b469c600bc6528aef3a49b6fdec63d65803
|
2024-06-25 02:34:04 +08:00 |
|
hiyouga
|
a27d4bb4be
|
fix #4410
Former-commit-id: f49adc4ab5eade21d7a9e029212f17688ee9b0cf
|
2024-06-24 22:34:31 +08:00 |
|
stceum
|
0bf750ade8
|
Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
Former-commit-id: 171289d8e4c111fdca2b100282b64c74a04a4726
|
2024-06-24 20:39:31 +08:00 |
|
ancv
|
4d345f7901
|
move configure_packing to llamafactory.model.patcher and fix constants
Former-commit-id: 9c5e972c9c81957f2e9e30bf284ef1c076de9fd0
|
2024-06-21 00:45:06 +07:00 |
|
hiyouga
|
fecde5c13f
|
tiny fix
Former-commit-id: 2d8d47f6126d68db1701ed18fc31310c6f14dd49
|
2024-06-20 22:56:05 +08:00 |
|
hiyouga
|
0680f18633
|
update patcher
Former-commit-id: afb365e515d615dd62f791622450debab60ce5cc
|
2024-06-19 21:27:00 +08:00 |
|
hiyouga
|
650bb45954
|
fix #4357
Former-commit-id: a6741bba8cebd16a6a3f97a2dc81057d0e27eb39
|
2024-06-18 22:42:45 +08:00 |
|
hiyouga
|
bb8c7e7048
|
fix #4326
Former-commit-id: 3c2c45812a720d92f7f5b15b9f03370fe6bf069e
|
2024-06-17 18:17:48 +08:00 |
|
ancv
|
84e1f06e45
|
update packing with sdpa and eager attention mode
Former-commit-id: 285636ba3a57a1038b2f2fd4cf909a1ca07708d4
|
2024-06-16 02:25:47 +07:00 |
|
hiyouga
|
0b571f84b4
|
support pissa
Former-commit-id: ef8e45f2eaf466c54e9a671512a2974575677b08
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
640372cb66
|
tiny fix
Former-commit-id: f7f440986b0ae3b38ea9f2da80789629d4f79ea1
|
2024-06-16 01:06:41 +08:00 |
|
ancv
|
c5e1dfb3a0
|
remove some unused params
Former-commit-id: fef8132c50505a5fb6a246bd024491bd31798a3c
|
2024-06-15 23:00:55 +07:00 |
|
hiyouga
|
acfae2e677
|
add license
Former-commit-id: 69cfc98d7c81756a5ab6bf962240e393e449fef0
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
bbeb3b10aa
|
add test cases
Former-commit-id: 731176ff34cdf0cbf6b41c40c69f4ceb54c2daf6
|
2024-06-15 04:05:54 +08:00 |
|
hiyouga
|
344d1192ac
|
clean code
Former-commit-id: f54cafd5c7f0383370d1a2f357834a61a97397ce
|
2024-06-13 01:58:16 +08:00 |
|
ancv
|
4463a5227a
|
implement efficient packing without cross-contamination attention
Former-commit-id: a64a5305c0da5ef092d4cc26faf829bb44de65d1
|
2024-06-12 11:56:01 +07:00 |
|
hiyouga
|
a7233181f2
|
fix deepspeed version
Former-commit-id: 938a69bb07d4de7d82928ff01c582032162c1480
|
2024-06-11 16:52:36 +08:00 |
|