77 Commits

Author SHA1 Message Date
hiyouga
e83cb17f97 support rank0 logger
Former-commit-id: c38aa29336f286266553da4909a7267d7ef21f37
2024-11-02 18:31:04 +08:00
hiyouga
3f7c874594 update tests
Former-commit-id: 93d3b8f43faf4a81b809d2f7d897e39bdb5475c3
2024-11-02 12:41:44 +08:00
hiyouga
584ce3a105 fix incorrect loss value for vlms
Former-commit-id: 30567a1487727473950104718e626ff660f10cbb
2024-10-30 08:56:46 +00:00
hoshi-hiyouga
2179b91acb Update visual.py
Former-commit-id: 0baa7735f64cbef9bd90e1db485c120b4c1c88bd
2024-10-29 22:10:29 +08:00
Kingsley
3053a806e9 Merge branch 'hiyouga:main' into pixtral-patch
Former-commit-id: 67f59579d79e97689a4b3cba7101a423c30dab2b
2024-10-29 21:01:25 +08:00
hiyouga
0d8aa6e6ef use pre-commit
Former-commit-id: 21db8ed2f4a0eba203754a92ce0741538e8ee709
2024-10-29 09:07:46 +00:00
hiyouga
163cf2ba5c update requires
Former-commit-id: 77666bd2278a3cfe5b567f4fe285b0f93871d166
2024-10-29 16:10:07 +08:00
Kingsley
5523a6fd2c Merge branch 'hiyouga:main' into pixtral-patch
Former-commit-id: 93a441a6b746e9a933dad8c45553fb5b68bf2b34
2024-10-08 21:04:08 +08:00
hiyouga
4464a6ff5b tiny fix
Former-commit-id: 451d271718a8026056d0f7d7b8ab333391d24ad4
2024-10-08 17:48:56 +08:00
Kingsley
f3ac97a749 Merge branch 'hiyouga:main' into pixtral-patch
Former-commit-id: e53f47c0b3de491d4d9b31c995f9cea100f98896
2024-10-01 00:52:31 +08:00
hiyouga
4df090ff48 fix #5542
Former-commit-id: fe7ffccdb9a45b31e20ab7e88282a75b45504a97
2024-09-30 23:28:55 +08:00
Kingsley
6729ed2c7e sync with former
Former-commit-id: 9ddb84052e3cc72e21a92b8103caa179a35859c4
2024-09-30 20:27:05 +08:00
Kingsley
94ce8f561f fix some errors due to inconsistency of model cards
Former-commit-id: 2166b9bc6ba35760ff85b63620af9fa0213a4c78
2024-09-30 19:58:34 +08:00
Zhangchi Feng
69e801d456 Merge branch 'main' into pixtral-patch
Former-commit-id: 26f45829b453ff1a0c76f6c1ddaba893d48f821e
2024-09-30 12:37:03 +08:00
BUAADreamer
87ab7fc01c fix constants
Former-commit-id: 485fc047169afd027ee65d05e3c5c08b371b6c4d
2024-09-29 22:00:01 +08:00
BUAADreamer
ddec40ac16 fix style
Former-commit-id: 23916d57c1d22653739dbf913d3e427fcb978a15
2024-09-29 21:39:37 +08:00
Zhangchi Feng
8e164f3594 Merge branch 'main' into main
Former-commit-id: 83abf86657ea38968e953e1dc4a2e8c34471b06a
2024-09-29 21:32:54 +08:00
BUAADreamer
1b71afb277 add more llava-next series template
Former-commit-id: 65a8923f5a7d20d34fabf4f81746fe9b7bc8c84a
2024-09-29 21:29:29 +08:00
BUAADreamer
534dc58363 fix readme
Former-commit-id: bf0bcbc5ec4ca0182ade283ea9f37012f224f519
2024-09-29 20:45:02 +08:00
BUAADreamer
1e2ea34419 fix some
Former-commit-id: d5c69400cd27cdf0667290f3863a3aab47143eb3
2024-09-29 17:55:40 +08:00
hoshi-hiyouga
5df765e376 Update attention.py
Former-commit-id: fe7057a8a3eb111cdaf8349b6ac077d898bf4935
2024-09-29 10:47:41 +08:00
Kingsley
a2452d0b1c Tiny fix
Former-commit-id: 8f13a3627d06a6f0a9b4e35443a415958d9ad1c9
2024-09-29 00:00:23 +08:00
Amirreza A
6ae0e27c8b made a small change to a warning about fa2 for gemma2 models.
Former-commit-id: 94ee105526d817e59bfd91f7bd4161d7cb2fd216
2024-09-28 19:03:36 +03:30
Kingsley
fd79cf8551 tiny fix
Former-commit-id: 3d3cc6705d4575f7f20bf4da2b7dab60b337006b
2024-09-28 22:50:53 +08:00
BUAADreamer
0e33902f61 fix some
Former-commit-id: 7f3f81009e3728fe25b9c063491ee71acc498c35
2024-09-28 01:15:33 +08:00
BUAADreamer
5aa1e847d9 add llava-next/llava-next-video/video-llava
Former-commit-id: 6642cd501d55a1657678428ef2aa0c9b99b7e83f
2024-09-28 00:57:03 +08:00
Zhangchi Feng
c576b7ca32 Merge branch 'hiyouga:main' into main
Former-commit-id: 900631755b28692bb150a8cf39354af4e2e986c9
2024-09-27 18:14:39 +08:00
Billy Cao
38e955d4a9 Add qwen_vl to liger kernel supported list
Former-commit-id: 7a2958a44f3b99cddb91f7b67fa0dd1c26c1a991
2024-09-14 19:28:20 +08:00
BUAADreamer
16c7326bc5 try to past test
Former-commit-id: 7b4ba0efb658422fd29dca63bac1e9cee8e82af8
2024-09-10 13:12:51 +08:00
BUAADreamer
f00f4ae9b6 support llava-next(video)
Former-commit-id: 31259e7e0caa9ff6449b4abcee0554e211167178
2024-09-10 12:31:53 +08:00
hiyouga
0229263fbe tiny fix
Former-commit-id: c9b3870adb60a2aca8cfd82c1a8b8044319bacbc
2024-09-08 23:18:08 +08:00
hiyouga
f6f58ebef0 fix test case
Former-commit-id: b332908ab4aad392e39f0b8661d100f096d8a6ec
2024-09-08 01:50:51 +08:00
hiyouga
945841503e add test case
Former-commit-id: 52a06efaf8af26d16137ba9095f1fd81e8f61983
2024-09-08 01:40:49 +08:00
hiyouga
0daee7cb39 support activation offloading via unsloth gc
Former-commit-id: fb72a3adb0916232cc9ac9f0c725c02d07b9354c
2024-09-08 01:22:19 +08:00
hiyouga
995491594d tiny fix
Former-commit-id: 76f2e5950483c669a15a961f0554442b6eb5c4a6
2024-09-05 23:41:16 +08:00
hiyouga
6e98872622 fix #5324
Former-commit-id: a61c8c4890962f3847b19eff31b170cd7f54316c
2024-09-02 23:56:21 +08:00
hiyouga
cb776752f6 fix mixed mm inputs and rlhf-v
Former-commit-id: 9967ccb3aef3ca557ad6eafb78c6c99866857008
2024-09-01 20:52:47 +08:00
hiyouga
f31e7e0dfc remove visual_inputs, fix qlora
Former-commit-id: a025c3df61db154bef13033518903bbf846f4fc8
2024-08-31 00:24:51 +08:00
hiyouga
a83756b5e9 refactor mm training
Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a
2024-08-30 02:14:31 +08:00
hoshi-hiyouga
98b0c7530c Merge pull request #5290 from simonJJJ/qwen2_vl
support qwen2-vl

Former-commit-id: 727e1848401d306274fb60ba78f66fed577b7b55
2024-08-30 02:10:36 +08:00
hiyouga
0e4ee9d9a3 update liger kernel
Former-commit-id: a7dd7d325e68c92c7470c1e9ef83a7c8abcbc616
2024-08-29 20:46:08 +08:00
simonJJJ
8a09b1e732 initial-commit
Former-commit-id: aeb85f200bd824748008dae6047c2607dfcdf174
2024-08-28 16:51:35 +08:00
hiyouga
c765292093 support liger kernel
Former-commit-id: 72bc8f01111ad69b92a647b54b4af988515d9c34
2024-08-27 11:20:14 +08:00
hiyouga
20013e130b fix #5048
Former-commit-id: b7ca6c8dc14f689d0df16684a6121cc0ec24f8ba
2024-08-05 23:48:19 +08:00
hiyouga
14bc7b0551 fix up
Former-commit-id: 29ebcd75d55f70f2891632eba187b643cc3a9e51
2024-07-15 01:04:56 +08:00
hiyouga
12e0e5d0d7 tiny fix
Former-commit-id: d3c01552e0f978f150902175f096f6e3bfb64363
2024-07-14 10:56:45 +08:00
hiyouga
0b26011181 fix gemma2 attention
Former-commit-id: 2f6af73da28c4f8321b625fd09ddec8bd4977b08
2024-07-13 23:33:45 +08:00
hiyouga
d7657d772d tiny fix
Former-commit-id: 0c699de39de06eac96af67e8dd4fc4c53335b17e
2024-07-04 03:47:05 +08:00
hiyouga
7b3c1f29ff fix packing for eager/sdpa attn
Former-commit-id: 6fd6aa4530f81a2ed306eeb2a5167607288b62c6
2024-07-04 01:52:43 +08:00
hoshi-hiyouga
a38ff842d0 Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention

Former-commit-id: 87d9b2d00513c163335d3f2e2bb3cb3299cecdaa
2024-07-04 01:18:54 +08:00