161 Commits

Author SHA1 Message Date
Kingsley
fd79cf8551 tiny fix
Former-commit-id: 3d3cc6705d4575f7f20bf4da2b7dab60b337006b
2024-09-28 22:50:53 +08:00
Kingsley
66e473d519 remove some unnecessary if conditions
Former-commit-id: de06e2678e2168586614242f65939c5772e78774
2024-09-28 02:14:06 +08:00
BUAADreamer
5aa1e847d9 add llava-next/llava-next-video/video-llava
Former-commit-id: 6642cd501d55a1657678428ef2aa0c9b99b7e83f
2024-09-28 00:57:03 +08:00
Zhangchi Feng
c576b7ca32 Merge branch 'hiyouga:main' into main
Former-commit-id: 900631755b28692bb150a8cf39354af4e2e986c9
2024-09-27 18:14:39 +08:00
Kingsley
35e44143fd Merge branches 'pixtral-patch' and 'pixtral-patch' of https://github.com/Kuangdd01/LLaMA-Factory-X into pixtral-patch
Former-commit-id: 5e64b0c37165a50296036a6e09e09193fb2ad644
2024-09-26 12:18:25 +08:00
Kingsley
c436d6ea0b add pixtral template
Former-commit-id: 86f5a9be548ef02ce334bba35a529c70e8b3ad7f
2024-09-26 12:11:58 +08:00
hoshi-hiyouga
a73988141b add modelscope models
Former-commit-id: 8e5d12c2c4b687dc0d2c5bc25a916ba9f6ce67c9
2024-09-26 11:22:48 +08:00
marko1616
b70da07977 Chore: Support llama3.2.
Former-commit-id: 885a0b77ab83bf001d7175e2ba440f7928fa4731
2024-09-25 16:08:44 -04:00
hoshi-hiyouga
56058e2e84 add qwen2.5 models
Former-commit-id: 92ef62f5025475606e533947b7d9c3cae9bfcdbf
2024-09-19 02:07:54 +08:00
BUAADreamer
f00f4ae9b6 support llava-next(video)
Former-commit-id: 31259e7e0caa9ff6449b4abcee0554e211167178
2024-09-10 12:31:53 +08:00
hiyouga
3aefdad4ec release v0.9.0 (real)
Former-commit-id: 90d6df622252c6fad985f68b97771c979357e2fc
2024-09-09 01:00:25 +08:00
hiyouga
561ae4d1af fix constants
Former-commit-id: 653fe70acbe44853fa0ad073a9b8391d75ef6c2a
2024-09-08 23:52:30 +08:00
hiyouga
fb9280a0a7 release v0.9.0
Former-commit-id: 54b5c4b8195d23bd9dcc1921af9910d5bdd181fd
2024-09-08 23:43:35 +08:00
hoshi-hiyouga
de277a8ab8 Merge pull request #5372 from LDLINGLINGLING/main
增加了对minicpm3.0的适配'

Former-commit-id: 12743562639ccc6eb0caf170e7123d9844e2b4a6
2024-09-05 21:35:42 +08:00
liudan
1797fe50a4 根据代码规范修改了代码
Former-commit-id: 3d3fbaaff98da327e10bdebb4aedbdf1ec9565e8
2024-09-05 20:17:55 +08:00
hiyouga
4fccc65579 support Yi-Coder models
Former-commit-id: 359ef8bb0ebb8ccf9651ac2b737c5a705dab6bad
2024-09-05 03:12:24 +08:00
hiyouga
9df7a26e6b video datasets
Former-commit-id: 8cafc7b055a854f483ad1c67f3d487ffd34b5f89
2024-09-05 02:04:17 +08:00
liudan
09cff03026 增加了对minicpm3.0的适配'
Former-commit-id: d7ba97be484bf781d6fe80252ea29eb505b261bb
2024-09-04 23:10:05 +08:00
hiyouga
a83756b5e9 refactor mm training
Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a
2024-08-30 02:14:31 +08:00
hiyouga
a8add5c04b add qwen2 math models
Former-commit-id: dc770efb14bd6e18421511912fbb959a3cf9f78d
2024-08-09 20:20:35 +08:00
hiyouga
20013e130b fix #5048
Former-commit-id: b7ca6c8dc14f689d0df16684a6121cc0ec24f8ba
2024-08-05 23:48:19 +08:00
codingma
7125b6cf70 support gemma-2-2b
Former-commit-id: dc09d454f285b8584d9017349a9cee3b44eadb72
2024-08-01 13:45:48 +08:00
hiyouga
91e54d458f add mistral nemo model
Former-commit-id: 1550fe7331370ad39e8ed69c1b060ead902a77e4
2024-07-24 16:25:53 +08:00
hiyouga
e0875f82b3 add llama3.1
Former-commit-id: 26533c0604ef765170f93986bc06f3066c5e28ee
2024-07-24 16:20:11 +08:00
hiyouga
7fcffb860d add codegeex4, internlm2.5
Former-commit-id: 53b1002fb74123095e7466c75b941a31a7cfba4d
2024-07-06 16:16:47 +08:00
hiyouga
7b3c1f29ff fix packing for eager/sdpa attn
Former-commit-id: 6fd6aa4530f81a2ed306eeb2a5167607288b62c6
2024-07-04 01:52:43 +08:00
hoshi-hiyouga
a38ff842d0 Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention

Former-commit-id: 87d9b2d00513c163335d3f2e2bb3cb3299cecdaa
2024-07-04 01:18:54 +08:00
hiyouga
bfdaadcc40 update packing
Former-commit-id: cce7083024bed4c7429ddc8288d1c9190fde29f5
2024-07-04 01:10:55 +08:00
hiyouga
e671ed520b update arg name
Former-commit-id: 8a6a7b9c8a876da9c16e5ada7df461eb8cabee21
2024-07-03 23:23:24 +08:00
hiyouga
cc31014002 improve rlhf
Former-commit-id: c47ab6c07287fb260ea49b8b7af46bdd416f88f7
2024-07-02 22:23:08 +08:00
hzhaoy
28e787116b add TeleChat-1B
Former-commit-id: 57b7c00430bcfc83afd11547ceead041e8edfd8d
2024-07-02 17:49:04 +08:00
hoshi-hiyouga
2452f57cd7 Merge branch 'main' into main
Former-commit-id: e8e6af26514272e29a50649b38182beb4db4ebfa
2024-07-01 21:01:09 +08:00
hiyouga
d3b7c489f2 add Gemma2 models
Former-commit-id: 6f63050e1b61742d5f7e48bdc62c46748031d7cb
2024-06-28 01:26:50 +08:00
hiyouga
7be502c5c5 update readme
Former-commit-id: e507e60638b2e8c66f24805b3b28f6b9f98f5924
2024-06-24 18:22:12 +08:00
ancv
5319447aa5 move configure_packing to llamafactory.model.patcher and fix constants
Former-commit-id: 770f75dc8363bfa284a72159ff8ad25ec9abe4e0
2024-06-21 00:45:06 +07:00
hiyouga
e3bf22f61b add deepseek coder v2 #4346
Former-commit-id: a233fbc258d38c62d78b9d1eaf034720361795e6
2024-06-18 22:53:54 +08:00
ancv
988231026a update packing with sdpa and eager attention mode
Former-commit-id: 238f5c3d99809c6ae2571b59bdce8d8ea3c700b9
2024-06-16 02:25:47 +07:00
hiyouga
f0d6e63f55 add minicpm #4227
Former-commit-id: 572d8bbfdd73c1a00b432f0d0411f46fad6aa1a6
2024-06-15 17:58:52 +08:00
hiyouga
2946153cea add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
2024-06-15 17:54:33 +08:00
hiyouga
a8318723a4 add resume args in webui
Former-commit-id: 06e5d136a4916413d1c116e341ba7d5136d7748a
2024-06-08 00:22:16 +08:00
hiyouga
8a0263551d add qwen2 models
Former-commit-id: 8e95648850fdd5075724359ffdb22beb48b75952
2024-06-07 00:22:57 +08:00
hiyouga
cceff9f520 lora modules: all by default
Former-commit-id: cae47379079ff811aa385c297481a27020a8da6b
2024-06-06 03:53:28 +08:00
hiyouga
679810a3d2 add codestral 22B
Former-commit-id: c23cc63d3d3c4fd8edd6c3b3ca1a2a32ec328d7d
2024-06-06 03:42:50 +08:00
hiyouga
94c37490d1 support glm-4
Former-commit-id: f48f5e646e2da9e02333d027033141b0e75dfcf8
2024-06-05 15:16:38 +08:00
hiyouga
820404946e better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui


Former-commit-id: 80708717329b4552920dd4ce8cebc683e65d54c5
2024-05-29 23:55:38 +08:00
hiyouga
a71a6a05c3 update readme
Former-commit-id: 89ca832740731dfb121175aa5c16b13bd4944011
2024-05-29 18:39:11 +08:00
hzhaoy
ce1be3da4b add TeleChat-12B/TeleChat-12B-v2 models
Former-commit-id: 0dd632fe9e5bbf08605d4b9c6887208b7a127317
2024-05-29 15:00:37 +08:00
hiyouga
0706dbf7e6 tiny fix
Former-commit-id: c1fdf81df6ade5da7be4eb66b715f0efd171d5aa
2024-05-27 20:54:26 +08:00
hoshi-hiyouga
ad3ca3f556 Merge pull request #3921 from gusye1234/main
Add openchat-3.6-8B support

Former-commit-id: 87ea0a8bcd8d76a9e916cc8da6905bc805bb18aa
2024-05-27 20:52:37 +08:00
Jianbai Ye
d2c1df7f3d add openchat-3.6-8B support
Former-commit-id: cff815391fd15f30647e8694e08c47a514fd6eb2
2024-05-27 20:42:08 +08:00