60 Commits

Author SHA1 Message Date
hoshi-hiyouga
a38ff842d0 Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention

Former-commit-id: 87d9b2d00513c163335d3f2e2bb3cb3299cecdaa
2024-07-04 01:18:54 +08:00
hiyouga
bfdaadcc40 update packing
Former-commit-id: cce7083024bed4c7429ddc8288d1c9190fde29f5
2024-07-04 01:10:55 +08:00
hiyouga
e671ed520b update arg name
Former-commit-id: 8a6a7b9c8a876da9c16e5ada7df461eb8cabee21
2024-07-03 23:23:24 +08:00
hiyouga
cc31014002 improve rlhf
Former-commit-id: c47ab6c07287fb260ea49b8b7af46bdd416f88f7
2024-07-02 22:23:08 +08:00
hzhaoy
28e787116b add TeleChat-1B
Former-commit-id: 57b7c00430bcfc83afd11547ceead041e8edfd8d
2024-07-02 17:49:04 +08:00
hoshi-hiyouga
2452f57cd7 Merge branch 'main' into main
Former-commit-id: e8e6af26514272e29a50649b38182beb4db4ebfa
2024-07-01 21:01:09 +08:00
hiyouga
bbc37b2880 fix #4398 #4592
Former-commit-id: d74244d56858d837044e5c9cea57a1b3c2ca0214
2024-06-30 21:28:51 +08:00
hiyouga
d3b7c489f2 add Gemma2 models
Former-commit-id: 6f63050e1b61742d5f7e48bdc62c46748031d7cb
2024-06-28 01:26:50 +08:00
hiyouga
835f0578c2 refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0fb9624807d809832f097301982d192
2024-06-28 01:04:24 +08:00
hiyouga
d2d9fa4abb support HQQ/EETQ #4113
Former-commit-id: ad144c2265cdee0d23014dbb3d017ea257cb26ed
2024-06-27 00:29:42 +08:00
hiyouga
7be502c5c5 update readme
Former-commit-id: e507e60638b2e8c66f24805b3b28f6b9f98f5924
2024-06-24 18:22:12 +08:00
ancv
5319447aa5 move configure_packing to llamafactory.model.patcher and fix constants
Former-commit-id: 770f75dc8363bfa284a72159ff8ad25ec9abe4e0
2024-06-21 00:45:06 +07:00
hiyouga
80e9f8e000 set dev version
Former-commit-id: 42e69a3c634ccae792bd8ffb4642061ee475e836
2024-06-19 21:08:16 +08:00
hiyouga
9c1b04cd11 release v0.8.2
Former-commit-id: 71327ba85a3a1bb2d2d20c86951c6c7c0ba98829
2024-06-19 20:42:09 +08:00
hiyouga
e3bf22f61b add deepseek coder v2 #4346
Former-commit-id: a233fbc258d38c62d78b9d1eaf034720361795e6
2024-06-18 22:53:54 +08:00
ancv
988231026a update packing with sdpa and eager attention mode
Former-commit-id: 238f5c3d99809c6ae2571b59bdce8d8ea3c700b9
2024-06-16 02:25:47 +07:00
hiyouga
c0c6b8075a tiny fix
Former-commit-id: 38b6b0f52edeb8ba45aa03b415b3c0c1b0e0c1e4
2024-06-16 01:06:41 +08:00
hiyouga
8053929b20 add tests
Former-commit-id: 1b834f50be64ae9b5123da0e6f528cfbd5167477
2024-06-15 19:51:20 +08:00
hiyouga
f0d6e63f55 add minicpm #4227
Former-commit-id: 572d8bbfdd73c1a00b432f0d0411f46fad6aa1a6
2024-06-15 17:58:52 +08:00
hiyouga
2946153cea add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
2024-06-15 17:54:33 +08:00
hiyouga
833aa324c2 clean code
Former-commit-id: 2ed8270112755971e3f2dfd2f29c5939b077330a
2024-06-13 01:58:16 +08:00
hiyouga
d6632fefc9 set dev version
Former-commit-id: 91e62a098fd997d0d1d12baef64d089aabc01fba
2024-06-11 00:50:53 +08:00
hiyouga
75e1bbf128 release v0.8.1
Former-commit-id: 2b6ebd6b51133cf114d6f0e8605ad2bb26aa6d65
2024-06-11 00:44:26 +08:00
hiyouga
1a261add61 fix llamafactory-cli env
Former-commit-id: 972ec9c668de1a9b6d872187dbc0c1d94f6fec6b
2024-06-08 07:15:45 +08:00
hiyouga
de3400a521 set dev version
Former-commit-id: 3ac11e77cccf686e0da499bd152997133b49a265
2024-06-08 06:46:09 +08:00
hiyouga
ce40d12692 release v0.8.0
Former-commit-id: 5aa4ce47567146cd97c61623018153b41d7c1278
2024-06-08 05:20:54 +08:00
hiyouga
a8318723a4 add resume args in webui
Former-commit-id: 06e5d136a4916413d1c116e341ba7d5136d7748a
2024-06-08 00:22:16 +08:00
hiyouga
d3196318be fix #4120
Former-commit-id: f9e818d79cf686cb34789327add7ed1f749966c6
2024-06-07 04:18:05 +08:00
hiyouga
8a0263551d add qwen2 models
Former-commit-id: 8e95648850fdd5075724359ffdb22beb48b75952
2024-06-07 00:22:57 +08:00
hiyouga
6cbc66a602 fix torch gc
Former-commit-id: 451b6693c0cb86cc9ac03d1a9389cf1fd2b918ec
2024-06-06 20:30:25 +08:00
hiyouga
cceff9f520 lora modules: all by default
Former-commit-id: cae47379079ff811aa385c297481a27020a8da6b
2024-06-06 03:53:28 +08:00
hiyouga
679810a3d2 add codestral 22B
Former-commit-id: c23cc63d3d3c4fd8edd6c3b3ca1a2a32ec328d7d
2024-06-06 03:42:50 +08:00
hiyouga
8f25af89b6 lint
Former-commit-id: 7daf8366db0e161d46993fd87cf983a27a0ce2a3
2024-06-06 03:33:44 +08:00
hoshi-hiyouga
229794a148 Merge pull request #4066 from injet-zhou/main
add throughput entry to training log

Former-commit-id: f2580ad403cd0ae91aa0954c0a15363c46452438
2024-06-06 03:32:04 +08:00
hiyouga
00b3fb4d14 update train hparams
Former-commit-id: dc4a00dd63769dc02d898c8bad2c158e4e5c0447
2024-06-06 01:49:20 +08:00
hiyouga
0398338a0f add llamafactory-cli env
Former-commit-id: d4908d57085bbcfcd29e0a8d4ee6425318ee4285
2024-06-06 01:28:14 +08:00
hiyouga
a16786d8ba fix #4090
Former-commit-id: 67fe822324a9f830175e44f89acdd9d759b38852
2024-06-06 00:50:32 +08:00
hiyouga
94c37490d1 support glm-4
Former-commit-id: f48f5e646e2da9e02333d027033141b0e75dfcf8
2024-06-05 15:16:38 +08:00
faddddeout
a2931b813b add throughput entry to log
Former-commit-id: b2f04595423b8e84b3763d169e402a0cd34f3175
2024-06-04 11:04:29 +00:00
hiyouga
af7748139a bump versions
transformers 4.37.2->4.41.2
datasets 2.14.3->2.16.0
accelerate 0.27.2->0.30.1
peft 0.10.0->0.11.1
trl 0.8.1->0.8.6

Former-commit-id: 876bc92865605be872bc811a56a1d1e05490ec8a
2024-06-03 18:29:38 +08:00
hiyouga
820404946e better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui

Former-commit-id: 80708717329b4552920dd4ce8cebc683e65d54c5
2024-05-29 23:55:38 +08:00
hiyouga
a71a6a05c3 update readme
Former-commit-id: 89ca832740731dfb121175aa5c16b13bd4944011
2024-05-29 18:39:11 +08:00
hzhaoy
ce1be3da4b add TeleChat-12B/TeleChat-12B-v2 models
Former-commit-id: 0dd632fe9e5bbf08605d4b9c6887208b7a127317
2024-05-29 15:00:37 +08:00
hiyouga
3ea8f5e6b9 support DDP in webui
Former-commit-id: 7c016b22aa9208ec14f00a9bdb51f69aebb02af5
2024-05-28 19:24:22 +08:00
hiyouga
0706dbf7e6 tiny fix
Former-commit-id: c1fdf81df6ade5da7be4eb66b715f0efd171d5aa
2024-05-27 20:54:26 +08:00
hoshi-hiyouga
ad3ca3f556 Merge pull request #3921 from gusye1234/main
Add openchat-3.6-8B support

Former-commit-id: 87ea0a8bcd8d76a9e916cc8da6905bc805bb18aa
2024-05-27 20:52:37 +08:00
Jianbai Ye
d2c1df7f3d add openchat-3.6-8B support
Former-commit-id: cff815391fd15f30647e8694e08c47a514fd6eb2
2024-05-27 20:42:08 +08:00
hiyouga
fc5a6b5c4e support Aya23
Former-commit-id: e626e264460d12b282099bfbb8e6679c31e85fc0
2024-05-27 20:23:24 +08:00
hiyouga
51a1097c64 add phi-3 7b/14b, mistral v0.3 models
Former-commit-id: efa4b196ca8053881bb9d15cfb571204bcb0bbda
2024-05-27 18:20:16 +08:00
hiyouga
df33548b39 update readme
Former-commit-id: 5581cb2e4e59f3f8109e2acd4611789f9e50bfca
2024-05-27 18:14:02 +08:00