hiyouga | daebca2368 | tiny fix; Former-commit-id: c8b4c7fee5398654683b713ad5c03b5daf13218a | 2024-08-20 00:10:52 +08:00
hoshi-hiyouga | 5582674f06 | Merge pull request #5188 from Zxilly/main; fix: report correct device count for intel xpu; Former-commit-id: d39f4a62d3c5a3bbbf39d1eb4b92439acedae18e | 2024-08-19 23:51:39 +08:00
Ricardo | a9312387bc | _is_bf16_available judgment supports npu; Former-commit-id: 384ab8db84eef7d1f6a7643c15c565a7d4906a5c | 2024-08-16 02:58:22 +00:00
Zxilly | 41a8387195 | fix: report correct device count for intel xpu; Former-commit-id: dc36fcc3de721bdd28edd4eed36677e59a7614be | 2024-08-15 08:30:43 +00:00
hiyouga | a8add5c04b | add qwen2 math models; Former-commit-id: dc770efb14bd6e18421511912fbb959a3cf9f78d | 2024-08-09 20:20:35 +08:00
hiyouga | 20013e130b | fix #5048; Former-commit-id: b7ca6c8dc14f689d0df16684a6121cc0ec24f8ba | 2024-08-05 23:48:19 +08:00
codingma | 7125b6cf70 | support gemma-2-2b; Former-commit-id: dc09d454f285b8584d9017349a9cee3b44eadb72 | 2024-08-01 13:45:48 +08:00
hiyouga | 91e54d458f | add mistral nemo model; Former-commit-id: 1550fe7331370ad39e8ed69c1b060ead902a77e4 | 2024-07-24 16:25:53 +08:00
hiyouga | e0875f82b3 | add llama3.1; Former-commit-id: 26533c0604ef765170f93986bc06f3066c5e28ee | 2024-07-24 16:20:11 +08:00
hiyouga | 726e7046db | set dev version; Former-commit-id: 88c7fc159999511e6e19fff3d37147a6a7064335 | 2024-07-19 02:01:46 +08:00
hiyouga | f5cfea56bd | release v0.8.3; Former-commit-id: bbd5a644230d633f507c72929e8819c07ae38bba | 2024-07-19 01:21:18 +08:00
hiyouga | e90fae61f4 | support batch_eval_metrics, fix #4826; Former-commit-id: d774b94f124923829b2eae428e25199d503ebfcb | 2024-07-17 00:33:00 +08:00
hoshi-hiyouga | 7483e187c6 | Update packages.py; Former-commit-id: f84b007ebbb9fa63f797b4bd1c487372877bbc65 | 2024-07-07 15:48:29 +08:00
Lian Junhong | 7ca84e0a09 | chore: Update vllm_engine.py to support vllm version >= 0.5.1; Former-commit-id: 322663bf90ce7b99ca5b0b43ff9dbd95eb36ff6b | 2024-07-07 15:08:12 +08:00
hiyouga | 7fcffb860d | add codegeex4, internlm2.5; Former-commit-id: 53b1002fb74123095e7466c75b941a31a7cfba4d | 2024-07-06 16:16:47 +08:00
hiyouga | 7b3c1f29ff | fix packing for eager/sdpa attn; Former-commit-id: 6fd6aa4530f81a2ed306eeb2a5167607288b62c6 | 2024-07-04 01:52:43 +08:00
hoshi-hiyouga | a38ff842d0 | Merge pull request #4224 from chuan298/main; Implement efficient packing without cross-contamination attention; Former-commit-id: 87d9b2d00513c163335d3f2e2bb3cb3299cecdaa | 2024-07-04 01:18:54 +08:00
hiyouga | bfdaadcc40 | update packing; Former-commit-id: cce7083024bed4c7429ddc8288d1c9190fde29f5 | 2024-07-04 01:10:55 +08:00
hiyouga | e671ed520b | update arg name; Former-commit-id: 8a6a7b9c8a876da9c16e5ada7df461eb8cabee21 | 2024-07-03 23:23:24 +08:00
hiyouga | cc31014002 | improve rlhf; Former-commit-id: c47ab6c07287fb260ea49b8b7af46bdd416f88f7 | 2024-07-02 22:23:08 +08:00
hzhaoy | 28e787116b | add TeleChat-1B; Former-commit-id: 57b7c00430bcfc83afd11547ceead041e8edfd8d | 2024-07-02 17:49:04 +08:00
hoshi-hiyouga | 2452f57cd7 | Merge branch 'main' into main; Former-commit-id: e8e6af26514272e29a50649b38182beb4db4ebfa | 2024-07-01 21:01:09 +08:00
hiyouga | bbc37b2880 | fix #4398 #4592; Former-commit-id: d74244d56858d837044e5c9cea57a1b3c2ca0214 | 2024-06-30 21:28:51 +08:00
hiyouga | d3b7c489f2 | add Gemma2 models; Former-commit-id: 6f63050e1b61742d5f7e48bdc62c46748031d7cb | 2024-06-28 01:26:50 +08:00
hiyouga | 835f0578c2 | refactor pissa, improve llamaboard; Former-commit-id: 8baf3b22b0fb9624807d809832f097301982d192 | 2024-06-28 01:04:24 +08:00
hiyouga | d2d9fa4abb | support HQQ/EETQ #4113; Former-commit-id: ad144c2265cdee0d23014dbb3d017ea257cb26ed | 2024-06-27 00:29:42 +08:00
hiyouga | 7be502c5c5 | update readme; Former-commit-id: e507e60638b2e8c66f24805b3b28f6b9f98f5924 | 2024-06-24 18:22:12 +08:00
ancv | 5319447aa5 | move configure_packing to llamafactory.model.patcher and fix constants; Former-commit-id: 770f75dc8363bfa284a72159ff8ad25ec9abe4e0 | 2024-06-21 00:45:06 +07:00
hiyouga | 80e9f8e000 | set dev version; Former-commit-id: 42e69a3c634ccae792bd8ffb4642061ee475e836 | 2024-06-19 21:08:16 +08:00
hiyouga | 9c1b04cd11 | release v0.8.2; Former-commit-id: 71327ba85a3a1bb2d2d20c86951c6c7c0ba98829 | 2024-06-19 20:42:09 +08:00
hiyouga | e3bf22f61b | add deepseek coder v2 #4346; Former-commit-id: a233fbc258d38c62d78b9d1eaf034720361795e6 | 2024-06-18 22:53:54 +08:00
ancv | 988231026a | update packing with sdpa and eager attention mode; Former-commit-id: 238f5c3d99809c6ae2571b59bdce8d8ea3c700b9 | 2024-06-16 02:25:47 +07:00
hiyouga | c0c6b8075a | tiny fix; Former-commit-id: 38b6b0f52edeb8ba45aa03b415b3c0c1b0e0c1e4 | 2024-06-16 01:06:41 +08:00
hiyouga | 8053929b20 | add tests; Former-commit-id: 1b834f50be64ae9b5123da0e6f528cfbd5167477 | 2024-06-15 19:51:20 +08:00
hiyouga | f0d6e63f55 | add minicpm #4227; Former-commit-id: 572d8bbfdd73c1a00b432f0d0411f46fad6aa1a6 | 2024-06-15 17:58:52 +08:00
hiyouga | 2946153cea | add license; Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56 | 2024-06-15 17:54:33 +08:00
hiyouga | 833aa324c2 | clean code; Former-commit-id: 2ed8270112755971e3f2dfd2f29c5939b077330a | 2024-06-13 01:58:16 +08:00
hiyouga | d6632fefc9 | set dev version; Former-commit-id: 91e62a098fd997d0d1d12baef64d089aabc01fba | 2024-06-11 00:50:53 +08:00
hiyouga | 75e1bbf128 | release v0.8.1; Former-commit-id: 2b6ebd6b51133cf114d6f0e8605ad2bb26aa6d65 | 2024-06-11 00:44:26 +08:00
hiyouga | 1a261add61 | fix llamafactory-cli env; Former-commit-id: 972ec9c668de1a9b6d872187dbc0c1d94f6fec6b | 2024-06-08 07:15:45 +08:00
hiyouga | de3400a521 | set dev version; Former-commit-id: 3ac11e77cccf686e0da499bd152997133b49a265 | 2024-06-08 06:46:09 +08:00
hiyouga | ce40d12692 | release v0.8.0; Former-commit-id: 5aa4ce47567146cd97c61623018153b41d7c1278 | 2024-06-08 05:20:54 +08:00
hiyouga | a8318723a4 | add resume args in webui; Former-commit-id: 06e5d136a4916413d1c116e341ba7d5136d7748a | 2024-06-08 00:22:16 +08:00
hiyouga | d3196318be | fix #4120; Former-commit-id: f9e818d79cf686cb34789327add7ed1f749966c6 | 2024-06-07 04:18:05 +08:00
hiyouga | 8a0263551d | add qwen2 models; Former-commit-id: 8e95648850fdd5075724359ffdb22beb48b75952 | 2024-06-07 00:22:57 +08:00
hiyouga | 6cbc66a602 | fix torch gc; Former-commit-id: 451b6693c0cb86cc9ac03d1a9389cf1fd2b918ec | 2024-06-06 20:30:25 +08:00
hiyouga | cceff9f520 | lora modules: all by default; Former-commit-id: cae47379079ff811aa385c297481a27020a8da6b | 2024-06-06 03:53:28 +08:00
hiyouga | 679810a3d2 | add codestral 22B; Former-commit-id: c23cc63d3d3c4fd8edd6c3b3ca1a2a32ec328d7d | 2024-06-06 03:42:50 +08:00
hiyouga | 8f25af89b6 | lint; Former-commit-id: 7daf8366db0e161d46993fd87cf983a27a0ce2a3 | 2024-06-06 03:33:44 +08:00
hoshi-hiyouga | 229794a148 | Merge pull request #4066 from injet-zhou/main; add throughput entry to training log; Former-commit-id: f2580ad403cd0ae91aa0954c0a15363c46452438 | 2024-06-06 03:32:04 +08:00