fanjunliang
e7bd3ab6c3
fix torch-npu dependency
...
Former-commit-id: 8096f94a7d
2024-06-26 18:21:42 +08:00
hoshi-hiyouga
2300fb616b
Merge pull request #4544 from MengqingCao/npu
...
fix docker-compose path
Former-commit-id: 82d0b46bc9
2024-06-26 10:19:24 +08:00
MengqingCao
7c7d6614d8
fix docker-compose path
...
Former-commit-id: 106647a99d
2024-06-26 02:15:00 +00:00
hzhaoy
08a221443c
support flash-attn in Dockerfile
...
Former-commit-id: c88b1be9f3
2024-06-25 15:13:07 +08:00
hiyouga
1a79dd23ff
fix #4456
...
Former-commit-id: 50b44d3c6d
2024-06-25 14:34:13 +08:00
hiyouga
f3f25ae3b7
lint
...
Former-commit-id: 555ca8d780
2024-06-25 02:55:50 +08:00
hiyouga
80effa2993
fix test case
...
Former-commit-id: c244af0dc3
2024-06-25 02:51:49 +08:00
hiyouga
0ae1302e41
fix #4432
...
Former-commit-id: 1e9d0aa1e4
2024-06-25 02:34:04 +08:00
hiyouga
ad0304e147
fix #4379
...
Former-commit-id: cc016461e6
2024-06-25 02:31:44 +08:00
hiyouga
a225b5a70c
tiny fix about badam
...
Former-commit-id: 095fab58d3
2024-06-25 01:54:53 +08:00
hiyouga
dafc9268bc
fix #4419
...
Former-commit-id: efb81b25ec
2024-06-25 01:51:29 +08:00
hoshi-hiyouga
fe6ef6400c
Merge pull request #4352 from Ledzy/main
...
[Enhancement] Support ZeRO-3 when using BAdam
Former-commit-id: d0f953bf5b
2024-06-25 01:49:13 +08:00
hiyouga
d519c2fde5
tiny fix
...
Former-commit-id: 41086059b1
2024-06-25 01:15:19 +08:00
hoshi-hiyouga
ab1fbbc3ec
Merge pull request #4355 from MengqingCao/npu
...
Add docker-npu
Former-commit-id: d0e6059902
2024-06-25 01:07:43 +08:00
hoshi-hiyouga
678884f97c
Update README_zh.md
...
Former-commit-id: ec95f942d1
2024-06-25 01:06:59 +08:00
hoshi-hiyouga
cbc23fc299
Update README.md
...
Former-commit-id: 5dc8fa647e
2024-06-25 01:03:38 +08:00
hoshi-hiyouga
af5b2b9299
Update docker-compose.yml
...
Former-commit-id: 721acd8768
2024-06-25 00:54:28 +08:00
hoshi-hiyouga
6cd45e95f7
Update Dockerfile
...
Former-commit-id: 3af936a76d
2024-06-25 00:50:34 +08:00
hoshi-hiyouga
62e63d74ec
Update docker-compose.yml
...
Former-commit-id: 15608d0558
2024-06-25 00:46:47 +08:00
hoshi-hiyouga
cfa2dbefcb
Update Dockerfile
...
Former-commit-id: fce146ab68
2024-06-25 00:46:08 +08:00
hoshi-hiyouga
f84bce3638
Update Dockerfile
...
Former-commit-id: dcc2e24f5c
2024-06-24 23:41:35 +08:00
hoshi-hiyouga
37a079a072
Merge pull request #4409 from kno10/patch-2
...
Print help if no arguments given
Former-commit-id: 3bed18c644
2024-06-24 23:21:31 +08:00
hoshi-hiyouga
60937ccf32
Update cli.py
...
Former-commit-id: acb61f7ab7
2024-06-24 23:21:10 +08:00
hoshi-hiyouga
709bbc1d92
Merge pull request #4417 from mMrBun/main
...
Add tool_format parameter to rewrite templates for different function call formats.
Former-commit-id: def6d280db
2024-06-24 23:17:55 +08:00
hoshi-hiyouga
18863245df
Update test_formatter.py
...
Former-commit-id: 672152d2ce
2024-06-24 23:14:36 +08:00
hoshi-hiyouga
b7f5cfde6e
Update template.py
...
Former-commit-id: 1240bd57d8
2024-06-24 23:12:59 +08:00
hoshi-hiyouga
673f27a59e
Update loader.py
...
Former-commit-id: dddfd516ee
2024-06-24 23:06:18 +08:00
hiyouga
47651a94a3
fix #4410
...
Former-commit-id: fca893d73c
2024-06-24 22:34:31 +08:00
hoshi-hiyouga
f3a2dda567
Merge pull request #4445 from MengqingCao/label
...
auto-label npu issue
Former-commit-id: e0014db7d2
2024-06-24 22:02:05 +08:00
hoshi-hiyouga
78baa8a509
Update label_issue.yml
...
Former-commit-id: 80d1910a93
2024-06-24 22:01:23 +08:00
hoshi-hiyouga
1a0758b0a1
Update label_issue.yml
...
Former-commit-id: aa60cd8910
2024-06-24 21:59:39 +08:00
hoshi-hiyouga
fe407e8de6
Merge pull request #4446 from stceum/bug-fix
...
Bug Fix: `off` is parsed as `False` in yaml file
Former-commit-id: cc452c32c7
2024-06-24 21:41:28 +08:00
hoshi-hiyouga
e74fcdf7b1
Update parser.py
...
Former-commit-id: e90c424f55
2024-06-24 21:37:42 +08:00
hoshi-hiyouga
a9f10a9abd
Update test_attention.py
...
Former-commit-id: a9b3d91952
2024-06-24 21:35:34 +08:00
stceum
9aa640f27b
Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
...
Former-commit-id: 3ed063f281
2024-06-24 20:39:31 +08:00
MengqingCao
f923989a6e
auto-label npu issue
...
Former-commit-id: 90c74ff251
2024-06-24 12:27:00 +00:00
MengqingCao
3b499948a5
update docker files
...
1. add docker-npu (Dockerfile and docker-compose.yml)
2. move cuda docker to docker-cuda and tiny changes to adapt to the new path
Former-commit-id: d7207e8ad1
2024-06-24 10:57:36 +00:00
hiyouga
a1df18c5df
update readme
...
Former-commit-id: 4ea84a8333
2024-06-24 18:29:04 +08:00
hiyouga
7be502c5c5
update readme
...
Former-commit-id: e507e60638
2024-06-24 18:22:12 +08:00
codemayq
bb9f48590f
update wechat
...
Former-commit-id: 5b897e7c35
2024-06-22 11:57:39 +08:00
mMrBun
c0e005e2ea
Add tool_format to overwrite tool formatter template
...
Former-commit-id: 20e2e6fdcb
2024-06-22 02:13:23 +08:00
hiyouga
98abb5c900
remove dup template
...
Former-commit-id: db9a1912e3
2024-06-22 01:31:32 +08:00
hiyouga
ccc9a895a6
fix api
...
Former-commit-id: 3ce44dda99
2024-06-22 00:00:38 +08:00
Erich Schubert
cf23a279fd
Print help if no arguments given
...
Former-commit-id: 7d70ba7fb8
2024-06-21 09:14:21 +02:00
ancv
5319447aa5
move configure_packing to llamafactory.model.patcher and fix constants
...
Former-commit-id: 770f75dc83
2024-06-21 00:45:06 +07:00
hiyouga
0844750bb9
tiny fix
...
Former-commit-id: 8d4f5093cf
2024-06-20 22:56:05 +08:00
hoshi-hiyouga
7d3b21684c
Merge pull request #4382 from MengqingCao/bugfix
...
upper bound numpy version to <2.0
Former-commit-id: a459624474
2024-06-20 10:19:37 +08:00
MengqingCao
cd563116ca
update dependencies
...
Former-commit-id: 7d4a293033
2024-06-20 02:09:47 +00:00
hiyouga
6ea4680334
improve llamaboard
...
Former-commit-id: f22d8f9ca4
2024-06-19 23:46:03 +08:00
hiyouga
029c343537
fix llamaboard abort
...
Former-commit-id: 3f84411b5d
2024-06-19 23:22:28 +08:00