Commit Graph

2126 Commits

Author SHA1 Message Date
hoshi-hiyouga
1a0758b0a1 Update label_issue.yml
Former-commit-id: aa60cd8910
2024-06-24 21:59:39 +08:00
hoshi-hiyouga
fe407e8de6 Merge pull request #4446 from stceum/bug-fix
Bug Fix: `off` is parsed as `False` in yaml file

Former-commit-id: cc452c32c7
2024-06-24 21:41:28 +08:00
hoshi-hiyouga
e74fcdf7b1 Update parser.py
Former-commit-id: e90c424f55
2024-06-24 21:37:42 +08:00
hoshi-hiyouga
a9f10a9abd Update test_attention.py
Former-commit-id: a9b3d91952
2024-06-24 21:35:34 +08:00
stceum
9aa640f27b Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
Former-commit-id: 3ed063f281
2024-06-24 20:39:31 +08:00
MengqingCao
f923989a6e auto-label npu issue
Former-commit-id: 90c74ff251
2024-06-24 12:27:00 +00:00
MengqingCao
3b499948a5 update docker files
1. add docker-npu (Dockerfile and docker-compose.yml)
  2. move cuda docker to docker-cuda and tiny changes to adapt to the new path


Former-commit-id: d7207e8ad1
2024-06-24 10:57:36 +00:00
hiyouga
a1df18c5df update readme
Former-commit-id: 4ea84a8333
2024-06-24 18:29:04 +08:00
hiyouga
7be502c5c5 update readme
Former-commit-id: e507e60638
2024-06-24 18:22:12 +08:00
codemayq
bb9f48590f update wechat
Former-commit-id: 5b897e7c35
2024-06-22 11:57:39 +08:00
mMrBun
c0e005e2ea Add tool_format to overwrite tool formatter template
Former-commit-id: 20e2e6fdcb
2024-06-22 02:13:23 +08:00
hiyouga
98abb5c900 remove dup template
Former-commit-id: db9a1912e3
2024-06-22 01:31:32 +08:00
hiyouga
ccc9a895a6 fix api
Former-commit-id: 3ce44dda99
2024-06-22 00:00:38 +08:00
Erich Schubert
cf23a279fd Print help if no arguments given
Former-commit-id: 7d70ba7fb8
2024-06-21 09:14:21 +02:00
ancv
5319447aa5 move configure_packing to llamafactory.model.patcher and fix constants
Former-commit-id: 770f75dc83
2024-06-21 00:45:06 +07:00
hiyouga
0844750bb9 tiny fix
Former-commit-id: 8d4f5093cf
2024-06-20 22:56:05 +08:00
hoshi-hiyouga
7d3b21684c Merge pull request #4382 from MengqingCao/bugfix
upper bound numpy version to <2.0

Former-commit-id: a459624474
2024-06-20 10:19:37 +08:00
MengqingCao
cd563116ca update dependencies
Former-commit-id: 7d4a293033
2024-06-20 02:09:47 +00:00
hiyouga
6ea4680334 improve llamaboard
Former-commit-id: f22d8f9ca4
2024-06-19 23:46:03 +08:00
hiyouga
029c343537 fix llamaboard abort
Former-commit-id: 3f84411b5d
2024-06-19 23:22:28 +08:00
hiyouga
030b4811c7 update patcher
Former-commit-id: 3b040e8e0f
2024-06-19 21:27:00 +08:00
hiyouga
80e9f8e000 set dev version
Former-commit-id: 42e69a3c63
2024-06-19 21:08:16 +08:00
hiyouga
fded2306dc Update publish.yml
Former-commit-id: 87e330fee5
2024-06-19 20:46:33 +08:00
hiyouga
9c1b04cd11 release v0.8.2
Former-commit-id: 71327ba85a
2024-06-19 20:42:09 +08:00
hiyouga
3d72b1a856 fix jinja template
Former-commit-id: 2b596fb55f
2024-06-19 20:03:50 +08:00
hiyouga
7735456561 fix templates
Former-commit-id: 4cff6a4ad5
2024-06-19 17:44:05 +08:00
codingma
53b48eb052 update wechat_npu.jpg
Former-commit-id: c48cbc371d
2024-06-19 14:02:24 +08:00
Jonery
c779899f7b Cleaner integration.
Former-commit-id: 5c2ff1b749
2024-06-19 12:29:40 +08:00
hiyouga
c9557241f6 fix bug
Former-commit-id: 6d2bf216ac
2024-06-19 03:49:23 +08:00
hiyouga
e73a235a38 use prefix to replace force system
Former-commit-id: 4f22eae8f4
2024-06-19 03:39:52 +08:00
hiyouga
bccc852f76 fix tool formatter, allow parallel function #4362
Former-commit-id: cd75b1fe9d
2024-06-19 03:23:51 +08:00
hoshi-hiyouga
6db02615d4 Merge pull request #4173 from mMrBun/main
Implemented the tool_formatter and tool_extractor for glm4 and Qwen2 tool_format

Former-commit-id: c0ca42566c
2024-06-19 03:18:55 +08:00
hiyouga
89564e90d7 update data
Former-commit-id: 9ab0401948
2024-06-19 02:48:43 +08:00
hiyouga
9e5988717d tiny fix
Former-commit-id: 344b9a36b2
2024-06-18 23:32:18 +08:00
hoshi-hiyouga
9055e66643 Merge pull request #4314 from EliMCosta/patch-2
Fix Dockerfile

Former-commit-id: 89a50dbfde
2024-06-18 23:30:59 +08:00
hoshi-hiyouga
9b30635ff0 Merge pull request #4309 from EliMCosta/patch-1
Add Magpie and Webinstruct dataset samples

Former-commit-id: 10316dd8ca
2024-06-18 23:30:19 +08:00
hiyouga
e3bf22f61b add deepseek coder v2 #4346
Former-commit-id: a233fbc258
2024-06-18 22:53:54 +08:00
hiyouga
5156114981 fix #4357
Former-commit-id: 4bd77d8563
2024-06-18 22:42:45 +08:00
hoshi-hiyouga
b596addd1f Merge pull request #4334 from zzxzz12345/bugfix/add-pandas-versions
Update requirements.txt

Former-commit-id: 078040babd
2024-06-18 22:30:35 +08:00
hoshi-hiyouga
09c34e5b6c Update requirements.txt
Former-commit-id: e8c518c08a
2024-06-18 22:27:24 +08:00
hiyouga
15a5eb6647 fix #4335
Former-commit-id: c96264bc47
2024-06-18 22:08:56 +08:00
Jonery
bc1c082bc2 add example
Former-commit-id: 97c5235160
2024-06-18 13:50:26 +08:00
Jonery
c2734108e7 fix typo
Former-commit-id: 8f7c78b641
2024-06-18 12:39:26 +08:00
Jonery
3a5eacb4cf Support distributed BAdam.
Former-commit-id: 0f72aac8c9
2024-06-18 12:27:47 +08:00
hiyouga
19bf21efba lint
Former-commit-id: 24c160df3d
2024-06-17 22:35:56 +08:00
hiyouga
3d85217464 update chat engine #4335
Former-commit-id: 7857c0990b
2024-06-17 19:07:17 +08:00
hiyouga
9e0ec3831f update readme
Former-commit-id: fcb2e8e7b7
2024-06-17 18:47:24 +08:00
Jonery
5d59f6562a Merge remote-tracking branch 'upstream/main'
Former-commit-id: ea1f3ba5e0
2024-06-17 18:44:51 +08:00
Jonery
67df86201a update gitigore
Former-commit-id: b2fc9cc15f
2024-06-17 18:29:36 +08:00
Jonery
756566342d adapt for badam with ds zero3
Former-commit-id: 33b4372778
2024-06-17 18:18:10 +08:00