hoshi-hiyouga
78baa8a509
Update label_issue.yml
...
Former-commit-id: 80d1910a93
2024-06-24 22:01:23 +08:00
hoshi-hiyouga
1a0758b0a1
Update label_issue.yml
...
Former-commit-id: aa60cd8910
2024-06-24 21:59:39 +08:00
hoshi-hiyouga
fe407e8de6
Merge pull request #4446 from stceum/bug-fix
...
Bug Fix: `off` is parsed as `False` in yaml file
Former-commit-id: cc452c32c7
2024-06-24 21:41:28 +08:00
hoshi-hiyouga
e74fcdf7b1
Update parser.py
...
Former-commit-id: e90c424f55
2024-06-24 21:37:42 +08:00
hoshi-hiyouga
a9f10a9abd
Update test_attention.py
...
Former-commit-id: a9b3d91952
2024-06-24 21:35:34 +08:00
stceum
9aa640f27b
Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
...
Former-commit-id: 3ed063f281
2024-06-24 20:39:31 +08:00
MengqingCao
f923989a6e
auto-label npu issue
...
Former-commit-id: 90c74ff251
2024-06-24 12:27:00 +00:00
MengqingCao
3b499948a5
update docker files
...
1. add docker-npu (Dockerfile and docker-compose.yml)
2. move cuda docker to docker-cuda and tiny changes to adapt to the new path
Former-commit-id: d7207e8ad1
2024-06-24 10:57:36 +00:00
hiyouga
a1df18c5df
update readme
...
Former-commit-id: 4ea84a8333
2024-06-24 18:29:04 +08:00
hiyouga
7be502c5c5
update readme
...
Former-commit-id: e507e60638
2024-06-24 18:22:12 +08:00
codemayq
bb9f48590f
update wechat
...
Former-commit-id: 5b897e7c35
2024-06-22 11:57:39 +08:00
mMrBun
c0e005e2ea
Add tool_format to overwrite tool formatter template
...
Former-commit-id: 20e2e6fdcb
2024-06-22 02:13:23 +08:00
hiyouga
98abb5c900
remove dup template
...
Former-commit-id: db9a1912e3
2024-06-22 01:31:32 +08:00
hiyouga
ccc9a895a6
fix api
...
Former-commit-id: 3ce44dda99
2024-06-22 00:00:38 +08:00
Erich Schubert
cf23a279fd
Print help if no arguments given
...
Former-commit-id: 7d70ba7fb8
2024-06-21 09:14:21 +02:00
ancv
5319447aa5
move configure_packing to llamafactory.model.patcher and fix constants
...
Former-commit-id: 770f75dc83
2024-06-21 00:45:06 +07:00
hiyouga
0844750bb9
tiny fix
...
Former-commit-id: 8d4f5093cf
2024-06-20 22:56:05 +08:00
hoshi-hiyouga
7d3b21684c
Merge pull request #4382 from MengqingCao/bugfix
...
upper bound numpy version to <2.0
Former-commit-id: a459624474
2024-06-20 10:19:37 +08:00
MengqingCao
cd563116ca
update dependencies
...
Former-commit-id: 7d4a293033
2024-06-20 02:09:47 +00:00
hiyouga
6ea4680334
improve llamaboard
...
Former-commit-id: f22d8f9ca4
2024-06-19 23:46:03 +08:00
hiyouga
029c343537
fix llamaboard abort
...
Former-commit-id: 3f84411b5d
2024-06-19 23:22:28 +08:00
hiyouga
030b4811c7
update patcher
...
Former-commit-id: 3b040e8e0f
2024-06-19 21:27:00 +08:00
hiyouga
80e9f8e000
set dev version
...
Former-commit-id: 42e69a3c63
2024-06-19 21:08:16 +08:00
hiyouga
fded2306dc
Update publish.yml
...
Former-commit-id: 87e330fee5
2024-06-19 20:46:33 +08:00
hiyouga
9c1b04cd11
release v0.8.2
...
Former-commit-id: 71327ba85a
2024-06-19 20:42:09 +08:00
hiyouga
3d72b1a856
fix jinja template
...
Former-commit-id: 2b596fb55f
2024-06-19 20:03:50 +08:00
hiyouga
7735456561
fix templates
...
Former-commit-id: 4cff6a4ad5
2024-06-19 17:44:05 +08:00
codingma
53b48eb052
update wechat_npu.jpg
...
Former-commit-id: c48cbc371d
2024-06-19 14:02:24 +08:00
Jonery
c779899f7b
Cleaner integration.
...
Former-commit-id: 5c2ff1b749
2024-06-19 12:29:40 +08:00
hiyouga
c9557241f6
fix bug
...
Former-commit-id: 6d2bf216ac
2024-06-19 03:49:23 +08:00
hiyouga
e73a235a38
use prefix to replace force system
...
Former-commit-id: 4f22eae8f4
2024-06-19 03:39:52 +08:00
hiyouga
bccc852f76
fix tool formatter, allow parallel function #4362
...
Former-commit-id: cd75b1fe9d
2024-06-19 03:23:51 +08:00
hoshi-hiyouga
6db02615d4
Merge pull request #4173 from mMrBun/main
...
Implemented the tool_formatter and tool_extractor for glm4 and Qwen2 tool_format
Former-commit-id: c0ca42566c
2024-06-19 03:18:55 +08:00
hiyouga
89564e90d7
update data
...
Former-commit-id: 9ab0401948
2024-06-19 02:48:43 +08:00
hiyouga
9e5988717d
tiny fix
...
Former-commit-id: 344b9a36b2
2024-06-18 23:32:18 +08:00
hoshi-hiyouga
9055e66643
Merge pull request #4314 from EliMCosta/patch-2
...
Fix Dockerfile
Former-commit-id: 89a50dbfde
2024-06-18 23:30:59 +08:00
hoshi-hiyouga
9b30635ff0
Merge pull request #4309 from EliMCosta/patch-1
...
Add Magpie and Webinstruct dataset samples
Former-commit-id: 10316dd8ca
2024-06-18 23:30:19 +08:00
hiyouga
e3bf22f61b
add deepseek coder v2 #4346
...
Former-commit-id: a233fbc258
2024-06-18 22:53:54 +08:00
hiyouga
5156114981
fix #4357
...
Former-commit-id: 4bd77d8563
2024-06-18 22:42:45 +08:00
hoshi-hiyouga
b596addd1f
Merge pull request #4334 from zzxzz12345/bugfix/add-pandas-versions
...
Update requirements.txt
Former-commit-id: 078040babd
2024-06-18 22:30:35 +08:00
hoshi-hiyouga
09c34e5b6c
Update requirements.txt
...
Former-commit-id: e8c518c08a
2024-06-18 22:27:24 +08:00
hiyouga
15a5eb6647
fix #4335
...
Former-commit-id: c96264bc47
2024-06-18 22:08:56 +08:00
Jonery
bc1c082bc2
add example
...
Former-commit-id: 97c5235160
2024-06-18 13:50:26 +08:00
Jonery
c2734108e7
fix typo
...
Former-commit-id: 8f7c78b641
2024-06-18 12:39:26 +08:00
Jonery
3a5eacb4cf
Support distributed BAdam.
...
Former-commit-id: 0f72aac8c9
2024-06-18 12:27:47 +08:00
hiyouga
19bf21efba
lint
...
Former-commit-id: 24c160df3d
2024-06-17 22:35:56 +08:00
hiyouga
3d85217464
update chat engine #4335
...
Former-commit-id: 7857c0990b
2024-06-17 19:07:17 +08:00
hiyouga
9e0ec3831f
update readme
...
Former-commit-id: fcb2e8e7b7
2024-06-17 18:47:24 +08:00
Jonery
5d59f6562a
Merge remote-tracking branch 'upstream/main'
...
Former-commit-id: ea1f3ba5e0
2024-06-17 18:44:51 +08:00
Jonery
67df86201a
update gitignore
...
Former-commit-id: b2fc9cc15f
2024-06-17 18:29:36 +08:00