Commit Graph

52 Commits

Author SHA1 Message Date
hiyouga
8cafc7b055 video datasets 2024-09-05 02:04:17 +08:00
hiyouga
dabad5570b update get template 2024-09-04 22:36:20 +08:00
hiyouga
d41d43a7c3 fix #5344 2024-09-04 03:06:06 +08:00
hiyouga
9967ccb3ae fix mixed mm inputs and rlhf-v 2024-09-01 20:52:47 +08:00
hiyouga
a2a8c0b92c add test mm plugin 2024-08-31 01:53:38 +08:00
hiyouga
3382317e32 refactor mm training 2024-08-30 02:14:31 +08:00
simonJJJ
aeb85f200b initial-commit 2024-08-28 16:51:35 +08:00
hoshi-hiyouga
15be296347 Merge pull request #5156 from YeQiuO/main
fix Llama-template's system prompt bug
2024-08-20 00:09:03 +08:00
hoshi-hiyouga
ec72eeca52 Update template.py 2024-08-20 00:03:33 +08:00
hoshi-hiyouga
5f3300ec5d Update template.py 2024-08-19 23:40:16 +08:00
Huiyu Chen
2502833a77 Add SailorLLM template 2024-08-15 15:10:14 +08:00
“Wzw”
bcbbf45063 fix Llama-template's system prompt bug 2024-08-12 19:22:12 +08:00
hiyouga
c87023d539 follow #5115 2024-08-09 18:03:00 +08:00
hoshi-hiyouga
51542cb15f Merge pull request #5115 from YeQiuO/main
fix: `Train on the last turn only` truncate bug
2024-08-09 17:58:27 +08:00
hoshi-hiyouga
4f62e1cb24 Update template.py 2024-08-09 16:27:42 +08:00
“Wzw”
b5ca86cc07 fix mask_history tiny bug 2024-08-08 10:09:33 +08:00
moontidef
b82ecbedd0 fix: fix the deepseekcoder template to avoid repeat problem 2024-08-05 23:55:45 +08:00
huangpan.foo
44e48e2b82 update deepseek template 2024-07-19 15:02:54 +08:00
hiyouga
53b1002fb7 add codegeex4, internlm2.5 2024-07-06 16:16:47 +08:00
hiyouga
9f33f1edf5 fix processors 2024-07-05 08:33:22 +08:00
hiyouga
1771251ce3 fix #4402 #4617
Deprecate reserved_label_len arg
2024-07-01 01:19:27 +08:00
hiyouga
59e0b4f616 fix #4556 2024-06-26 19:43:16 +08:00
hiyouga
41086059b1 tiny fix 2024-06-25 01:15:19 +08:00
hoshi-hiyouga
1240bd57d8 Update template.py 2024-06-24 23:12:59 +08:00
mMrBun
20e2e6fdcb Add tool_format to overwrite tool formatter template 2024-06-22 02:13:23 +08:00
hiyouga
db9a1912e3 remove dup template 2024-06-22 01:31:32 +08:00
hiyouga
2b596fb55f fix jinja template 2024-06-19 20:03:50 +08:00
hiyouga
4cff6a4ad5 fix templates 2024-06-19 17:44:05 +08:00
hiyouga
6d2bf216ac fix bug 2024-06-19 03:49:23 +08:00
hiyouga
4f22eae8f4 use prefix to replace force system 2024-06-19 03:39:52 +08:00
hiyouga
cd75b1fe9d fix tool formatter, allow parallel function #4362 2024-06-19 03:23:51 +08:00
hoshi-hiyouga
c0ca42566c Merge pull request #4173 from mMrBun/main
Implemented the tool_formatter and tool_extractor for glm4 and Qwen2 tool_format
2024-06-19 03:18:55 +08:00
hiyouga
d87108daa6 add license 2024-06-15 17:54:33 +08:00
mMrBun
0f2609ce19 Merge branch 'hiyouga:main' into main 2024-06-09 18:17:24 +08:00
mMrBun
cb1cbcb293 Implemented the tool_formatter and tool_extractor for glm4 tool_format 2024-06-09 18:16:15 +08:00
hiyouga
5aa4ce4756 release v0.8.0 2024-06-08 05:20:54 +08:00
hiyouga
74f96efef9 rename files 2024-06-07 00:09:06 +08:00
hiyouga
f48f5e646e support glm-4 2024-06-05 15:16:38 +08:00
hiyouga
d0aa36b8ad fix cohere system 2024-05-29 20:58:23 +08:00
hiyouga
0930f58699 fix #3965 2024-05-29 20:55:51 +08:00
hiyouga
89ca832740 update readme 2024-05-29 18:39:11 +08:00
hzhaoy
0dd632fe9e add TeleChat-12B/TeleChat-12B-v2 models 2024-05-29 15:00:37 +08:00
Yimi81
dc07413e7d fix yi template 2024-05-27 13:11:25 +00:00
hiyouga
c1fdf81df6 tiny fix 2024-05-27 20:54:26 +08:00
hoshi-hiyouga
f1002b9f93 Update template.py 2024-05-27 20:51:56 +08:00
hoshi-hiyouga
122213a7a7 Update template.py 2024-05-27 20:51:26 +08:00
Jianbai Ye
cff815391f add openchat-3.6-8B support 2024-05-27 20:42:08 +08:00
hiyouga
5581cb2e4e update readme 2024-05-27 18:14:02 +08:00
hiyouga
542229abb3 fix paligemma inference 2024-05-20 23:36:43 +08:00
hiyouga
d52fae2fa8 fix chat engines
do not use pop(key, default) since api assigns None to dict values
2024-05-20 00:36:43 +08:00