hiyouga
|
8cafc7b055
|
video datasets
|
2024-09-05 02:04:17 +08:00 |
|
hiyouga
|
dabad5570b
|
update get template
|
2024-09-04 22:36:20 +08:00 |
|
hiyouga
|
d41d43a7c3
|
fix #5344
|
2024-09-04 03:06:06 +08:00 |
|
hiyouga
|
9967ccb3ae
|
fix mixed mm inputs and rlhf-v
|
2024-09-01 20:52:47 +08:00 |
|
hiyouga
|
a2a8c0b92c
|
add test mm plugin
|
2024-08-31 01:53:38 +08:00 |
|
hiyouga
|
3382317e32
|
refactor mm training
|
2024-08-30 02:14:31 +08:00 |
|
simonJJJ
|
aeb85f200b
|
initial-commit
|
2024-08-28 16:51:35 +08:00 |
|
hoshi-hiyouga
|
15be296347
|
Merge pull request #5156 from YeQiuO/main
fix Llama-template's system prompt bug
|
2024-08-20 00:09:03 +08:00 |
|
hoshi-hiyouga
|
ec72eeca52
|
Update template.py
|
2024-08-20 00:03:33 +08:00 |
|
hoshi-hiyouga
|
5f3300ec5d
|
Update template.py
|
2024-08-19 23:40:16 +08:00 |
|
Huiyu Chen
|
2502833a77
|
Add SailorLLM template
|
2024-08-15 15:10:14 +08:00 |
|
“Wzw”
|
bcbbf45063
|
fix Llama-template's system prompt bug
|
2024-08-12 19:22:12 +08:00 |
|
hiyouga
|
c87023d539
|
follow #5115
|
2024-08-09 18:03:00 +08:00 |
|
hoshi-hiyouga
|
51542cb15f
|
Merge pull request #5115 from YeQiuO/main
fix: `Train on the last turn only` truncate bug
|
2024-08-09 17:58:27 +08:00 |
|
hoshi-hiyouga
|
4f62e1cb24
|
Update template.py
|
2024-08-09 16:27:42 +08:00 |
|
“Wzw”
|
b5ca86cc07
|
fix mask_history tiny bug
|
2024-08-08 10:09:33 +08:00 |
|
moontidef
|
b82ecbedd0
|
fix: fix the deepseekcoder template to avoid repeat problem
|
2024-08-05 23:55:45 +08:00 |
|
huangpan.foo
|
44e48e2b82
|
update deepseek template
|
2024-07-19 15:02:54 +08:00 |
|
hiyouga
|
53b1002fb7
|
add codegeex4, internlm2.5
|
2024-07-06 16:16:47 +08:00 |
|
hiyouga
|
9f33f1edf5
|
fix processors
|
2024-07-05 08:33:22 +08:00 |
|
hiyouga
|
1771251ce3
|
fix #4402 #4617
Deprecate reserved_label_len arg
|
2024-07-01 01:19:27 +08:00 |
|
hiyouga
|
59e0b4f616
|
fix #4556
|
2024-06-26 19:43:16 +08:00 |
|
hiyouga
|
41086059b1
|
tiny fix
|
2024-06-25 01:15:19 +08:00 |
|
hoshi-hiyouga
|
1240bd57d8
|
Update template.py
|
2024-06-24 23:12:59 +08:00 |
|
mMrBun
|
20e2e6fdcb
|
Add tool_format to overwrite tool formatter template
|
2024-06-22 02:13:23 +08:00 |
|
hiyouga
|
db9a1912e3
|
remove dup template
|
2024-06-22 01:31:32 +08:00 |
|
hiyouga
|
2b596fb55f
|
fix jinja template
|
2024-06-19 20:03:50 +08:00 |
|
hiyouga
|
4cff6a4ad5
|
fix templates
|
2024-06-19 17:44:05 +08:00 |
|
hiyouga
|
6d2bf216ac
|
fix bug
|
2024-06-19 03:49:23 +08:00 |
|
hiyouga
|
4f22eae8f4
|
use prefix to replace force system
|
2024-06-19 03:39:52 +08:00 |
|
hiyouga
|
cd75b1fe9d
|
fix tool formatter, allow parallel function #4362
|
2024-06-19 03:23:51 +08:00 |
|
hoshi-hiyouga
|
c0ca42566c
|
Merge pull request #4173 from mMrBun/main
Implemented the tool_formatter and tool_extractor for glm4 and Qwen2 tool_format
|
2024-06-19 03:18:55 +08:00 |
|
hiyouga
|
d87108daa6
|
add license
|
2024-06-15 17:54:33 +08:00 |
|
mMrBun
|
0f2609ce19
|
Merge branch 'hiyouga:main' into main
|
2024-06-09 18:17:24 +08:00 |
|
mMrBun
|
cb1cbcb293
|
Implemented the tool_formatter and tool_extractor for glm4 tool_format
|
2024-06-09 18:16:15 +08:00 |
|
hiyouga
|
5aa4ce4756
|
release v0.8.0
|
2024-06-08 05:20:54 +08:00 |
|
hiyouga
|
74f96efef9
|
rename files
|
2024-06-07 00:09:06 +08:00 |
|
hiyouga
|
f48f5e646e
|
support glm-4
|
2024-06-05 15:16:38 +08:00 |
|
hiyouga
|
d0aa36b8ad
|
fix cohere system
|
2024-05-29 20:58:23 +08:00 |
|
hiyouga
|
0930f58699
|
fix #3965
|
2024-05-29 20:55:51 +08:00 |
|
hiyouga
|
89ca832740
|
update readme
|
2024-05-29 18:39:11 +08:00 |
|
hzhaoy
|
0dd632fe9e
|
add TeleChat-12B/TeleChat-12B-v2 models
|
2024-05-29 15:00:37 +08:00 |
|
Yimi81
|
dc07413e7d
|
fix yi template
|
2024-05-27 13:11:25 +00:00 |
|
hiyouga
|
c1fdf81df6
|
tiny fix
|
2024-05-27 20:54:26 +08:00 |
|
hoshi-hiyouga
|
f1002b9f93
|
Update template.py
|
2024-05-27 20:51:56 +08:00 |
|
hoshi-hiyouga
|
122213a7a7
|
Update template.py
|
2024-05-27 20:51:26 +08:00 |
|
Jianbai Ye
|
cff815391f
|
add openchat-3.6-8B support
|
2024-05-27 20:42:08 +08:00 |
|
hiyouga
|
5581cb2e4e
|
update readme
|
2024-05-27 18:14:02 +08:00 |
|
hiyouga
|
542229abb3
|
fix paligemma inference
|
2024-05-20 23:36:43 +08:00 |
|
hiyouga
|
d52fae2fa8
|
fix chat engines
do not use pop(key, default) since api assigns None to dict values
|
2024-05-20 00:36:43 +08:00 |
|