BUAADreamer
|
01ca056965
|
fix template
|
2024-09-29 22:56:36 +08:00 |
|
BUAADreamer
|
96bec6817a
|
fix template
|
2024-09-29 22:55:45 +08:00 |
|
BUAADreamer
|
bec1cb8d55
|
fix constants
|
2024-09-29 22:40:43 +08:00 |
|
BUAADreamer
|
65a8923f5a
|
add more llava-next series template
|
2024-09-29 21:29:29 +08:00 |
|
BUAADreamer
|
6642cd501d
|
add llava-next/llava-next-video/video-llava
|
2024-09-28 00:57:03 +08:00 |
|
Zhangchi Feng
|
900631755b
|
Merge branch 'hiyouga:main' into main
|
2024-09-27 18:14:39 +08:00 |
|
hiyouga
|
ba52103ba7
|
optionally replace jinja template
|
2024-09-25 23:02:02 +08:00 |
|
Zhangchi Feng
|
4643089a7d
|
Merge branch 'hiyouga:main' into main
|
2024-09-10 13:20:24 +08:00 |
|
BUAADreamer
|
31259e7e0c
|
support llava-next(video)
|
2024-09-10 12:31:53 +08:00 |
|
hiyouga
|
bdde35fd2e
|
update accelerate ver for schedule_free optimizers
|
2024-09-09 22:51:08 +08:00 |
|
hiyouga
|
54c6905937
|
add docstrings, refactor logger
|
2024-09-08 00:56:56 +08:00 |
|
hiyouga
|
94d5b1bd8f
|
add e2e tests
|
2024-09-05 21:52:28 +08:00 |
|
hoshi-hiyouga
|
1274356263
|
Merge pull request #5372 from LDLINGLINGLING/main
Added support for minicpm3.0
|
2024-09-05 21:35:42 +08:00 |
|
liudan
|
3d3fbaaff9
|
Revised the code to follow the coding conventions
|
2024-09-05 20:17:55 +08:00 |
|
hiyouga
|
8cafc7b055
|
video datasets
|
2024-09-05 02:04:17 +08:00 |
|
liudan
|
d7ba97be48
|
Added support for minicpm3.0
|
2024-09-04 23:10:05 +08:00 |
|
hiyouga
|
dabad5570b
|
update get template
|
2024-09-04 22:36:20 +08:00 |
|
hiyouga
|
d41d43a7c3
|
fix #5344
|
2024-09-04 03:06:06 +08:00 |
|
hiyouga
|
9967ccb3ae
|
fix mixed mm inputs and rlhf-v
|
2024-09-01 20:52:47 +08:00 |
|
hiyouga
|
a2a8c0b92c
|
add test mm plugin
|
2024-08-31 01:53:38 +08:00 |
|
hiyouga
|
3382317e32
|
refactor mm training
|
2024-08-30 02:14:31 +08:00 |
|
simonJJJ
|
aeb85f200b
|
initial-commit
|
2024-08-28 16:51:35 +08:00 |
|
hoshi-hiyouga
|
15be296347
|
Merge pull request #5156 from YeQiuO/main
fix Llama-template's system prompt bug
|
2024-08-20 00:09:03 +08:00 |
|
hoshi-hiyouga
|
ec72eeca52
|
Update template.py
|
2024-08-20 00:03:33 +08:00 |
|
hoshi-hiyouga
|
5f3300ec5d
|
Update template.py
|
2024-08-19 23:40:16 +08:00 |
|
Huiyu Chen
|
2502833a77
|
Add SailorLLM template
|
2024-08-15 15:10:14 +08:00 |
|
“Wzw”
|
bcbbf45063
|
fix Llama-template's system prompt bug
|
2024-08-12 19:22:12 +08:00 |
|
hiyouga
|
c87023d539
|
follow #5115
|
2024-08-09 18:03:00 +08:00 |
|
hoshi-hiyouga
|
51542cb15f
|
Merge pull request #5115 from YeQiuO/main
fix: `Train on the last turn only` truncate bug
|
2024-08-09 17:58:27 +08:00 |
|
hoshi-hiyouga
|
4f62e1cb24
|
Update template.py
|
2024-08-09 16:27:42 +08:00 |
|
“Wzw”
|
b5ca86cc07
|
fix mask_history tiny bug
|
2024-08-08 10:09:33 +08:00 |
|
moontidef
|
b82ecbedd0
|
fix: fix the deepseekcoder template to avoid repeat problem
|
2024-08-05 23:55:45 +08:00 |
|
huangpan.foo
|
44e48e2b82
|
update deepseek template
|
2024-07-19 15:02:54 +08:00 |
|
hiyouga
|
53b1002fb7
|
add codegeex4, internlm2.5
|
2024-07-06 16:16:47 +08:00 |
|
hiyouga
|
9f33f1edf5
|
fix processors
|
2024-07-05 08:33:22 +08:00 |
|
hiyouga
|
1771251ce3
|
fix #4402 #4617
Deprecate reserved_label_len arg
|
2024-07-01 01:19:27 +08:00 |
|
hiyouga
|
59e0b4f616
|
fix #4556
|
2024-06-26 19:43:16 +08:00 |
|
hiyouga
|
41086059b1
|
tiny fix
|
2024-06-25 01:15:19 +08:00 |
|
hoshi-hiyouga
|
1240bd57d8
|
Update template.py
|
2024-06-24 23:12:59 +08:00 |
|
mMrBun
|
20e2e6fdcb
|
Add tool_format to overwrite tool formatter template
|
2024-06-22 02:13:23 +08:00 |
|
hiyouga
|
db9a1912e3
|
remove dup template
|
2024-06-22 01:31:32 +08:00 |
|
hiyouga
|
2b596fb55f
|
fix jinja template
|
2024-06-19 20:03:50 +08:00 |
|
hiyouga
|
4cff6a4ad5
|
fix templates
|
2024-06-19 17:44:05 +08:00 |
|
hiyouga
|
6d2bf216ac
|
fix bug
|
2024-06-19 03:49:23 +08:00 |
|
hiyouga
|
4f22eae8f4
|
use prefix to replace force system
|
2024-06-19 03:39:52 +08:00 |
|
hiyouga
|
cd75b1fe9d
|
fix tool formatter, allow parallel function #4362
|
2024-06-19 03:23:51 +08:00 |
|
hoshi-hiyouga
|
c0ca42566c
|
Merge pull request #4173 from mMrBun/main
Implemented the tool_formatter and tool_extractor for glm4 and Qwen2 tool_format
|
2024-06-19 03:18:55 +08:00 |
|
hiyouga
|
d87108daa6
|
add license
|
2024-06-15 17:54:33 +08:00 |
|
mMrBun
|
0f2609ce19
|
Merge branch 'hiyouga:main' into main
|
2024-06-09 18:17:24 +08:00 |
|
mMrBun
|
cb1cbcb293
|
Implemented the tool_formatter and tool_extractor for glm4 tool_format
|
2024-06-09 18:16:15 +08:00 |
|