Commit Graph

243 Commits

Author SHA1 Message Date
menibrief
9c1bbfac21 fix phi-small template 2024-09-18 23:52:30 +03:00
BUAADreamer
e387216d49 try to past test 2024-09-10 13:25:30 +08:00
Zhangchi Feng
4643089a7d Merge branch 'hiyouga:main' into main 2024-09-10 13:20:24 +08:00
BUAADreamer
7b4ba0efb6 try to past test 2024-09-10 13:12:51 +08:00
BUAADreamer
31259e7e0c support llava-next(video) 2024-09-10 12:31:53 +08:00
hiyouga
bdde35fd2e update accelerate ver for schedule_free optimizers 2024-09-09 22:51:08 +08:00
hiyouga
6dc2b00fa2 fix mm plugin 2024-09-09 22:41:28 +08:00
hiyouga
c93795ae14 fix qwen2vl preprocess 2024-09-09 22:33:33 +08:00
hiyouga
54c6905937 add docstrings, refactor logger 2024-09-08 00:56:56 +08:00
hiyouga
76f2e59504 tiny fix 2024-09-05 23:41:16 +08:00
hiyouga
94d5b1bd8f add e2e tests 2024-09-05 21:52:28 +08:00
hoshi-hiyouga
1274356263 Merge pull request #5372 from LDLINGLINGLING/main
Added support for minicpm3.0
2024-09-05 21:35:42 +08:00
liudan
3d3fbaaff9 Revised the code to follow the coding conventions 2024-09-05 20:17:55 +08:00
hiyouga
359ef8bb0e support Yi-Coder models 2024-09-05 03:12:24 +08:00
hiyouga
8cafc7b055 video datasets 2024-09-05 02:04:17 +08:00
liudan
d7ba97be48 Added support for minicpm3.0 2024-09-04 23:10:05 +08:00
hiyouga
dabad5570b update get template 2024-09-04 22:36:20 +08:00
hoshi-hiyouga
8f441c2b3a Merge pull request #5323 from naem1023/feat/add-dataset-map-batch-size-argument
Add batch size of map function in the preprocessed dataset
2024-09-04 22:09:36 +08:00
hoshi-hiyouga
44d6947e55 fix #5228 2024-09-04 19:10:30 +08:00
hiyouga
d41d43a7c3 fix #5344 2024-09-04 03:06:06 +08:00
hiyouga
47ea97fb1b lazy image load 2024-09-04 02:27:08 +08:00
hiyouga
69d0acacc3 fix #5338 2024-09-03 17:45:17 +08:00
naem1023
209313eeea feat: add batch size of map function in the preprocessed dataset 2024-09-02 13:52:47 +09:00
hiyouga
3a6f19f017 tiny fix 2024-09-02 01:33:22 +08:00
hiyouga
ce8c5a2647 add image num check 2024-09-02 01:31:36 +08:00
hiyouga
8e49940746 add rlhf-v dataset 2024-09-01 22:57:41 +08:00
hiyouga
64cb947c60 fix bug 2024-09-01 21:07:49 +08:00
hiyouga
9967ccb3ae fix mixed mm inputs and rlhf-v 2024-09-01 20:52:47 +08:00
hiyouga
a2a8c0b92c add test mm plugin 2024-08-31 01:53:38 +08:00
hiyouga
bee1bd43b9 tiny fix 2024-08-30 03:21:50 +08:00
hiyouga
3382317e32 refactor mm training 2024-08-30 02:14:31 +08:00
hoshi-hiyouga
a8f22d8895 fix bug 2024-08-30 02:05:26 +08:00
simonJJJ
734e019cc1 update 2024-08-28 20:22:46 +08:00
simonJJJ
aeb85f200b initial-commit 2024-08-28 16:51:35 +08:00
hoshi-hiyouga
15be296347 Merge pull request #5156 from YeQiuO/main
fix Llama-template's system prompt bug
2024-08-20 00:09:03 +08:00
hoshi-hiyouga
ec72eeca52 Update template.py 2024-08-20 00:03:33 +08:00
hoshi-hiyouga
5f3300ec5d Update template.py 2024-08-19 23:40:16 +08:00
Huiyu Chen
2502833a77 Add SailorLLM template 2024-08-15 15:10:14 +08:00
“Wzw”
bcbbf45063 fix Llama-template's system prompt bug 2024-08-12 19:22:12 +08:00
hiyouga
c87023d539 follow #5115 2024-08-09 18:03:00 +08:00
hoshi-hiyouga
51542cb15f Merge pull request #5115 from YeQiuO/main
fix: `Train on the last turn only` truncate bug
2024-08-09 17:58:27 +08:00
hoshi-hiyouga
4f62e1cb24 Update template.py 2024-08-09 16:27:42 +08:00
“Wzw”
2fa1e0b2ad mask_history args verify valid 2024-08-08 10:12:01 +08:00
“Wzw”
b5ca86cc07 fix mask_history tiny bug 2024-08-08 10:09:33 +08:00
moontidef
b82ecbedd0 fix: fix the deepseekcoder template to avoid repeat problem 2024-08-05 23:55:45 +08:00
hoshi-hiyouga
8a2846cfe1 Merge pull request #4892 from piamo/main
update deepseek template
2024-07-26 11:49:34 +08:00
hiyouga
091010492b fix #4928 2024-07-24 17:00:29 +08:00
hiyouga
4135e69406 fix flashattn + packing 2024-07-21 17:07:45 +08:00
huangpan.foo
44e48e2b82 update deepseek template 2024-07-19 15:02:54 +08:00
hiyouga
779aae83d2 follow #4878 fix #4684 2024-07-18 22:06:12 +08:00