Commit Graph

122 Commits

Author SHA1 Message Date
hiyouga
47ea97fb1b lazy image load 2024-09-04 02:27:08 +08:00
hiyouga
69d0acacc3 fix #5338 2024-09-03 17:45:17 +08:00
hiyouga
3a6f19f017 tiny fix 2024-09-02 01:33:22 +08:00
hiyouga
ce8c5a2647 add image num check 2024-09-02 01:31:36 +08:00
hiyouga
8e49940746 add rlhf-v dataset 2024-09-01 22:57:41 +08:00
hiyouga
64cb947c60 fix bug 2024-09-01 21:07:49 +08:00
hiyouga
9967ccb3ae fix mixed mm inputs and rlhf-v 2024-09-01 20:52:47 +08:00
hiyouga
a2a8c0b92c add test mm plugin 2024-08-31 01:53:38 +08:00
hiyouga
bee1bd43b9 tiny fix 2024-08-30 03:21:50 +08:00
hiyouga
3382317e32 refactor mm training 2024-08-30 02:14:31 +08:00
hoshi-hiyouga
a8f22d8895 fix bug 2024-08-30 02:05:26 +08:00
simonJJJ
734e019cc1 update 2024-08-28 20:22:46 +08:00
simonJJJ
aeb85f200b initial-commit 2024-08-28 16:51:35 +08:00
hoshi-hiyouga
15be296347 Merge pull request #5156 from YeQiuO/main
fix Llama-template's system prompt bug
2024-08-20 00:09:03 +08:00
hoshi-hiyouga
ec72eeca52 Update template.py 2024-08-20 00:03:33 +08:00
hoshi-hiyouga
5f3300ec5d Update template.py 2024-08-19 23:40:16 +08:00
Huiyu Chen
2502833a77 Add SailorLLM template 2024-08-15 15:10:14 +08:00
“Wzw”
bcbbf45063 fix Llama-template's system prompt bug 2024-08-12 19:22:12 +08:00
hiyouga
c87023d539 follow #5115 2024-08-09 18:03:00 +08:00
hoshi-hiyouga
51542cb15f Merge pull request #5115 from YeQiuO/main
fix: `Train on the last turn only` truncate bug
2024-08-09 17:58:27 +08:00
hoshi-hiyouga
4f62e1cb24 Update template.py 2024-08-09 16:27:42 +08:00
“Wzw”
2fa1e0b2ad mask_history args verify valid 2024-08-08 10:12:01 +08:00
“Wzw”
b5ca86cc07 fix mask_history tiny bug 2024-08-08 10:09:33 +08:00
moontidef
b82ecbedd0 fix: fix the deepseekcoder template to avoid repeat problem 2024-08-05 23:55:45 +08:00
hoshi-hiyouga
8a2846cfe1 Merge pull request #4892 from piamo/main
update deepseek template
2024-07-26 11:49:34 +08:00
hiyouga
091010492b fix #4928 2024-07-24 17:00:29 +08:00
hiyouga
4135e69406 fix flashattn + packing 2024-07-21 17:07:45 +08:00
huangpan.foo
44e48e2b82 update deepseek template 2024-07-19 15:02:54 +08:00
hiyouga
779aae83d2 follow #4878 fix #4684 2024-07-18 22:06:12 +08:00
Shiyu Zhang
1e7b396ff2 仅仅训练最后一轮对话 2024-07-18 15:30:25 +08:00
hoshi-hiyouga
15b399a82f Merge pull request #4691 from codemayq/feature-suppot-eval-dataset
add eval dataset support
2024-07-15 01:00:34 +08:00
hoshi-hiyouga
df52fb05b1 Update preprocess.py 2024-07-15 00:55:36 +08:00
hoshi-hiyouga
84e4047f8a Update parser.py 2024-07-15 00:55:21 +08:00
hoshi-hiyouga
97a0e291c7 Update data_utils.py 2024-07-15 00:54:34 +08:00
hoshi-hiyouga
a5b809516e Update loader.py 2024-07-15 00:50:06 +08:00
hoshi-hiyouga
3d39d74003 Update parser.py 2024-07-14 23:04:34 +08:00
hiyouga
2f6af73da2 fix gemma2 attention 2024-07-13 23:33:45 +08:00
hiyouga
6b48308ef9 fix #4792 2024-07-13 22:07:58 +08:00
hiyouga
53b1002fb7 add codegeex4, internlm2.5 2024-07-06 16:16:47 +08:00
codingma
76f3bbcfc0 1. add custom eval dataset support
2. merge load dataset and split dataset function
2024-07-05 15:52:10 +08:00
hiyouga
9f33f1edf5 fix processors 2024-07-05 08:33:22 +08:00
hiyouga
e43809bced fix #4683 2024-07-05 00:58:05 +08:00
hzhaoy
738df47748 tiny fix 2024-07-04 10:20:28 +08:00
hiyouga
44747cebd2 tiny fix 2024-07-04 03:02:23 +08:00
hiyouga
b5d101e1bf fix data map for packing 2024-07-04 03:01:31 +08:00
hiyouga
6fd6aa4530 fix packing for eager/sdpa attn 2024-07-04 01:52:43 +08:00
hoshi-hiyouga
87d9b2d005 Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention
2024-07-04 01:18:54 +08:00
hiyouga
cce7083024 update packing 2024-07-04 01:10:55 +08:00
hiyouga
575a02a23d update hparams 2024-07-03 23:18:58 +08:00
hiyouga
c47ab6c072 improve rlhf 2024-07-02 22:23:08 +08:00