hiyouga
|
5019c6148b
|
fix #5338
Former-commit-id: a66ddfea218feefde50fa097d20b4bcbe89ab791
|
2024-09-03 17:45:17 +08:00 |
|
naem1023
|
3622856994
|
feat: add batch size of map function in the preprocessed dataset
Former-commit-id: 94b6cf06c2f84d0619b1a2dccaf8abb51de9951c
|
2024-09-02 13:52:47 +09:00 |
|
hiyouga
|
f203a9d78e
|
tiny fix
Former-commit-id: 8b4f408da110d74285bae20bbd969013a979964b
|
2024-09-02 01:33:22 +08:00 |
|
hiyouga
|
bae73e676c
|
add image num check
Former-commit-id: 15201113bf16b748c0a758c7a5b363da8272e0e6
|
2024-09-02 01:31:36 +08:00 |
|
hiyouga
|
60cf12727b
|
add rlhf-v dataset
Former-commit-id: 3fd18fc34a0c994a738504746abfd5548e002437
|
2024-09-01 22:57:41 +08:00 |
|
hiyouga
|
559b84dceb
|
fix bug
Former-commit-id: 6e19e56000dd18d5faf84ceabce8d7708ff21e4d
|
2024-09-01 21:07:49 +08:00 |
|
hiyouga
|
7e4c5d4bb3
|
fix mixed mm inputs and rlhf-v
Former-commit-id: 7c248fac20bf85d57a91132ce7a793c7f84e9218
|
2024-09-01 20:52:47 +08:00 |
|
hiyouga
|
43654028eb
|
add test mm plugin
Former-commit-id: ddea5cca5a3174de1dcc7fdee8ec69e77700b6bf
|
2024-08-31 01:53:38 +08:00 |
|
hiyouga
|
665db18661
|
tiny fix
Former-commit-id: 830511a6d0216da99520aee8b3a753d347a71fa9
|
2024-08-30 03:21:50 +08:00 |
|
hiyouga
|
c62a6ca59d
|
refactor mm training
Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a
|
2024-08-30 02:14:31 +08:00 |
|
hoshi-hiyouga
|
7466fd4387
|
fix bug
Former-commit-id: 365e6df71509569f59c40743c115f1a4b945ef0f
|
2024-08-30 02:05:26 +08:00 |
|
simonJJJ
|
5b4b60cfb5
|
update
Former-commit-id: a968a416d5e513320c97109229ca1e6ddc003cb1
|
2024-08-28 20:22:46 +08:00 |
|
simonJJJ
|
0f3d54d8a0
|
initial-commit
Former-commit-id: b6a39847a10b417b09db4b5512dd835e9e4ce928
|
2024-08-28 16:51:35 +08:00 |
|
hoshi-hiyouga
|
388f0a6e05
|
Merge pull request #5156 from YeQiuO/main
fix Llama-template's system prompt bug
Former-commit-id: 0b57175d3bd029675dae2f55995b7eeb4e9adc7a
|
2024-08-20 00:09:03 +08:00 |
|
hoshi-hiyouga
|
8c13c02c55
|
Update template.py
Former-commit-id: f5a075cb1c90f05bb0de26c6aea718f556c54623
|
2024-08-20 00:03:33 +08:00 |
|
hoshi-hiyouga
|
b681f24f43
|
Update template.py
Former-commit-id: c6822a217e1c296f4aedd9a2c7610acd1dbd443e
|
2024-08-19 23:40:16 +08:00 |
|
Huiyu Chen
|
22a79c169d
|
Add SailorLLM template
Former-commit-id: a594abe0321a718394a97b5a48ded16e2012c1f0
|
2024-08-15 15:10:14 +08:00 |
|
“Wzw”
|
e734222373
|
fix Llama-template's system prompt bug
Former-commit-id: 2e3eddcd0918b0c968ded0df7c82e3dcff870381
|
2024-08-12 19:22:12 +08:00 |
|
hiyouga
|
5af32ce705
|
follow #5115
Former-commit-id: 7d917e03e2df570139bae18227d9c7303a12de2a
|
2024-08-09 18:03:00 +08:00 |
|
hoshi-hiyouga
|
4e8861e653
|
Merge pull request #5115 from YeQiuO/main
fix: `Train on the last turn only` truncate bug
Former-commit-id: 2c6dae45f7a7b72c961489ac407b1b444ab7752e
|
2024-08-09 17:58:27 +08:00 |
|
hoshi-hiyouga
|
46f834ec75
|
Update template.py
Former-commit-id: ae2a5221c109ae3474d219c37433be767abbee91
|
2024-08-09 16:27:42 +08:00 |
|
“Wzw”
|
6ec64a7e56
|
mask_history args verify valid
Former-commit-id: 2f8388b4f4195d934400ad9267d72e10ca4105a3
|
2024-08-08 10:12:01 +08:00 |
|
“Wzw”
|
d71446e387
|
fix mask_history tiny bug
Former-commit-id: cac07aac6196be026f723b2397a343d4fb675973
|
2024-08-08 10:09:33 +08:00 |
|
moontidef
|
7b4f5d3b21
|
fix: fix the deepseekcoder template to avoid repeat problem
Former-commit-id: 56294831115f095135f72490a8a435434b2f0a11
|
2024-08-05 23:55:45 +08:00 |
|
hoshi-hiyouga
|
9f74d36ba4
|
Merge pull request #4892 from piamo/main
update deepseek template
Former-commit-id: 3233efc8404972098665286d9dec7312dd6ecfab
|
2024-07-26 11:49:34 +08:00 |
|
hiyouga
|
ff5ba97970
|
fix #4928
Former-commit-id: 6d557e8959678f9d4edbcb3d5a6dfba14b429b18
|
2024-07-24 17:00:29 +08:00 |
|
hiyouga
|
a770afbff2
|
fix flashattn + packing
Former-commit-id: 4adc6ce4abc718c25f39b316bfc3352d0d01ed1e
|
2024-07-21 17:07:45 +08:00 |
|
huangpan.foo
|
b1a5bf025b
|
update deepseek template
Former-commit-id: f5ca86ec95bb301df42ffaa6923fc3037a224e34
|
2024-07-19 15:02:54 +08:00 |
|
hiyouga
|
4c1513a845
|
follow #4878 fix #4684
Former-commit-id: 4715e5c5b8040b21e5f401f7e969b9fd2757d520
|
2024-07-18 22:06:12 +08:00 |
|
Shiyu Zhang
|
c1e1918db1
|
仅仅训练最后一轮对话
Former-commit-id: ab6198e4c099edeb1a400f58729cd617e8cd8e50
|
2024-07-18 15:30:25 +08:00 |
|
hoshi-hiyouga
|
68365045b4
|
Merge pull request #4691 from codemayq/feature-suppot-eval-dataset
add eval dataset support
Former-commit-id: 51eb379b44fad0336fc96c329ec98dc4528b9c2c
|
2024-07-15 01:00:34 +08:00 |
|
hoshi-hiyouga
|
0bc52c0aae
|
Update preprocess.py
Former-commit-id: da92f4a1b9c12a8e2489b964baba5e2c8e739ef1
|
2024-07-15 00:55:36 +08:00 |
|
hoshi-hiyouga
|
6bf2663b8e
|
Update parser.py
Former-commit-id: 145687997c86b8785e37dd60fbb9f3a5986730a6
|
2024-07-15 00:55:21 +08:00 |
|
hoshi-hiyouga
|
d337de668e
|
Update data_utils.py
Former-commit-id: 5c2a0e3b1d1afd2a9219d935d3421fffffc3a2c9
|
2024-07-15 00:54:34 +08:00 |
|
hoshi-hiyouga
|
ec372f91e9
|
Update loader.py
Former-commit-id: 860e3eb374947b72dcae88cab0a93ef561e3bfb3
|
2024-07-15 00:50:06 +08:00 |
|
hoshi-hiyouga
|
ee17741591
|
Update parser.py
Former-commit-id: b9760df588e64270a140d9111241c62c1cefe781
|
2024-07-14 23:04:34 +08:00 |
|
hiyouga
|
5ab997d484
|
fix gemma2 attention
Former-commit-id: aeafc68e169ae0ea5939cc81cb0cf89f0ca044b6
|
2024-07-13 23:33:45 +08:00 |
|
hiyouga
|
dfc7a7d5cd
|
fix #4792
Former-commit-id: d7547d6b9e4c660897e3ce0f4022e08686c172d5
|
2024-07-13 22:07:58 +08:00 |
|
hiyouga
|
0d6ec70c6f
|
add codegeex4, internlm2.5
Former-commit-id: 349a5fbc934ac289cad44b4e3eb16f458b94710c
|
2024-07-06 16:16:47 +08:00 |
|
codingma
|
5f2bd04799
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
Former-commit-id: 963d97ba07e7efa3a4544c4d077283d9e112b3ad
|
2024-07-05 15:52:10 +08:00 |
|
hiyouga
|
9a1a5f9778
|
fix processors
Former-commit-id: 7215f3a8612b570cd322802d14db532927900117
|
2024-07-05 08:33:22 +08:00 |
|
hiyouga
|
edc8aefa59
|
fix #4683
Former-commit-id: cbff0ea0db6971f8ced503a2f0cb6bc43e7037ac
|
2024-07-05 00:58:05 +08:00 |
|
hzhaoy
|
c6f1bc65c0
|
tiny fix
Former-commit-id: 8f43ad988a4fd518a708fba53a173596ce2c59dd
|
2024-07-04 10:20:28 +08:00 |
|
hiyouga
|
0517d7bee5
|
tiny fix
Former-commit-id: 935703b46d2871ce1014832da067dfe4a50c0610
|
2024-07-04 03:02:23 +08:00 |
|
hiyouga
|
5bc0b9b31c
|
fix data map for packing
Former-commit-id: ee6f8f926f084a195b2dbbd074e041e6c62c6ef4
|
2024-07-04 03:01:31 +08:00 |
|
hiyouga
|
3d219b91b9
|
fix packing for eager/sdpa attn
Former-commit-id: 735a033ceb7f2da6da71d138ea091d8a665411a9
|
2024-07-04 01:52:43 +08:00 |
|
hoshi-hiyouga
|
a90c6306f8
|
Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention
Former-commit-id: ac382cc9fe4ec483658fd54f07f9a123788ce1b1
|
2024-07-04 01:18:54 +08:00 |
|
hiyouga
|
60558388ec
|
update packing
Former-commit-id: f3d9c31efa0e64317bdd5b4ed6f78653cf3b5ba4
|
2024-07-04 01:10:55 +08:00 |
|
hiyouga
|
5acaa476d6
|
update hparams
Former-commit-id: 1c4feac44192b1f540208837f5a530b0d3f5fb37
|
2024-07-03 23:18:58 +08:00 |
|
hiyouga
|
e6ba7ef3e6
|
improve rlhf
Former-commit-id: e441780e3db256ca09a442ea9254e7ce16898a07
|
2024-07-02 22:23:08 +08:00 |
|