[data] add coig-p dataset (#7657)

This commit is contained in:
hoshi-hiyouga
2025-04-09 21:18:25 +08:00
committed by GitHub
parent 7dd35cff8a
commit cca359fb6d
11 changed files with 325 additions and 915 deletions

View File

@@ -85,7 +85,7 @@
### 预训练数据集
- [样例数据集](c4_demo.json)
- [样例数据集](c4_demo.jsonl)
在预训练时,只有 `text` 列中的内容会用于模型学习。