19 Commits

Author SHA1 Message Date
hiyouga
1cd0ea1f13 add MathInstruct dataset
Former-commit-id: 026af87e7fce091a0cda1afd6df3d6ab6189de9a
2023-09-13 22:30:14 +08:00
hiyouga
a4fd976048 refactor dataset_attr, add eos in pt, fix #757
Former-commit-id: a9d1fb72f791ae57a4d12f4e3a7e2abccf6a7077
2023-09-01 19:00:45 +08:00
codemayq
d9b9d9d1fe add ad gen dataset
Former-commit-id: 604f85487b46b3eb01b68cb2cc6535b7cb5527a7
2023-08-27 20:35:32 +08:00
codemayq
4b29d9d2b0 add dataset stage and filter dataset when stage chosen in webui
Former-commit-id: c0e4d1e81b41c9a36291d8bee46d7d807c898c21
2023-08-23 18:54:23 +08:00
hiyouga
abdfa26d06 support DPO training (2305.18290)
Former-commit-id: 3ec4351cfdaf2aefcc7d13345e19d79874ed61d3
2023-08-11 03:02:53 +08:00
hiyouga
4898a0a865 restore from git lfs
Former-commit-id: b9cdff41bbb6084380606cde6c875c994b6b1868
2023-08-01 16:33:25 +08:00
hiyouga
3b8e33d91c use git lfs
Former-commit-id: 82e793ddb42d4f1369516dde63dbe4fed28f2e1d
2023-08-01 10:14:08 +08:00
hiyouga
ba911f988d update dataset
Former-commit-id: f5c2ccdde45bfa5648443a901b2ac397d532eceb
2023-07-26 17:05:12 +08:00
hiyouga
d46c136c0e update dataset
Former-commit-id: 182b42504399d2755897b9737db1d36655a0fa50
2023-07-23 20:01:43 +08:00
hiyouga
fa47c99fa9 add datasets
Former-commit-id: 7159bc54ed0f1bba974662a87ba5039d9aacadee
2023-07-19 20:59:15 +08:00
hiyouga
2b0fced03b fix Baichuan-13B
Former-commit-id: 08439d29b2031ffbe77fe581c148e6d94e68bfc4
2023-07-13 23:08:45 +08:00
zxbsmk
3b15aacf02 Support for WebNovel dataset
Former-commit-id: 4955dc9eed33a904e4e2b9d5985b3fda87c3674a
2023-07-12 17:29:47 +08:00
hiyouga
92070c3d7a add open assistant dataset
Former-commit-id: 3154fec979aba48f54b7afde3740c4990d445a41
2023-06-28 23:09:33 +08:00
hiyouga
9155401bf9 add belle multiturn dataset
Former-commit-id: 334d1a6d26a0c814b86bdfe68fe291c0513123fd
2023-06-16 20:01:16 +08:00
hiyouga
1fbda5d139 support RM metrics, add generating Args
Former-commit-id: cec6524d6b1be65c5d171a5b3dcaae7818132bc5
2023-06-12 15:48:48 +08:00
BUAADreamer
53727aee3e add code for reading from multi files in one directory
Former-commit-id: 3dd5f9a874d66353bb4379bbe39a89cd425dac3d
2023-06-10 15:53:47 +08:00
hiyouga
ddb456bbcb remove dummy code
Former-commit-id: a72492e6490c44a7edccd572da73c47d6f278cc7
2023-05-30 16:28:00 +08:00
hiyouga
4c7c96e656 add pre-training script
Former-commit-id: 8ff96509fa621054368919988a15b50da1891852
2023-05-29 21:37:22 +08:00
hiyouga
54b8ce7b63 Initial commit
Former-commit-id: 769c6ab56be0c9d26e9289f61ac54a4068d935c1
2023-05-28 18:09:04 +08:00