Commit Graph

110 Commits

Author | SHA1 | Message | Date
hiyouga | fa47c99fa9 | add datasets (Former-commit-id: 7159bc54ed) | 2023-07-19 20:59:15 +08:00
hiyouga | 2b0fced03b | fix Baichuan-13B (Former-commit-id: 08439d29b2) | 2023-07-13 23:08:45 +08:00
zxbsmk | 3b15aacf02 | Support for WebNovel dataset (Former-commit-id: 4955dc9eed) | 2023-07-12 17:29:47 +08:00
hiyouga | 92070c3d7a | add open assistant dataset (Former-commit-id: 3154fec979) | 2023-06-28 23:09:33 +08:00
hiyouga | 9155401bf9 | add belle multiturn dataset (Former-commit-id: 334d1a6d26) | 2023-06-16 20:01:16 +08:00
hiyouga | 1fbda5d139 | support RM metrics, add generating Args (Former-commit-id: cec6524d6b) | 2023-06-12 15:48:48 +08:00
BUAADreamer | 53727aee3e | add code for reading from multi files in one directory (Former-commit-id: 3dd5f9a874) | 2023-06-10 15:53:47 +08:00
hiyouga | ddb456bbcb | remove dummy code (Former-commit-id: a72492e649) | 2023-05-30 16:28:00 +08:00
hiyouga | 4c7c96e656 | add pre-training script (Former-commit-id: 8ff96509fa) | 2023-05-29 21:37:22 +08:00
hiyouga | 54b8ce7b63 | Initial commit (Former-commit-id: 769c6ab56b) | 2023-05-28 18:09:04 +08:00