hiyouga
|
a1b0655457
|
support sharegpt format, add datasets
Former-commit-id: a837172413
|
2023-11-02 23:10:04 +08:00 |
|
hiyouga
|
1cd0ea1f13
|
add MathInstruct dataset
Former-commit-id: 026af87e7f
|
2023-09-13 22:30:14 +08:00 |
|
hiyouga
|
a4fd976048
|
refactor dataset_attr, add eos in pt, fix #757
Former-commit-id: a9d1fb72f7
|
2023-09-01 19:00:45 +08:00 |
|
codemayq
|
d9b9d9d1fe
|
add ad gen dataset
Former-commit-id: 604f85487b
|
2023-08-27 20:35:32 +08:00 |
|
codemayq
|
b032dc4c4e
|
add readme for dataset
Former-commit-id: cece66d48a
|
2023-08-23 19:55:45 +08:00 |
|
codemayq
|
4b29d9d2b0
|
add dataset stage and filter dataset when stage chosen in webui
Former-commit-id: c0e4d1e81b
|
2023-08-23 18:54:23 +08:00 |
|
hiyouga
|
802494e20a
|
update template
Former-commit-id: 4318347d3f
|
2023-08-22 19:46:09 +08:00 |
|
Peter Pan
|
23443e9696
|
add rm dataset explanation
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
Former-commit-id: b0ca8fe634
|
2023-08-22 01:33:59 -04:00 |
|
hiyouga
|
abdfa26d06
|
support DPO training (2305.18290)
Former-commit-id: 3ec4351cfd
|
2023-08-11 03:02:53 +08:00 |
|
hiyouga
|
4898a0a865
|
restore from git lfs
Former-commit-id: b9cdff41bb
|
2023-08-01 16:33:25 +08:00 |
|
hiyouga
|
3b8e33d91c
|
use git lfs
Former-commit-id: 82e793ddb4
|
2023-08-01 10:14:08 +08:00 |
|
hiyouga
|
ba911f988d
|
update dataset
Former-commit-id: f5c2ccdde4
|
2023-07-26 17:05:12 +08:00 |
|
hiyouga
|
d46c136c0e
|
update dataset
Former-commit-id: 182b425043
|
2023-07-23 20:01:43 +08:00 |
|
hiyouga
|
261ca840d0
|
update readme, fix web ui postprocess
Former-commit-id: 035c966d5c
|
2023-07-22 14:29:22 +08:00 |
|
mrhan1993
|
cdd887908c
|
add Chinese README based on GLM Efficient Tuning, add server_port to web
Former-commit-id: 9f0b57b370
|
2023-07-21 16:57:58 +08:00 |
|
hiyouga
|
fa47c99fa9
|
add datasets
Former-commit-id: 7159bc54ed
|
2023-07-19 20:59:15 +08:00 |
|
hiyouga
|
2b0fced03b
|
fix Baichuan-13B
Former-commit-id: 08439d29b2
|
2023-07-13 23:08:45 +08:00 |
|
zxbsmk
|
3b15aacf02
|
Support for WebNovel dataset
Former-commit-id: 4955dc9eed
|
2023-07-12 17:29:47 +08:00 |
|
hiyouga
|
92070c3d7a
|
add open assistant dataset
Former-commit-id: 3154fec979
|
2023-06-28 23:09:33 +08:00 |
|
hiyouga
|
9155401bf9
|
add belle multiturn dataset
Former-commit-id: 334d1a6d26
|
2023-06-16 20:01:16 +08:00 |
|
hiyouga
|
1fbda5d139
|
support RM metrics, add generating Args
Former-commit-id: cec6524d6b
|
2023-06-12 15:48:48 +08:00 |
|
BUAADreamer
|
465264f852
|
update json line file to .jsonl
Former-commit-id: e3b53a67c7
|
2023-06-11 18:59:19 +08:00 |
|
BUAADreamer
|
c4128832e5
|
add some
Former-commit-id: 676d910260
|
2023-06-11 18:55:53 +08:00 |
|
BUAADreamer
|
b1c6ee9cf5
|
add code for reading from multi files in one directory
Former-commit-id: a2af9df5a9
|
2023-06-10 16:27:30 +08:00 |
|
BUAADreamer
|
53727aee3e
|
add code for reading from multi files in one directory
Former-commit-id: 3dd5f9a874
|
2023-06-10 15:53:47 +08:00 |
|
hiyouga
|
ddb456bbcb
|
remove dummy code
Former-commit-id: a72492e649
|
2023-05-30 16:28:00 +08:00 |
|
hiyouga
|
4c7c96e656
|
add pre-training script
Former-commit-id: 8ff96509fa
|
2023-05-29 21:37:22 +08:00 |
|
hiyouga
|
54b8ce7b63
|
Initial commit
Former-commit-id: 769c6ab56b
|
2023-05-28 18:09:04 +08:00 |
|