yuze.zyz
c523613f0a
support ms dataset
...
Former-commit-id: 9c2247d700763f480d88a5dd46480cb32cfc174e
2023-12-08 18:00:57 +08:00
hiyouga
9a6b694e12
fix #1696
...
Former-commit-id: bf6f6aeefe65b4949633648b8711525c0029c001
2023-12-01 15:34:50 +08:00
Marco
a26f68ba47
Update dataset_info.json
...
Added the Nectar dataset, already preprocessed and split into SFT and RL subsets. I added a preprompt to each instruction, since this has been seen to improve instruction following.
Former-commit-id: 9468ee9012bfe7124fc5cc2acebcfe03a6d0cdee
2023-11-30 16:21:34 +01:00
hiyouga
303956cbb9
update dataset
...
Former-commit-id: 7b1aa6f63c79c0d9cb5249fdb0d6a5f9a04f36bd
2023-11-17 23:19:12 +08:00
hiyouga
f441932bd1
support full-parameter PPO
...
Former-commit-id: ce783036001397a20b0b4c5da2fea6d0c03389d2
2023-11-16 02:08:04 +08:00
hiyouga
38755bced7
add template, modify datasets
...
Former-commit-id: 386f590209e466b51c17a7ac8cee55fc3ce928d7
2023-11-09 15:53:23 +08:00
hiyouga
b2bf10661b
update data readme
...
Former-commit-id: 2b5e33c338e6e8b10c4cbaa68ed26ef3b38ad5f9
2023-11-03 00:15:23 +08:00
hiyouga
a9db89a025
update data readme (zh)
...
Former-commit-id: cc8ffa10d877f5893f3940204e5bec6f3266559f
2023-11-02 23:42:49 +08:00
hiyouga
a1b0655457
support sharegpt format, add datasets
...
Former-commit-id: a8371724130db2fbd7273a480e2acb251e382aec
2023-11-02 23:10:04 +08:00
hiyouga
1cd0ea1f13
add MathInstruct dataset
...
Former-commit-id: 026af87e7fce091a0cda1afd6df3d6ab6189de9a
2023-09-13 22:30:14 +08:00
hiyouga
a4fd976048
refactor dataset_attr, add eos in pt, fix #757
...
Former-commit-id: a9d1fb72f791ae57a4d12f4e3a7e2abccf6a7077
2023-09-01 19:00:45 +08:00
codemayq
d9b9d9d1fe
add ad gen dataset
...
Former-commit-id: 604f85487b46b3eb01b68cb2cc6535b7cb5527a7
2023-08-27 20:35:32 +08:00
codemayq
b032dc4c4e
add readme for dataset
...
Former-commit-id: cece66d48a770e3e418496445d4040e3cafa9411
2023-08-23 19:55:45 +08:00
codemayq
4b29d9d2b0
add dataset stage and filter dataset when stage chosen in webui
...
Former-commit-id: c0e4d1e81b41c9a36291d8bee46d7d807c898c21
2023-08-23 18:54:23 +08:00
hiyouga
802494e20a
update template
...
Former-commit-id: 4318347d3f1982c773dad1074636ec7b550770fd
2023-08-22 19:46:09 +08:00
Peter Pan
23443e9696
add rm dataset explanation
...
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
Former-commit-id: b0ca8fe634c35073bb156447ff45c5a8eb54aca1
2023-08-22 01:33:59 -04:00
hiyouga
abdfa26d06
support DPO training (2305.18290)
...
Former-commit-id: 3ec4351cfdaf2aefcc7d13345e19d79874ed61d3
2023-08-11 03:02:53 +08:00
hiyouga
4898a0a865
restore from git lfs
...
Former-commit-id: b9cdff41bbb6084380606cde6c875c994b6b1868
2023-08-01 16:33:25 +08:00
hiyouga
3b8e33d91c
use git lfs
...
Former-commit-id: 82e793ddb42d4f1369516dde63dbe4fed28f2e1d
2023-08-01 10:14:08 +08:00
hiyouga
ba911f988d
update dataset
...
Former-commit-id: f5c2ccdde45bfa5648443a901b2ac397d532eceb
2023-07-26 17:05:12 +08:00
hiyouga
d46c136c0e
update dataset
...
Former-commit-id: 182b42504399d2755897b9737db1d36655a0fa50
2023-07-23 20:01:43 +08:00
hiyouga
261ca840d0
update readme, fix web ui postprocess
...
Former-commit-id: 035c966d5c1a2c7b9e9cba8ad06182a6672eabd4
2023-07-22 14:29:22 +08:00
mrhan1993
cdd887908c
Add Chinese README based on GLM Efficient Tuning; add server_port to web
...
Former-commit-id: 9f0b57b3701fa73f719cd5a319b1584454481bbb
2023-07-21 16:57:58 +08:00
hiyouga
fa47c99fa9
add datasets
...
Former-commit-id: 7159bc54ed0f1bba974662a87ba5039d9aacadee
2023-07-19 20:59:15 +08:00
hiyouga
2b0fced03b
fix Baichuan-13B
...
Former-commit-id: 08439d29b2031ffbe77fe581c148e6d94e68bfc4
2023-07-13 23:08:45 +08:00
zxbsmk
3b15aacf02
Support for WebNovel dataset
...
Former-commit-id: 4955dc9eed33a904e4e2b9d5985b3fda87c3674a
2023-07-12 17:29:47 +08:00
hiyouga
92070c3d7a
add open assistant dataset
...
Former-commit-id: 3154fec979aba48f54b7afde3740c4990d445a41
2023-06-28 23:09:33 +08:00
hiyouga
9155401bf9
add belle multiturn dataset
...
Former-commit-id: 334d1a6d26a0c814b86bdfe68fe291c0513123fd
2023-06-16 20:01:16 +08:00
hiyouga
1fbda5d139
support RM metrics, add generating Args
...
Former-commit-id: cec6524d6b1be65c5d171a5b3dcaae7818132bc5
2023-06-12 15:48:48 +08:00
BUAADreamer
465264f852
update json line file to .jsonl
...
Former-commit-id: e3b53a67c7004769cbc6b3a17089f772687d9657
2023-06-11 18:59:19 +08:00
BUAADreamer
c4128832e5
add some
...
Former-commit-id: 676d910260f3bd0e360c40f8340f01b88a7fa06c
2023-06-11 18:55:53 +08:00
BUAADreamer
b1c6ee9cf5
add code for reading from multi files in one directory
...
Former-commit-id: a2af9df5a99ad529d0a280099b115cde69e02973
2023-06-10 16:27:30 +08:00
BUAADreamer
53727aee3e
add code for reading from multi files in one directory
...
Former-commit-id: 3dd5f9a874d66353bb4379bbe39a89cd425dac3d
2023-06-10 15:53:47 +08:00
hiyouga
ddb456bbcb
remove dummy code
...
Former-commit-id: a72492e6490c44a7edccd572da73c47d6f278cc7
2023-05-30 16:28:00 +08:00
hiyouga
4c7c96e656
add pre-training script
...
Former-commit-id: 8ff96509fa621054368919988a15b50da1891852
2023-05-29 21:37:22 +08:00
hiyouga
54b8ce7b63
Initial commit
...
Former-commit-id: 769c6ab56be0c9d26e9289f61ac54a4068d935c1
2023-05-28 18:09:04 +08:00