Commit Graph

108 Commits

Author SHA1 Message Date
hiyouga
08bd0440b5 add llava 1k datasets 2024-05-27 19:57:33 +08:00
hiyouga
4d647ddba5 Update README_zh.md 2024-05-21 18:30:59 +08:00
hiyouga
7262679666 fix #3818 2024-05-20 21:43:19 +08:00
hiyouga
ca48f90f1e update data readme 2024-05-18 21:37:38 +08:00
hiyouga
18cbf8561d update data readme 2024-05-18 21:15:20 +08:00
hiyouga
c450ee87a3 improve KTO impl., replace datasets 2024-05-18 03:44:56 +08:00
enji.zhou
db1d5a4f51 add kto 2024-05-17 13:09:17 +08:00
hiyouga
58c522cd5c remove checksum and fix ui args 2024-05-12 01:10:30 +08:00
codingma
d5520b6017 fix sha1 of glaive_toolcall dataset 2024-05-09 16:33:45 +08:00
hiyouga
1ccbfe562d remove big file 2024-05-07 22:14:06 +08:00
hiyouga
09f3ef1de4 fix stop param 2024-05-07 00:41:04 +08:00
hoshi-hiyouga
d6ca7853fa Merge pull request #3588 from ZeyuTeng96/patch-1
update hf_hub_url for nectar_rm in dataset_info
2024-05-07 00:06:11 +08:00
hoshi-hiyouga
c3910ab98a Update dataset_info.json 2024-05-07 00:05:45 +08:00
hiyouga
f02f87c6fb update example docs 2024-05-06 22:51:02 +08:00
ZeyuTeng96
044af36442 update hf_hub_url for nectar_rm in dataset_info
Hi there,

I cannot find the "mlinmg/RLAIF-Nectar" on hf, seems like it changed as "AstraMindAI/RLAIF-Nectar". So, making a PR for updating.

See: https://huggingface.co/datasets/AstraMindAI/RLAIF-Nectar
2024-05-06 16:44:50 +08:00
hoshi-hiyouga
d4d9180c40 Update README_zh.md 2024-05-02 02:14:55 +08:00
hoshi-hiyouga
b072ec9d1b Update README.md 2024-05-02 02:13:46 +08:00
Lao
ce17eccf45 Update README_zh.md 2024-04-28 23:31:37 +08:00
khazic
288911fc7b Upgrade the second sharegpt format 2024-04-28 14:30:05 +08:00
khazic
d1ba32e4bb added the second sharegpt format 2024-04-28 14:27:45 +08:00
hiyouga
5ee04d418c update readme 2024-04-26 23:39:19 +08:00
hoshi-hiyouga
8f91420223 Merge pull request #3471 from BUAADreamer/main
add llava_150k en/zh mllm sft data
2024-04-26 23:36:41 +08:00
hoshi-hiyouga
c29b257007 Update dataset_info.json 2024-04-26 23:34:34 +08:00
BUAADreamer
a177872010 add llava_150k en/zh mllm sft data 2024-04-26 23:18:58 +08:00
hiyouga
168f56683a release v0.7.0 2024-04-26 23:18:00 +08:00
hiyouga
e057c8de48 support mllm hf inference 2024-04-26 05:34:58 +08:00
hoshi-hiyouga
f8c26e6a34 Update dataset_info.json 2024-04-26 03:03:36 +08:00
hoshi-hiyouga
5ef293387f Update mllm_demo.json 2024-04-26 02:58:45 +08:00
hoshi-hiyouga
7dcae3dba3 Update and rename llava_instruct_example.json to mllm_demo.json 2024-04-26 02:57:54 +08:00
BUAADreamer
42c90c8183 merge data part to the text stream 2024-04-25 19:58:47 +08:00
BUAADreamer
c6dd89918f merge data part to the text stream 2024-04-25 19:19:59 +08:00
BUAADreamer
cfb485eddf add llava and instructblip 2024-04-25 00:22:43 +08:00
BUAADreamer
4dcb11eab7 add multimodal LLM BLIP-2 and InstructBLIP 2024-04-23 18:45:43 +08:00
hiyouga
6339edefff add dpo mix dataset 2024-04-20 01:31:38 +08:00
hiyouga
d1fb6c72b5 fix #3247 2024-04-12 17:41:33 +08:00
hiyouga
17bf8a2c3a support ORPO 2024-03-31 18:29:50 +08:00
li.yunhao
9c2ef9cdf4 fix pile datset hf hub url 2024-03-30 16:06:10 +08:00
hiyouga
3271af2afc add orca_dpo_pairs dataset 2024-03-20 20:09:06 +08:00
SirlyDreamer
e165965341 Follow HF_ENDPOINT environment variable 2024-03-20 08:31:30 +00:00
hiyouga
be99799413 update parser 2024-03-10 13:35:20 +08:00
hiyouga
894d183214 update readme, add starcoder2, cosmopedia 2024-03-03 01:01:46 +08:00
hiyouga
32884523c5 update data 2024-03-02 19:37:18 +08:00
hiyouga
1630a4cb8f fix #2533 2024-02-21 22:47:48 +08:00
hiyouga
22acab8aff fix #2481 2024-02-15 19:07:47 +08:00
hiyouga
a754f6e9ec update data/readme 2024-02-10 21:04:29 +08:00
hiyouga
7d2dc83c5e improve aligner 2024-02-10 16:39:19 +08:00
Mark Mueller
1d3598afa1 Slim Orca data parsing 2024-02-08 19:32:20 +01:00
Johann-Peter Hartmann
49c69ea4b9 WS fix 2024-02-06 20:13:04 +01:00
Johann-Peter Hartmann
1126563505 add ranking to dpo dataset 2024-02-06 20:12:36 +01:00
Johann-Peter Hartmann
870182c3a9 remove comma 2024-02-03 08:48:39 +01:00