hoshi-hiyouga
a13bdb9a2b
Update dataset_info.json
...
Former-commit-id: c3910ab98ae11b52ff6e6d1faafd3e63256d908e
2024-05-07 00:05:45 +08:00
hiyouga
92cafef325
update example docs
...
Former-commit-id: f02f87c6fbd20adae105c83526baa23dba2042fd
2024-05-06 22:51:02 +08:00
ZeyuTeng96
96354ca55f
update hf_hub_url for nectar_rm in dataset_info
...
Hi there,
I cannot find the "mlinmg/RLAIF-Nectar" on hf, seems like it changed as "AstraMindAI/RLAIF-Nectar". So, making a PR for updating.
See: https://huggingface.co/datasets/AstraMindAI/RLAIF-Nectar
Former-commit-id: 044af364425766ba23373ff21577bc4a9de18e39
2024-05-06 16:44:50 +08:00
hoshi-hiyouga
eea8a79e35
Update README_zh.md
...
Former-commit-id: d4d9180c401cb210654792d8052313e8db17fc51
2024-05-02 02:14:55 +08:00
hoshi-hiyouga
2186deceac
Update README.md
...
Former-commit-id: b072ec9d1b18f7e9d5d2c9529eac55d29ca832c8
2024-05-02 02:13:46 +08:00
Lao
f15836c77a
Update README_zh.md
...
Former-commit-id: ce17eccf451649728cf7b45312fd7f75d3a8a246
2024-04-28 23:31:37 +08:00
khazic
db316422a4
Upgrade the second sharegpt format
...
Former-commit-id: 288911fc7b1e12e53f3396c371cf4b4c7300b4bf
2024-04-28 14:30:05 +08:00
khazic
6f0b412265
added the second sharegpt format
...
Former-commit-id: d1ba32e4bb70489a9e6f5d3657988c9b7553a157
2024-04-28 14:27:45 +08:00
hiyouga
c9fce361fb
update readme
...
Former-commit-id: 5ee04d418c2e66a292e7da6d393843fcf3b71dc1
2024-04-26 23:39:19 +08:00
hoshi-hiyouga
76f767d5b0
Merge pull request #3471 from BUAADreamer/main
...
add llava_150k en/zh mllm sft data
Former-commit-id: 8f9142022382d0eedce4356744a281b2ace3b703
2024-04-26 23:36:41 +08:00
hoshi-hiyouga
5ad1c3dd36
Update dataset_info.json
...
Former-commit-id: c29b257007a8de9735ecaf52afffa80fdcee6a24
2024-04-26 23:34:34 +08:00
BUAADreamer
044668af10
add llava_150k en/zh mllm sft data
...
Former-commit-id: a17787201082951ae39c3c10436be4c16346f16a
2024-04-26 23:18:58 +08:00
hiyouga
eb14501a52
release v0.7.0
...
Former-commit-id: 168f56683ae4909ae50edd4859032fad60149d00
2024-04-26 23:18:00 +08:00
hiyouga
d2df4c22ab
support mllm hf inference
...
Former-commit-id: e057c8de486bfbc829240924f9238d6212c917f1
2024-04-26 05:34:58 +08:00
hoshi-hiyouga
3e832e53be
Update dataset_info.json
...
Former-commit-id: f8c26e6a346ca0f18f3b05b6fc7413f3625fb220
2024-04-26 03:03:36 +08:00
hoshi-hiyouga
6275682325
Update mllm_demo.json
...
Former-commit-id: 5ef293387f8bded42364984f804fb8f665ef1f89
2024-04-26 02:58:45 +08:00
hoshi-hiyouga
82b61ccda6
Update and rename llava_instruct_example.json to mllm_demo.json
...
Former-commit-id: 7dcae3dba3dbda4953a0e3993e279cc8c21fc976
2024-04-26 02:57:54 +08:00
BUAADreamer
56028422e8
merge data part to the text stream
...
Former-commit-id: 42c90c8183a49cadb2c2abcc58f6ea27d325231d
2024-04-25 19:58:47 +08:00
BUAADreamer
b6d78b2a64
merge data part to the text stream
...
Former-commit-id: c6dd89918feb25fe8c07857162421ad1706f791f
2024-04-25 19:19:59 +08:00
BUAADreamer
31bce63a10
add llava and instructblip
...
Former-commit-id: cfb485eddff0130422416b50c50e171fccc8103e
2024-04-25 00:22:43 +08:00
BUAADreamer
175b56bced
add multimodal LLM BLIP-2 and InstructBLIP
...
Former-commit-id: 4dcb11eab7bbeac866043d2a7c748b8d06fbd243
2024-04-23 18:45:43 +08:00
hiyouga
12290955d8
add dpo mix dataset
...
Former-commit-id: 6339edefff4eb23a4052fd273d1348f5ab59b47c
2024-04-20 01:31:38 +08:00
hiyouga
db42378f29
fix #3247
...
Former-commit-id: d1fb6c72b532bfd4ccd5b19f56708c8391fa53aa
2024-04-12 17:41:33 +08:00
hiyouga
2f878bde11
support ORPO
...
Former-commit-id: 17bf8a2c3a7bb5b83071c8659cfd8751e894e692
2024-03-31 18:29:50 +08:00
li.yunhao
0330ba2ae6
fix pile datset hf hub url
...
Former-commit-id: 9c2ef9cdf47e16985e07421b0bea414d161e2456
2024-03-30 16:06:10 +08:00
zhangzc
05afeb304d
Supports custom data set sampling quantity
...
Former-commit-id: 449e2aa38e3a6cf301a43c12c121ac24ebf12027
2024-03-27 14:22:50 +08:00
hiyouga
6646e18c02
add orca_dpo_pairs dataset
...
Former-commit-id: 3271af2afc90f10dcb101aeb9d7e4ef254d2dc0e
2024-03-20 20:09:06 +08:00
SirlyDreamer
78359638e3
Follow HF_ENDPOINT environment variable
...
Former-commit-id: e165965341a150f6faa2c072a9281ad99d7e5ce8
2024-03-20 08:31:30 +00:00
hiyouga
566bfad930
update parser
...
Former-commit-id: be99799413e1ba37807a02838bf2d87fd966bf55
2024-03-10 13:35:20 +08:00
hiyouga
9ae1514a75
update readme, add starcoder2, cosmopedia
...
Former-commit-id: 894d183214417b10af64d6add7be082d63e8b1f3
2024-03-03 01:01:46 +08:00
hiyouga
3b16912235
update data
...
Former-commit-id: 32884523c577f329354decb4c916bf1f1bbc9dff
2024-03-02 19:37:18 +08:00
hiyouga
7e2d8b170a
fix #2533
...
Former-commit-id: 1630a4cb8f19a348014113421134b5730d52932f
2024-02-21 22:47:48 +08:00
hiyouga
62b78001b7
fix #2481
...
Former-commit-id: 22acab8aff8cadbba2a67e56af5701c0261ade49
2024-02-15 19:07:47 +08:00
hiyouga
9cf5d89bd1
update data/readme
...
Former-commit-id: a754f6e9ec157ba76178fa8ea8111e0c7b06008b
2024-02-10 21:04:29 +08:00
hiyouga
db2051684b
improve aligner
...
Former-commit-id: 7d2dc83c5e2085da6273241269c9e9d7509ae51b
2024-02-10 16:39:19 +08:00
Mark Mueller
4bd7b8375e
Slim Orca data parsing
...
Former-commit-id: 1d3598afa10797ba0ce30d44f52e7994587c0ce8
2024-02-08 19:32:20 +01:00
Johann-Peter Hartmann
ace1770085
WS fix
...
Former-commit-id: 49c69ea4b97a2507819996dea41a755a29e35e79
2024-02-06 20:13:04 +01:00
Johann-Peter Hartmann
6ff4e9e62c
add ranking to dpo dataset
...
Former-commit-id: 1126563505924a2d7946fa3fad0d9d1756faf987
2024-02-06 20:12:36 +01:00
Johann-Peter Hartmann
77746ad86c
remove comma
...
Former-commit-id: 870182c3a9ce3168db5b40a45daebe33c3d6f0e1
2024-02-03 08:48:39 +01:00
Johann-Peter Hartmann
81edbd1472
Merge branch 'hiyouga:main' into main
...
Former-commit-id: 4e27950acbfaef0ab6b4295d554c8897baedcff0
2024-01-31 14:05:52 +01:00
hiyouga
7beeae2209
fix autoset attn impl, update data readme
...
Former-commit-id: 521ad765521bb65aff5a29a8125a2b26ef00bff4
2024-01-31 11:58:07 +08:00
Johann-Peter Hartmann
c264eb4793
Add support for german datasets
...
Former-commit-id: d9a8301ed46d821c3303b14966978e1165d12f2c
2024-01-30 10:18:01 +01:00
hiyouga
cd4d38e0cc
Update dataset_info.json
...
Former-commit-id: dbaaa4546ec681cfc84da015a67e2a9c79173e02
2024-01-23 00:10:32 +08:00
hiyouga
509e35ffc8
fix #2282 and update tool prompt
...
Former-commit-id: b2fb0eca56ee835438cc20e83f36ac3f6eb95c83
2024-01-22 22:27:30 +08:00
hiyouga
48cab43cb5
add array param format
...
Former-commit-id: 486cc8d3600397812e3927d43ab4181f4e86f5dd
2024-01-21 22:17:48 +08:00
hiyouga
e95c0242a8
fix dataset
...
Former-commit-id: 487dee066f51caea86dc65847bd326e13445de09
2024-01-18 12:59:30 +08:00
hiyouga
7f12aedc08
enable cutoff len
...
Former-commit-id: f1067d2b585cf24b3e48463692ac99a8222161c9
2024-01-18 12:25:42 +08:00
hiyouga
4e3bfb799d
support function calling
...
Former-commit-id: d9f1cae35150cce594a7abd96dd2beb811fa33f2
2024-01-18 09:54:23 +08:00
hiyouga
a52aafdbdc
tiny update
...
Former-commit-id: 5b93d545e2090d8d6db2cee3a047565f834e87f1
2023-12-25 18:29:34 +08:00
hiyouga
1af13cb737
add models
...
Former-commit-id: 709ac8870a17a96e786b32c75ad8c4e573148cee
2023-12-18 19:09:31 +08:00