Zhangchi Feng
01915eaf40
[model] support audio ( #6701 )
...
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 24c78429489809873a1269a735ea5421340b32a2
2025-02-05 04:59:09 +08:00
hiyouga
9822cb7bac
fix dataset
...
Former-commit-id: 046b6fb118e3ea75062c6a759720a1759639e93c
2024-11-27 06:27:44 +00:00
hiyouga
ab3782b0fa
add marco-o1 and openo1 dataset
...
Former-commit-id: 17afb7d4103499a9a090a6624896cfa123e9e1d6
2024-11-27 04:20:23 +00:00
hoshi-hiyouga
4f1d5b6396
update dataset
...
Former-commit-id: 5214d3ea06ac73f1179ca9574d7c7030c92b5ee1
2024-11-25 21:47:04 +08:00
hiyouga
0d8aa6e6ef
use pre-commit
...
Former-commit-id: 21db8ed2f4a0eba203754a92ce0741538e8ee709
2024-10-29 09:07:46 +00:00
huniu20
132c1f1b0f
1. add model and dataset info to support webui
...
Former-commit-id: 0f669f221a31622ec7a53d0baab5da6a7891f9b6
2024-10-10 16:46:34 +08:00
hiyouga
9df7a26e6b
video datasets
...
Former-commit-id: 8cafc7b055a854f483ad1c67f3d487ffd34b5f89
2024-09-05 02:04:17 +08:00
hiyouga
af8c4b4e20
add vl_feedback dataset
...
Former-commit-id: 57497135bf0a956af9c6893177ee97504b9f34ac
2024-09-04 03:13:03 +08:00
hiyouga
549adc888b
add pokemon dataset
...
Former-commit-id: 194064fdae0226dd22522586c9d47c5866a71a8e
2024-09-02 01:02:25 +08:00
hiyouga
bfdcc6bacf
add rlhf-v dataset
...
Former-commit-id: 8e49940746c1a6ff910f07dbefbec14af9d0f3c6
2024-09-01 22:57:41 +08:00
hiyouga
a83756b5e9
refactor mm training
...
Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a
2024-08-30 02:14:31 +08:00
simonJJJ
8a09b1e732
initial-commit
...
Former-commit-id: aeb85f200bd824748008dae6047c2607dfcdf174
2024-08-28 16:51:35 +08:00
hiyouga
bea270042b
add magpie ultra dataset
...
Former-commit-id: c75b5b83c4982a6da1512ad6f9cc4d98cc761094
2024-08-09 20:28:55 +08:00
hiyouga
14bc7b0551
fix up
...
Former-commit-id: 29ebcd75d55f70f2891632eba187b643cc3a9e51
2024-07-15 01:04:56 +08:00
codingma
74f0d02eb8
1. add custom eval dataset support
...
2. merge load dataset and split dataset function
Former-commit-id: 76f3bbcfc0e11aa41f8f5cbebc60b77b987f7901
2024-07-05 15:52:10 +08:00
hiyouga
9e5988717d
tiny fix
...
Former-commit-id: 344b9a36b2e0b60ee61fba171b35a391e3517fed
2024-06-18 23:32:18 +08:00
Eli Costa
6bbb8b4cd8
Add Magpie and Webinstruct dataset samples
...
Adds two dataset samples claimed superior performance: Magpie (from Allen AI) and Webinstruct (from TIGER-Lab).
Former-commit-id: 74e49cca957d0bacd2c1d688e995a7370bef69f7
2024-06-15 19:31:56 -03:00
hiyouga
e89d1b1ec3
add neo-sft dataset
...
Former-commit-id: c7a5620ccc72b7574255ea764693ccb866c48263
2024-06-13 01:00:56 +08:00
hiyouga
3547a26f86
add ultrafeedback and fineweb #4085 #4132
...
Former-commit-id: 12d79f89c5082eb29842b501e1cb88433a248ba3
2024-06-08 02:42:34 +08:00
hiyouga
b88ecd71fd
fix full/freeze tuning for mllm
...
Former-commit-id: 08564838bd02651668845ed74e2e60561e5b6d8c
2024-05-27 20:37:57 +08:00
BUAADreamer
f9ced0480e
Merge branch 'main' of https://github.com/BUAADreamer/LLaMA-Factory
...
Former-commit-id: 576b0206c27f93ffe19e3b7e6df58a3cd2abbb1d
2024-05-27 20:11:23 +08:00
BUAADreamer
4a958ab909
Merge branch 'hiyouga:main' into main
...
Former-commit-id: e2022ce4e90b115fb8271ef0f6bf05e8f39c997f
2024-05-27 20:10:58 +08:00
BUAADreamer
ea78a629ba
remove mllm_pt_demo.json
...
Former-commit-id: f665342a2752ffb5d715f134603d84e5228f55dc
2024-05-27 20:10:31 +08:00
hiyouga
db569a2d61
add llava 1k datasets
...
Former-commit-id: 08bd0440b52dbe2e6d28323900ca1a07751605f9
2024-05-27 19:57:33 +08:00
BUAADreamer
071d674065
support pretraining of llava
...
Former-commit-id: 29a6d5bdb8610be8f796eed65eede9ba7b503527
2024-05-21 08:57:14 +08:00
hiyouga
13d7b48efe
improve KTO impl., replace datasets
...
Former-commit-id: c450ee87a35ff9235f9b695b0de2e042b2971178
2024-05-18 03:44:56 +08:00
enji.zhou
03956053b8
add kto
...
Former-commit-id: db1d5a4f51faae61fe18666057353747b01f5b8d
2024-05-17 13:09:17 +08:00
hiyouga
51e0f095a9
remove checksum and fix ui args
...
Former-commit-id: 58c522cd5cc4498a3fa8ed99424b5d63c9e56ccb
2024-05-12 01:10:30 +08:00
codingma
e017fb67d0
fix sha1 of glaive_toolcall dataset
...
Former-commit-id: d5520b6017df01e807fe3a913ee6654814359d5d
2024-05-09 16:33:45 +08:00
hiyouga
38c6ce9311
remove big file
...
Former-commit-id: 1ccbfe562dabe9a75df729c960e09d6a8bd6382c
2024-05-07 22:14:06 +08:00
hiyouga
175a7ea951
fix stop param
...
Former-commit-id: 09f3ef1de49f97001faa91ef3dc2bd16790f9717
2024-05-07 00:41:04 +08:00
hoshi-hiyouga
14c3c8cc8f
Merge pull request #3588 from ZeyuTeng96/patch-1
...
update hf_hub_url for nectar_rm in dataset_info
Former-commit-id: d6ca7853faf083a7ff5c60feb940983d2577326d
2024-05-07 00:06:11 +08:00
hoshi-hiyouga
a13bdb9a2b
Update dataset_info.json
...
Former-commit-id: c3910ab98ae11b52ff6e6d1faafd3e63256d908e
2024-05-07 00:05:45 +08:00
hiyouga
92cafef325
update example docs
...
Former-commit-id: f02f87c6fbd20adae105c83526baa23dba2042fd
2024-05-06 22:51:02 +08:00
ZeyuTeng96
96354ca55f
update hf_hub_url for nectar_rm in dataset_info
...
Hi there,
I cannot find the "mlinmg/RLAIF-Nectar" on hf, seems like it changed as "AstraMindAI/RLAIF-Nectar". So, making a PR for updating.
See: https://huggingface.co/datasets/AstraMindAI/RLAIF-Nectar
Former-commit-id: 044af364425766ba23373ff21577bc4a9de18e39
2024-05-06 16:44:50 +08:00
hiyouga
c9fce361fb
update readme
...
Former-commit-id: 5ee04d418c2e66a292e7da6d393843fcf3b71dc1
2024-04-26 23:39:19 +08:00
hoshi-hiyouga
76f767d5b0
Merge pull request #3471 from BUAADreamer/main
...
add llava_150k en/zh mllm sft data
Former-commit-id: 8f9142022382d0eedce4356744a281b2ace3b703
2024-04-26 23:36:41 +08:00
hoshi-hiyouga
5ad1c3dd36
Update dataset_info.json
...
Former-commit-id: c29b257007a8de9735ecaf52afffa80fdcee6a24
2024-04-26 23:34:34 +08:00
BUAADreamer
044668af10
add llava_150k en/zh mllm sft data
...
Former-commit-id: a17787201082951ae39c3c10436be4c16346f16a
2024-04-26 23:18:58 +08:00
hiyouga
eb14501a52
release v0.7.0
...
Former-commit-id: 168f56683ae4909ae50edd4859032fad60149d00
2024-04-26 23:18:00 +08:00
hiyouga
d2df4c22ab
support mllm hf inference
...
Former-commit-id: e057c8de486bfbc829240924f9238d6212c917f1
2024-04-26 05:34:58 +08:00
hoshi-hiyouga
3e832e53be
Update dataset_info.json
...
Former-commit-id: f8c26e6a346ca0f18f3b05b6fc7413f3625fb220
2024-04-26 03:03:36 +08:00
BUAADreamer
56028422e8
merge data part to the text stream
...
Former-commit-id: 42c90c8183a49cadb2c2abcc58f6ea27d325231d
2024-04-25 19:58:47 +08:00
BUAADreamer
b6d78b2a64
merge data part to the text stream
...
Former-commit-id: c6dd89918feb25fe8c07857162421ad1706f791f
2024-04-25 19:19:59 +08:00
BUAADreamer
31bce63a10
add llava and instructblip
...
Former-commit-id: cfb485eddff0130422416b50c50e171fccc8103e
2024-04-25 00:22:43 +08:00
BUAADreamer
175b56bced
add multimodal LLM BLIP-2 and InstructBLIP
...
Former-commit-id: 4dcb11eab7bbeac866043d2a7c748b8d06fbd243
2024-04-23 18:45:43 +08:00
hiyouga
12290955d8
add dpo mix dataset
...
Former-commit-id: 6339edefff4eb23a4052fd273d1348f5ab59b47c
2024-04-20 01:31:38 +08:00
hiyouga
db42378f29
fix #3247
...
Former-commit-id: d1fb6c72b532bfd4ccd5b19f56708c8391fa53aa
2024-04-12 17:41:33 +08:00
li.yunhao
0330ba2ae6
fix pile datset hf hub url
...
Former-commit-id: 9c2ef9cdf47e16985e07421b0bea414d161e2456
2024-03-30 16:06:10 +08:00
hiyouga
6646e18c02
add orca_dpo_pairs dataset
...
Former-commit-id: 3271af2afc90f10dcb101aeb9d7e4ef254d2dc0e
2024-03-20 20:09:06 +08:00