Yaowei Zheng
|
2b27283ba0
|
[assets] update readme (#9143)
|
2025-09-16 17:04:19 +08:00 |
|
Muqi Li
|
00c4988f89
|
[assets] fix incorrect user_tag in dataset_info.json to prevent skipped entries (#8197)
|
2025-05-28 18:01:37 +08:00 |
|
hoshi-hiyouga
|
8d472c20cb
|
[model] add seed coder and qwen3 quant models (#8039)
|
2025-05-13 15:59:55 +08:00 |
|
hoshi-hiyouga
|
34fdabe005
|
[data] add coig-p dataset (#7657)
|
2025-04-09 21:18:25 +08:00 |
|
Kingsley
|
7d8bee96fc
|
[data] Fix bugs of use_audio_in_video in Qwen2.5 Omni (#7638)
* cache _mm_inputs
* nit
* support for use_audio_in_video
* remove cache
* fix data
* Update mllm_video_audio_demo.json
|
2025-04-08 18:40:10 +08:00 |
|
Victor Nogueira
|
3dff4ecca8
|
[dataset] fix ultrachat_200k dataset (#7259)
The `HuggingFaceH4/ultrachat_200k` dataset doesn't contain the default "train" split. The correct split is "train_sft".
|
2025-03-13 20:20:18 +08:00 |
|
hoshi-hiyouga
|
beb1a9f9d9
|
[data] add r1 distill dataset (#6983)
Former-commit-id: 2591a3fa8b
|
2025-02-18 17:25:09 +08:00 |
|
hoshi-hiyouga
|
fcd0f0480d
|
[dataset] add openthought (#6866)
Former-commit-id: 1356f9d840
|
2025-02-09 00:53:01 +08:00 |
|
Zhangchi Feng
|
01915eaf40
|
[model] support audio (#6701)
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 24c7842948
|
2025-02-05 04:59:09 +08:00 |
|
hiyouga
|
9822cb7bac
|
fix dataset
Former-commit-id: 046b6fb118
|
2024-11-27 06:27:44 +00:00 |
|
hiyouga
|
ab3782b0fa
|
add marco-o1 and openo1 dataset
Former-commit-id: 17afb7d410
|
2024-11-27 04:20:23 +00:00 |
|
hoshi-hiyouga
|
4f1d5b6396
|
update dataset
Former-commit-id: 5214d3ea06
|
2024-11-25 21:47:04 +08:00 |
|
hiyouga
|
0d8aa6e6ef
|
use pre-commit
Former-commit-id: 21db8ed2f4
|
2024-10-29 09:07:46 +00:00 |
|
huniu20
|
132c1f1b0f
|
1. add model and dataset info to support webui
Former-commit-id: 0f669f221a
|
2024-10-10 16:46:34 +08:00 |
|
hiyouga
|
9df7a26e6b
|
video datasets
Former-commit-id: 8cafc7b055
|
2024-09-05 02:04:17 +08:00 |
|
hiyouga
|
af8c4b4e20
|
add vl_feedback dataset
Former-commit-id: 57497135bf
|
2024-09-04 03:13:03 +08:00 |
|
hiyouga
|
549adc888b
|
add pokemon dataset
Former-commit-id: 194064fdae
|
2024-09-02 01:02:25 +08:00 |
|
hiyouga
|
bfdcc6bacf
|
add rlhf-v dataset
Former-commit-id: 8e49940746
|
2024-09-01 22:57:41 +08:00 |
|
hiyouga
|
a83756b5e9
|
refactor mm training
Former-commit-id: 3382317e32
|
2024-08-30 02:14:31 +08:00 |
|
simonJJJ
|
8a09b1e732
|
initial-commit
Former-commit-id: aeb85f200b
|
2024-08-28 16:51:35 +08:00 |
|
hiyouga
|
bea270042b
|
add magpie ultra dataset
Former-commit-id: c75b5b83c4
|
2024-08-09 20:28:55 +08:00 |
|
hiyouga
|
14bc7b0551
|
fix up
Former-commit-id: 29ebcd75d5
|
2024-07-15 01:04:56 +08:00 |
|
codingma
|
74f0d02eb8
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
Former-commit-id: 76f3bbcfc0
|
2024-07-05 15:52:10 +08:00 |
|
hiyouga
|
9e5988717d
|
tiny fix
Former-commit-id: 344b9a36b2
|
2024-06-18 23:32:18 +08:00 |
|
Eli Costa
|
6bbb8b4cd8
|
Add Magpie and Webinstruct dataset samples
Adds samples of two datasets that claim superior performance: Magpie (from Allen AI) and Webinstruct (from TIGER-Lab).
Former-commit-id: 74e49cca95
|
2024-06-15 19:31:56 -03:00 |
|
hiyouga
|
e89d1b1ec3
|
add neo-sft dataset
Former-commit-id: c7a5620ccc
|
2024-06-13 01:00:56 +08:00 |
|
hiyouga
|
3547a26f86
|
add ultrafeedback and fineweb #4085 #4132
Former-commit-id: 12d79f89c5
|
2024-06-08 02:42:34 +08:00 |
|
hiyouga
|
b88ecd71fd
|
fix full/freeze tuning for mllm
Former-commit-id: 08564838bd
|
2024-05-27 20:37:57 +08:00 |
|
BUAADreamer
|
f9ced0480e
|
Merge branch 'main' of https://github.com/BUAADreamer/LLaMA-Factory
Former-commit-id: 576b0206c2
|
2024-05-27 20:11:23 +08:00 |
|
BUAADreamer
|
4a958ab909
|
Merge branch 'hiyouga:main' into main
Former-commit-id: e2022ce4e9
|
2024-05-27 20:10:58 +08:00 |
|
BUAADreamer
|
ea78a629ba
|
remove mllm_pt_demo.json
Former-commit-id: f665342a27
|
2024-05-27 20:10:31 +08:00 |
|
hiyouga
|
db569a2d61
|
add llava 1k datasets
Former-commit-id: 08bd0440b5
|
2024-05-27 19:57:33 +08:00 |
|
BUAADreamer
|
071d674065
|
support pretraining of llava
Former-commit-id: 29a6d5bdb8
|
2024-05-21 08:57:14 +08:00 |
|
hiyouga
|
13d7b48efe
|
improve KTO impl., replace datasets
Former-commit-id: c450ee87a3
|
2024-05-18 03:44:56 +08:00 |
|
enji.zhou
|
03956053b8
|
add kto
Former-commit-id: db1d5a4f51
|
2024-05-17 13:09:17 +08:00 |
|
hiyouga
|
51e0f095a9
|
remove checksum and fix ui args
Former-commit-id: 58c522cd5c
|
2024-05-12 01:10:30 +08:00 |
|
codingma
|
e017fb67d0
|
fix sha1 of glaive_toolcall dataset
Former-commit-id: d5520b6017
|
2024-05-09 16:33:45 +08:00 |
|
hiyouga
|
38c6ce9311
|
remove big file
Former-commit-id: 1ccbfe562d
|
2024-05-07 22:14:06 +08:00 |
|
hiyouga
|
175a7ea951
|
fix stop param
Former-commit-id: 09f3ef1de4
|
2024-05-07 00:41:04 +08:00 |
|
hoshi-hiyouga
|
14c3c8cc8f
|
Merge pull request #3588 from ZeyuTeng96/patch-1
update hf_hub_url for nectar_rm in dataset_info
Former-commit-id: d6ca7853fa
|
2024-05-07 00:06:11 +08:00 |
|
hoshi-hiyouga
|
a13bdb9a2b
|
Update dataset_info.json
Former-commit-id: c3910ab98a
|
2024-05-07 00:05:45 +08:00 |
|
hiyouga
|
92cafef325
|
update example docs
Former-commit-id: f02f87c6fb
|
2024-05-06 22:51:02 +08:00 |
|
ZeyuTeng96
|
96354ca55f
|
update hf_hub_url for nectar_rm in dataset_info
Hi there,
I cannot find "mlinmg/RLAIF-Nectar" on hf; it seems to have been renamed to "AstraMindAI/RLAIF-Nectar", so I am making a PR to update it.
See: https://huggingface.co/datasets/AstraMindAI/RLAIF-Nectar
Former-commit-id: 044af36442
|
2024-05-06 16:44:50 +08:00 |
|
hiyouga
|
c9fce361fb
|
update readme
Former-commit-id: 5ee04d418c
|
2024-04-26 23:39:19 +08:00 |
|
hoshi-hiyouga
|
76f767d5b0
|
Merge pull request #3471 from BUAADreamer/main
add llava_150k en/zh mllm sft data
Former-commit-id: 8f91420223
|
2024-04-26 23:36:41 +08:00 |
|
hoshi-hiyouga
|
5ad1c3dd36
|
Update dataset_info.json
Former-commit-id: c29b257007
|
2024-04-26 23:34:34 +08:00 |
|
BUAADreamer
|
044668af10
|
add llava_150k en/zh mllm sft data
Former-commit-id: a177872010
|
2024-04-26 23:18:58 +08:00 |
|
hiyouga
|
eb14501a52
|
release v0.7.0
Former-commit-id: 168f56683a
|
2024-04-26 23:18:00 +08:00 |
|
hiyouga
|
d2df4c22ab
|
support mllm hf inference
Former-commit-id: e057c8de48
|
2024-04-26 05:34:58 +08:00 |
|
hoshi-hiyouga
|
3e832e53be
|
Update dataset_info.json
Former-commit-id: f8c26e6a34
|
2024-04-26 03:03:36 +08:00 |
|