Commit Graph

147 Commits

Author SHA1 Message Date
hoshi-hiyouga
1356f9d840 [dataset] add openthought (#6866) 2025-02-09 00:53:01 +08:00
Zhangchi Feng
24c7842948 [model] support audio (#6701)
* support qwen2_audio

* improve code

* lint

* fix

* fix

* fix

---------

Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-02-05 04:59:09 +08:00
hiyouga
046b6fb118 fix dataset 2024-11-27 06:27:44 +00:00
hiyouga
17afb7d410 add marco-o1 and openo1 dataset 2024-11-27 04:20:23 +00:00
hoshi-hiyouga
5214d3ea06 update dataset 2024-11-25 21:47:04 +08:00
hiyouga
21db8ed2f4 use pre-commit 2024-10-29 09:07:46 +00:00
huniu20
0f669f221a 1. add model and dataset info to support webui 2024-10-10 16:46:34 +08:00
hiyouga
54c6905937 add docstrings, refactor logger 2024-09-08 00:56:56 +08:00
hiyouga
70e36ff2f4 update data readme 2024-09-05 04:44:49 +08:00
hiyouga
6055fe02de update data readme 2024-09-05 04:25:27 +08:00
hiyouga
8cafc7b055 video datasets 2024-09-05 02:04:17 +08:00
hiyouga
57497135bf add vl_feedback dataset 2024-09-04 03:13:03 +08:00
hiyouga
194064fdae add pokemon dataset 2024-09-02 01:02:25 +08:00
hiyouga
8e49940746 add rlhf-v dataset 2024-09-01 22:57:41 +08:00
hiyouga
a244f143f4 optimize predict vram 2024-08-30 23:08:45 +08:00
hiyouga
3382317e32 refactor mm training 2024-08-30 02:14:31 +08:00
simonJJJ
aeb85f200b initial-commit 2024-08-28 16:51:35 +08:00
hiyouga
c75b5b83c4 add magpie ultra dataset 2024-08-09 20:28:55 +08:00
hiyouga
608de799a2 add unittest 2024-07-19 01:06:27 +08:00
hiyouga
29ebcd75d5 fix up 2024-07-15 01:04:56 +08:00
hoshi-hiyouga
9d64507bd5 Update README.md 2024-07-14 21:27:04 +08:00
codingma
76f3bbcfc0 1. add custom eval dataset support
2. merge load dataset and split dataset function
2024-07-05 15:52:10 +08:00
hiyouga
9ab0401948 update data 2024-06-19 02:48:43 +08:00
hiyouga
344b9a36b2 tiny fix 2024-06-18 23:32:18 +08:00
Eli Costa
74e49cca95 Add Magpie and Webinstruct dataset samples
Adds two dataset samples claimed superior performance: Magpie (from Allen AI) and Webinstruct (from TIGER-Lab).
2024-06-15 19:31:56 -03:00
hiyouga
c7a5620ccc add neo-sft dataset 2024-06-13 01:00:56 +08:00
hiyouga
12d79f89c5 add ultrafeedback and fineweb #4085 #4132 2024-06-08 02:42:34 +08:00
hoshi-hiyouga
483eb47e5d Merge pull request #3829 from seanzhang-zhichen/add_dataset_sample_num
Add dataset sample num
2024-05-30 00:25:45 +08:00
hoshi-hiyouga
c8ae7e0e65 Update README_zh.md 2024-05-30 00:04:47 +08:00
hoshi-hiyouga
3761d7d5dd Update README.md 2024-05-30 00:04:26 +08:00
hiyouga
08564838bd fix full/freeze tuning for mllm 2024-05-27 20:37:57 +08:00
BUAADreamer
576b0206c2 Merge branch 'main' of https://github.com/BUAADreamer/LLaMA-Factory 2024-05-27 20:11:23 +08:00
BUAADreamer
e2022ce4e9 Merge branch 'hiyouga:main' into main 2024-05-27 20:10:58 +08:00
BUAADreamer
f665342a27 remove mllm_pt_demo.json 2024-05-27 20:10:31 +08:00
hiyouga
08bd0440b5 add llava 1k datasets 2024-05-27 19:57:33 +08:00
seanzhang-zhichen
27cb51f7f8 Merge branch 'main' into add_dataset_sample_num 2024-05-24 15:57:47 +08:00
BUAADreamer
8d53ec2b5f Merge branch 'hiyouga:main' into main 2024-05-21 22:18:20 +08:00
hiyouga
4d647ddba5 Update README_zh.md 2024-05-21 18:30:59 +08:00
BUAADreamer
29a6d5bdb8 support pretraining of llava 2024-05-21 08:57:14 +08:00
hiyouga
7262679666 fix #3818 2024-05-20 21:43:19 +08:00
zhangzc
d956041640 fix conflict 2024-05-20 17:10:01 +08:00
hiyouga
ca48f90f1e update data readme 2024-05-18 21:37:38 +08:00
hiyouga
18cbf8561d update data readme 2024-05-18 21:15:20 +08:00
hiyouga
c450ee87a3 improve KTO impl., replace datasets 2024-05-18 03:44:56 +08:00
enji.zhou
db1d5a4f51 add kto 2024-05-17 13:09:17 +08:00
hiyouga
58c522cd5c remove checksum and fix ui args 2024-05-12 01:10:30 +08:00
codingma
d5520b6017 fix sha1 of glaive_toolcall dataset 2024-05-09 16:33:45 +08:00
hiyouga
1ccbfe562d remove big file 2024-05-07 22:14:06 +08:00
hiyouga
09f3ef1de4 fix stop param 2024-05-07 00:41:04 +08:00
hoshi-hiyouga
d6ca7853fa Merge pull request #3588 from ZeyuTeng96/patch-1
update hf_hub_url for nectar_rm in dataset_info
2024-05-07 00:06:11 +08:00