LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2025-12-29 10:10:35 +08:00

Author	SHA1	Message	Date
Yaowei Zheng	2b27283ba0	[assets] update readme (#9143 )	2025-09-16 17:04:19 +08:00
Muqi Li	00c4988f89	[assets] fix incorrect user_tag in dataset_info.json to prevent skipped entries (#8197 )	2025-05-28 18:01:37 +08:00
hoshi-hiyouga	8d472c20cb	[model] add seed coder and qwen3 quant models (#8039 )	2025-05-13 15:59:55 +08:00
hoshi-hiyouga	34fdabe005	[data] add coig-p dataset (#7657 )	2025-04-09 21:18:25 +08:00
Kingsley	7d8bee96fc	[data] Fix bugs of `use_audio_in_video` in Qwen2.5 Omni (#7638 ) * cache _mm_inputs * nit * support for use_audio_in_video * remove cache * fix data * Update mllm_video_audio_demo.json	2025-04-08 18:40:10 +08:00
Victor Nogueira	3dff4ecca8	[dataset] fix ultrachat_200k dataset (#7259 ) The `HuggingFaceH4/ultrachat_200k` dataset doesn't contain the default "train" split. The correct split is "train_sft".	2025-03-13 20:20:18 +08:00
hoshi-hiyouga	beb1a9f9d9	[data] add r1 distill dataset (#6983 ) Former-commit-id: `2591a3fa8b`	2025-02-18 17:25:09 +08:00
hoshi-hiyouga	fcd0f0480d	[dataset] add openthought (#6866 ) Former-commit-id: `1356f9d840`	2025-02-09 00:53:01 +08:00
Zhangchi Feng	01915eaf40	[model] support audio (#6701 ) * support qwen2_audio * improve code * lint * fix * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `24c7842948`	2025-02-05 04:59:09 +08:00
hiyouga	9822cb7bac	fix dataset Former-commit-id: `046b6fb118`	2024-11-27 06:27:44 +00:00
hiyouga	ab3782b0fa	add marco-o1 and openo1 dataset Former-commit-id: `17afb7d410`	2024-11-27 04:20:23 +00:00
hoshi-hiyouga	4f1d5b6396	update dataset Former-commit-id: `5214d3ea06`	2024-11-25 21:47:04 +08:00
hiyouga	0d8aa6e6ef	use pre-commit Former-commit-id: `21db8ed2f4`	2024-10-29 09:07:46 +00:00
huniu20	132c1f1b0f	1. add model and dataset info to support webui Former-commit-id: `0f669f221a`	2024-10-10 16:46:34 +08:00
hiyouga	9df7a26e6b	video datasets Former-commit-id: `8cafc7b055`	2024-09-05 02:04:17 +08:00
hiyouga	af8c4b4e20	add vl_feedback dataset Former-commit-id: `57497135bf`	2024-09-04 03:13:03 +08:00
hiyouga	549adc888b	add pokemon dataset Former-commit-id: `194064fdae`	2024-09-02 01:02:25 +08:00
hiyouga	bfdcc6bacf	add rlhf-v dataset Former-commit-id: `8e49940746`	2024-09-01 22:57:41 +08:00
hiyouga	a83756b5e9	refactor mm training Former-commit-id: `3382317e32`	2024-08-30 02:14:31 +08:00
simonJJJ	8a09b1e732	initial-commit Former-commit-id: `aeb85f200b`	2024-08-28 16:51:35 +08:00
hiyouga	bea270042b	add magpie ultra dataset Former-commit-id: `c75b5b83c4`	2024-08-09 20:28:55 +08:00
hiyouga	14bc7b0551	fix up Former-commit-id: `29ebcd75d5`	2024-07-15 01:04:56 +08:00
codingma	74f0d02eb8	1. add custom eval dataset support 2. merge load dataset and split dataset function Former-commit-id: `76f3bbcfc0`	2024-07-05 15:52:10 +08:00
hiyouga	9e5988717d	tiny fix Former-commit-id: `344b9a36b2`	2024-06-18 23:32:18 +08:00
Eli Costa	6bbb8b4cd8	Add Magpie and Webinstruct dataset samples Adds two dataset samples claimed superior performance: Magpie (from Allen AI) and Webinstruct (from TIGER-Lab). Former-commit-id: `74e49cca95`	2024-06-15 19:31:56 -03:00
hiyouga	e89d1b1ec3	add neo-sft dataset Former-commit-id: `c7a5620ccc`	2024-06-13 01:00:56 +08:00
hiyouga	3547a26f86	add ultrafeedback and fineweb #4085 #4132 Former-commit-id: `12d79f89c5`	2024-06-08 02:42:34 +08:00
hiyouga	b88ecd71fd	fix full/freeze tuning for mllm Former-commit-id: `08564838bd`	2024-05-27 20:37:57 +08:00
BUAADreamer	f9ced0480e	Merge branch 'main' of https://github.com/BUAADreamer/LLaMA-Factory Former-commit-id: `576b0206c2`	2024-05-27 20:11:23 +08:00
BUAADreamer	4a958ab909	Merge branch 'hiyouga:main' into main Former-commit-id: `e2022ce4e9`	2024-05-27 20:10:58 +08:00
BUAADreamer	ea78a629ba	remove mllm_pt_demo.json Former-commit-id: `f665342a27`	2024-05-27 20:10:31 +08:00
hiyouga	db569a2d61	add llava 1k datasets Former-commit-id: `08bd0440b5`	2024-05-27 19:57:33 +08:00
BUAADreamer	071d674065	support pretraining of llava Former-commit-id: `29a6d5bdb8`	2024-05-21 08:57:14 +08:00
hiyouga	13d7b48efe	improve KTO impl., replace datasets Former-commit-id: `c450ee87a3`	2024-05-18 03:44:56 +08:00
enji.zhou	03956053b8	add kto Former-commit-id: `db1d5a4f51`	2024-05-17 13:09:17 +08:00
hiyouga	51e0f095a9	remove checksum and fix ui args Former-commit-id: `58c522cd5c`	2024-05-12 01:10:30 +08:00
codingma	e017fb67d0	fix sha1 of glaive_toolcall dataset Former-commit-id: `d5520b6017`	2024-05-09 16:33:45 +08:00
hiyouga	38c6ce9311	remove big file Former-commit-id: `1ccbfe562d`	2024-05-07 22:14:06 +08:00
hiyouga	175a7ea951	fix stop param Former-commit-id: `09f3ef1de4`	2024-05-07 00:41:04 +08:00
hoshi-hiyouga	14c3c8cc8f	Merge pull request #3588 from ZeyuTeng96/patch-1 update hf_hub_url for nectar_rm in dataset_info Former-commit-id: `d6ca7853fa`	2024-05-07 00:06:11 +08:00
hoshi-hiyouga	a13bdb9a2b	Update dataset_info.json Former-commit-id: `c3910ab98a`	2024-05-07 00:05:45 +08:00
hiyouga	92cafef325	update example docs Former-commit-id: `f02f87c6fb`	2024-05-06 22:51:02 +08:00
ZeyuTeng96	96354ca55f	update hf_hub_url for nectar_rm in dataset_info Hi there, I cannot find the "mlinmg/RLAIF-Nectar" on hf, seems like it changed as "AstraMindAI/RLAIF-Nectar". So, making a PR for updating. See: https://huggingface.co/datasets/AstraMindAI/RLAIF-Nectar Former-commit-id: `044af36442`	2024-05-06 16:44:50 +08:00
hiyouga	c9fce361fb	update readme Former-commit-id: `5ee04d418c`	2024-04-26 23:39:19 +08:00
hoshi-hiyouga	76f767d5b0	Merge pull request #3471 from BUAADreamer/main add llava_150k en/zh mllm sft data Former-commit-id: `8f91420223`	2024-04-26 23:36:41 +08:00
hoshi-hiyouga	5ad1c3dd36	Update dataset_info.json Former-commit-id: `c29b257007`	2024-04-26 23:34:34 +08:00
BUAADreamer	044668af10	add llava_150k en/zh mllm sft data Former-commit-id: `a177872010`	2024-04-26 23:18:58 +08:00
hiyouga	eb14501a52	release v0.7.0 Former-commit-id: `168f56683a`	2024-04-26 23:18:00 +08:00
hiyouga	d2df4c22ab	support mllm hf inference Former-commit-id: `e057c8de48`	2024-04-26 05:34:58 +08:00
hoshi-hiyouga	3e832e53be	Update dataset_info.json Former-commit-id: `f8c26e6a34`	2024-04-26 03:03:36 +08:00

1 2 3

110 Commits