hoshi-hiyouga
1f4a0b11ba
[data] update vlm args ( #6976 )
...
Former-commit-id: 3da2cc2710
2025-02-18 02:12:51 +08:00
hoshi-hiyouga
b1d31ff0f9
[data] add min resolution option ( #6975 )
...
Former-commit-id: 7faecc0301
2025-02-18 01:40:46 +08:00
hoshi-hiyouga
a8c9d5663d
[data] fix predict dataset ( #6972 )
...
Former-commit-id: bdb581c4a8
2025-02-17 20:29:40 +08:00
hoshi-hiyouga
475a355b82
[assets] update wechat ( #6963 )
...
Former-commit-id: ad0c6c8916
2025-02-17 15:23:17 +08:00
Zhangchi Feng
3dc938268c
[data] fix minicpmo template ( #6946 )
...
Former-commit-id: 2faf8aeff8
2025-02-15 00:37:41 +08:00
Eric Tang
e55ec42d3c
[ray] specify ray storage path ( #6920 )
...
Former-commit-id: 6edd4992d7
2025-02-14 21:55:41 +08:00
hoshi-hiyouga
2baf8bf03d
[misc] fix lora regex ( #6944 )
...
* fix lora regex
* fix
Former-commit-id: 1ada3ae5a3
2025-02-14 21:38:43 +08:00
hoshi-hiyouga
13e1b7ee2b
[misc] fix grad ckpt ( #6931 )
...
Former-commit-id: c31c63b411
2025-02-13 23:27:51 +08:00
hoshi-hiyouga
cd493b91de
[model] add liger kernel to qwen2_5 vl ( #6930 )
...
* add liger kernel to qwen2_5 vl
* fix patch
* fix patch
Former-commit-id: 797043d29c
2025-02-13 23:05:54 +08:00
Billy Cao
48173b606c
[trainer] fix gen_kwarg to eval during training ( #5451 )
...
* Correctly pass gen_kwarg to eval during model runs
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 11eac71c13
2025-02-13 02:35:06 +08:00
SrWYG
0ad9f7f058
[data] evaluate on each dataset ( #5522 )
...
* [Update] loader.py: evaluate will run separate evaluations on each dataset.
`If you pass a dictionary with names of datasets as keys and datasets as values, evaluate will run separate evaluations on each dataset. This can be useful to monitor how training affects other datasets or simply to get a more fine-grained evaluation`
Seq2SeqTrainer supports eval_dataset as a Dict.
* fix format
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 1e35967ae1
2025-02-13 02:19:03 +08:00
Noah
1adb46875f
[data] improve error handling ( #6128 )
...
* sync from upstream
* update
* update
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 4c7bfebcf1
2025-02-13 01:39:41 +08:00
hoshi-hiyouga
9b852ebe25
[misc] update readme ( #6918 )
...
Former-commit-id: 8956c93d9b
2025-02-13 01:01:41 +08:00
hoshi-hiyouga
07aa7b71a3
[misc] update readme ( #6917 )
...
Former-commit-id: 499ea45d1f
2025-02-13 00:58:10 +08:00
hoshi-hiyouga
1679930e00
[breaking change] refactor data pipeline ( #6901 )
...
* refactor data
* rename file
Former-commit-id: 617c8ab467
2025-02-13 00:39:20 +08:00
Eric Tang
d50e04b805
[misc] support for launching LLaMA-Factory with uv run ( #6907 )
...
* yay
* uv with ray temporary commit
* remove ray specific code for now
* cleanup
Former-commit-id: f8a206125d
2025-02-13 00:38:44 +08:00
Eric Tang
e515fe62de
[example] fix path to ray example ( #6906 )
...
Former-commit-id: ee5fe216dc
2025-02-13 00:29:32 +08:00
hoshi-hiyouga
036fb0d561
[misc] fix grad ckpt func ( #6916 )
...
Former-commit-id: e34c3c06da
2025-02-13 00:17:18 +08:00
marko1616
bae934dea3
[trainer] fix llama3.2 vision kto train ( #6904 )
...
Former-commit-id: b7fd1e9c00
2025-02-12 19:09:14 +08:00
hoshi-hiyouga
2e2f6bea07
[data] feat: auto template ( #6905 )
...
* support auto template
* add unittest
Former-commit-id: 2f8b6847f5
2025-02-12 00:22:53 +08:00
hoshi-hiyouga
1b02183da9
[misc] update readme ( #6903 )
...
Former-commit-id: 18179a3823
2025-02-11 22:51:26 +08:00
hoshi-hiyouga
197aa3baf4
[data] fix ollama template ( #6902 )
...
* fix ollama template
* add meta info
* use half precision
Former-commit-id: e1a7c1242c
2025-02-11 22:43:09 +08:00
hoshi-hiyouga
c6be9e242c
[misc] support export ollama modelfile ( #6899 )
...
* support export ollama modelfile
* update config
* add system and num ctx
Former-commit-id: 9184a6e0ed
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
2e954d8fd2
[data] refactor template ( #6896 )
...
Former-commit-id: d1b8aa3835
2025-02-11 17:59:25 +08:00
codingma
fafa3add84
support ollama modelfile export ( #4686 )
...
Former-commit-id: 7f354b80bc
2025-02-11 17:52:24 +08:00
hoshi-hiyouga
593acca556
[data] refactor mm plugin ( #6895 )
...
* refactor plugin
* lint
Former-commit-id: aca63bfcca
2025-02-11 16:34:49 +08:00
HJ
188f22d8a7
[data] fix qwen_2_5_vl video processing ( #6868 )
...
* fix qwen_2_5_vl video processing
* Update mm_plugin.py
* Update mm_plugin.py
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 9153a7bd83
2025-02-11 16:14:50 +08:00
hoshi-hiyouga
703bb9cc18
[assets] update wechat ( #6892 )
...
Former-commit-id: fc5d47401f
2025-02-11 13:56:26 +08:00
Zhangchi Feng
5433b318bb
[data] fix minicpmv plugin ( #6890 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
* update init audio
* update init audio
* [model] fix image process in minicpmo
* fix no mm inputs
Former-commit-id: 764627645a
2025-02-11 13:30:44 +08:00
HJ
fe4f4e9758
[data] fix: sharegpt converter ( #6879 )
...
* fix-sharegpt-format
* fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 0fb44cb3a5
2025-02-10 21:59:12 +08:00
hoshi-hiyouga
1bb3d17d9e
[data] fix mllama collator ( #6874 )
...
Former-commit-id: b68199db27
2025-02-09 22:42:25 +08:00
hoshi-hiyouga
b93333685b
[test] align test cases ( #6865 )
...
* align test cases
* fix function formatter
Former-commit-id: f6f3f8d0fc
2025-02-09 01:03:49 +08:00
hoshi-hiyouga
fcd0f0480d
[dataset] add openthought ( #6866 )
...
Former-commit-id: 1356f9d840
2025-02-09 00:53:01 +08:00
hoshi-hiyouga
ff6658ad27
[deps] upgrade vllm ( #6857 )
...
Former-commit-id: 5f38bcaba9
2025-02-08 15:02:28 +08:00
hoshi-hiyouga
28037c7834
fix qwen2vl plugin ( #6855 )
...
Former-commit-id: 40048ab77a
2025-02-08 10:59:10 +08:00
hoshi-hiyouga
f70208e1c0
[misc] allow extra args ( #6831 )
...
Former-commit-id: 74ade3a176
2025-02-06 12:38:08 +08:00
hoshi-hiyouga
7aa9767dc2
[assets] update wechat ( #6830 )
...
Former-commit-id: 6dad536968
2025-02-06 12:02:05 +08:00
Zhangchi Feng
01915eaf40
[model] support audio ( #6701 )
...
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 24c7842948
2025-02-05 04:59:09 +08:00
Yueqi Song
e665e1fed5
[data] allow thought in function call ( #6797 )
...
* Update template.py
* Update template.py
* use formatter
* fix regex
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: a5e943f7bc
2025-02-05 02:26:23 +08:00
hoshi-hiyouga
1fee69f874
[misc] update license year & fix llama pro ( #6814 )
...
* fix llamapro script
* change year
Former-commit-id: e2dc5b952a
2025-02-05 01:53:33 +08:00
Yueqi Song
8504bde893
[data] fix qwen tool template ( #6796 )
...
* Update tool_utils.py
* fix unittest
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: dd6b7d203e
2025-02-05 00:02:00 +08:00
Zhangchi Feng
85f22d01bf
[data] fix minicpmv plugin ( #6801 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
* update init audio
* update init audio
* [model] fix image process in minicpmo
Former-commit-id: ab9bd068ef
2025-02-04 21:20:15 +08:00
hoshi-hiyouga
822d5d362c
[assets] update wechat ( #6810 )
...
Former-commit-id: 069a477d16
2025-02-04 21:17:40 +08:00
neavo
32163e7ce0
[readme] update flash attention installation instruction on win platform ( #6788 )
...
* Update README_zh.md
* Update README.md
Former-commit-id: a417bcf8d9
2025-02-01 12:43:29 +08:00
hoshi-hiyouga
454140d912
[misc] update workflows ( #6787 )
...
Former-commit-id: b5fda21288
2025-02-01 04:54:42 +08:00
hoshi-hiyouga
445d643ef3
[model] add mistral small models ( #6786 )
...
Former-commit-id: 94803d8133
2025-02-01 04:31:38 +08:00
hoshi-hiyouga
e8c1979b79
[model] add qwen2.5 vl models ( #6779 )
...
Former-commit-id: 999c7c8fe0
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
f6779b0e0c
[breaking] support transformers 4.48 ( #6628 )
...
Former-commit-id: 15357cdad9
2025-01-31 01:36:33 +08:00
hoshi-hiyouga
245de012ca
[webui] improve webui & reasoning mode ( #6778 )
...
Former-commit-id: 45e68b9f09
2025-01-31 00:09:21 +08:00
codingma
f143360ee6
[assets] update wechat ( #6771 )
...
Former-commit-id: 4fb6059f48
2025-01-29 12:31:24 +08:00