hoshi-hiyouga
1679930e00
[breaking change] refactor data pipeline ( #6901 )
...
* refactor data
* rename file
Former-commit-id: 617c8ab467
2025-02-13 00:39:20 +08:00
Eric Tang
d50e04b805
[misc] support for launching LLaMA-Factory with uv run ( #6907 )
...
* yay
* uv with ray temporary commit
* remove ray specific code for now
* cleanup
Former-commit-id: f8a206125d
2025-02-13 00:38:44 +08:00
Eric Tang
e515fe62de
[example] fix path to ray example ( #6906 )
...
Former-commit-id: ee5fe216dc
2025-02-13 00:29:32 +08:00
hoshi-hiyouga
036fb0d561
[misc] fix grad ckpt func ( #6916 )
...
Former-commit-id: e34c3c06da
2025-02-13 00:17:18 +08:00
marko1616
bae934dea3
[trainer] fix llama3.2 vision kto train ( #6904 )
...
Former-commit-id: b7fd1e9c00
2025-02-12 19:09:14 +08:00
hoshi-hiyouga
2e2f6bea07
[data] feat: auto template ( #6905 )
...
* support auto template
* add unittest
Former-commit-id: 2f8b6847f5
2025-02-12 00:22:53 +08:00
hoshi-hiyouga
1b02183da9
[misc] update readme ( #6903 )
...
Former-commit-id: 18179a3823
2025-02-11 22:51:26 +08:00
hoshi-hiyouga
197aa3baf4
[data] fix ollama template ( #6902 )
...
* fix ollama template
* add meta info
* use half precision
Former-commit-id: e1a7c1242c
2025-02-11 22:43:09 +08:00
hoshi-hiyouga
c6be9e242c
[misc] support export ollama modelfile ( #6899 )
...
* support export ollama modelfile
* update config
* add system and num ctx
Former-commit-id: 9184a6e0ed
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
2e954d8fd2
[data] refactor template ( #6896 )
...
Former-commit-id: d1b8aa3835
2025-02-11 17:59:25 +08:00
codingma
fafa3add84
support ollama modelfile export ( #4686 )
...
Former-commit-id: 7f354b80bc
2025-02-11 17:52:24 +08:00
hoshi-hiyouga
593acca556
[data] refactor mm plugin ( #6895 )
...
* refactor plugin
* lint
Former-commit-id: aca63bfcca
2025-02-11 16:34:49 +08:00
HJ
188f22d8a7
[data] fix qwen_2_5_vl video processing ( #6868 )
...
* fix qwen_2_5_vl video processing
* Update mm_plugin.py
* Update mm_plugin.py
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 9153a7bd83
2025-02-11 16:14:50 +08:00
hoshi-hiyouga
703bb9cc18
[assets] update wechat ( #6892 )
...
Former-commit-id: fc5d47401f
2025-02-11 13:56:26 +08:00
Zhangchi Feng
5433b318bb
[da'ta] fix minicpmv plugin ( #6890 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
* update init audio
* update init audio
* [model]fix image process in minicpmo
* fix no mm inputs
Former-commit-id: 764627645a
2025-02-11 13:30:44 +08:00
HJ
fe4f4e9758
[data] fix: sharegpt converter ( #6879 )
...
* fix-sharegpt-format
* fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 0fb44cb3a5
2025-02-10 21:59:12 +08:00
hoshi-hiyouga
1bb3d17d9e
[data] fix mllama collator ( #6874 )
...
Former-commit-id: b68199db27
2025-02-09 22:42:25 +08:00
hoshi-hiyouga
b93333685b
[test] align test cases ( #6865 )
...
* align test cases
* fix function formatter
Former-commit-id: f6f3f8d0fc
2025-02-09 01:03:49 +08:00
hoshi-hiyouga
fcd0f0480d
[dataset] add openthought ( #6866 )
...
Former-commit-id: 1356f9d840
2025-02-09 00:53:01 +08:00
hoshi-hiyouga
ff6658ad27
[deps] upgrade vllm ( #6857 )
...
Former-commit-id: 5f38bcaba9
2025-02-08 15:02:28 +08:00
hoshi-hiyouga
28037c7834
fix qwen2vl plugin ( #6855 )
...
Former-commit-id: 40048ab77a
2025-02-08 10:59:10 +08:00
hoshi-hiyouga
f70208e1c0
[misc] allow extra args ( #6831 )
...
Former-commit-id: 74ade3a176
2025-02-06 12:38:08 +08:00
hoshi-hiyouga
7aa9767dc2
[assets] update wechat ( #6830 )
...
Former-commit-id: 6dad536968
2025-02-06 12:02:05 +08:00
Zhangchi Feng
01915eaf40
[model] support audio ( #6701 )
...
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 24c7842948
2025-02-05 04:59:09 +08:00
Yueqi Song
e665e1fed5
[data] allow thought in function call ( #6797 )
...
* Update template.py
* Update template.py
* use formatter
* fix regex
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: a5e943f7bc
2025-02-05 02:26:23 +08:00
hoshi-hiyouga
1fee69f874
[misc] update license year & fix llama pro ( #6814 )
...
* fix llamapro script
* change year
Former-commit-id: e2dc5b952a
2025-02-05 01:53:33 +08:00
Yueqi Song
8504bde893
[data] fix qwen tool template ( #6796 )
...
* Update tool_utils.py
* fix unittest
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: dd6b7d203e
2025-02-05 00:02:00 +08:00
Zhangchi Feng
85f22d01bf
[data] fix minicpmv plugin ( #6801 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
* update init audio
* update init audio
* [model]fix image process in minicpmo
Former-commit-id: ab9bd068ef
2025-02-04 21:20:15 +08:00
hoshi-hiyouga
822d5d362c
[assets] update wechat ( #6810 )
...
Former-commit-id: 069a477d16
2025-02-04 21:17:40 +08:00
neavo
32163e7ce0
[readme] update flash attention installation instruction on win platform ( #6788 )
...
* Update README_zh.md
* Update README.md
Former-commit-id: a417bcf8d9
2025-02-01 12:43:29 +08:00
hoshi-hiyouga
454140d912
[misc] update workflows ( #6787 )
...
Former-commit-id: b5fda21288
2025-02-01 04:54:42 +08:00
hoshi-hiyouga
445d643ef3
[model] add mistral small models ( #6786 )
...
Former-commit-id: 94803d8133
2025-02-01 04:31:38 +08:00
hoshi-hiyouga
e8c1979b79
[model] add qwen2.5 vl models ( #6779 )
...
Former-commit-id: 999c7c8fe0
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
f6779b0e0c
[breaking] support transformers 4.48 ( #6628 )
...
Former-commit-id: 15357cdad9
2025-01-31 01:36:33 +08:00
hoshi-hiyouga
245de012ca
[webui] improve webui & reasoning mode ( #6778 )
...
Former-commit-id: 45e68b9f09
2025-01-31 00:09:21 +08:00
codingma
f143360ee6
[assets] update wechat ( #6771 )
...
Former-commit-id: 4fb6059f48
2025-01-29 12:31:24 +08:00
qvlehao
f5350b103b
[model] add deepseek-R1 & show think process ( #6767 )
...
Former-commit-id: 28417f862a
2025-01-29 12:16:26 +08:00
yinpu
aa7c07caf0
fix: avoid redundant normalization in DPO's SFT loss calculation ( #6722 )
...
Former-commit-id: 0f45982bac
2025-01-21 13:38:02 +08:00
engchina
324f07613a
[webui] support ja ( #6698 )
...
* add support for japanese language
* add support for japanese language
---------
Co-authored-by: engchina <atjapan2015@gmail.com >
Former-commit-id: de9bc3fefa
2025-01-20 19:46:38 +08:00
hoshi-hiyouga
0c59483368
[assets] update wechat ( #6710 )
...
Former-commit-id: 3962645ac0
2025-01-20 16:29:24 +08:00
hoshi-hiyouga
1efe525df7
[model] support yarn ( #6693 )
...
Former-commit-id: 1f47b6186c
2025-01-18 13:56:09 +08:00
hoshi-hiyouga
ee0b3b1e1a
[assets] update wechat ( #6692 )
...
Former-commit-id: 17b470630d
2025-01-18 12:35:03 +08:00
hoshi-hiyouga
f87c788154
[misc] update mm plugin ( #6691 )
...
Former-commit-id: c0caa7afc6
2025-01-17 23:04:26 +08:00
hoshi-hiyouga
bbf334f823
disable valset by default ( #6690 )
...
Former-commit-id: 77bbf65905
2025-01-17 21:09:30 +08:00
hoshi-hiyouga
770433fa33
[webui] upgrade to gradio 5 ( #6688 )
...
Former-commit-id: 4d0f662dbe
2025-01-17 20:15:42 +08:00
hoshi-hiyouga
788accb601
fix qwen2 moe ( #6684 )
...
Former-commit-id: 7bf09abf1c
2025-01-17 13:46:09 +08:00
Zhangchi Feng
555f17c1ee
[data] Fix minicpmv/o dpo training ( #6657 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
Former-commit-id: 027942789b
2025-01-15 17:30:37 +08:00
steveepreston
8895cf1152
Update val_size english description ( #6653 )
...
* Update `val_size` Description in locales.py
* Update `val_size` Description in data_args.py
* Remove extra space in data_args.py
Former-commit-id: 76675b654e
2025-01-15 16:00:20 +08:00
hoshi-hiyouga
320e40d873
update readme ( #6648 )
...
Former-commit-id: 563be2286a
2025-01-15 11:06:19 +08:00
hoshi-hiyouga
9ef85f8fc4
[optim] clean apollo ( #6645 )
...
* clean apollo code
* update readme
Former-commit-id: 7a04021d04
2025-01-15 01:42:50 +08:00