hiyouga | c1fdf81df6 | tiny fix | 2024-05-27 20:54:26 +08:00
hoshi-hiyouga | 87ea0a8bcd | Merge pull request #3921 from gusye1234/main; Add openchat-3.6-8B support | 2024-05-27 20:52:37 +08:00
hoshi-hiyouga | f1002b9f93 | Update template.py | 2024-05-27 20:51:56 +08:00
hoshi-hiyouga | 122213a7a7 | Update template.py | 2024-05-27 20:51:26 +08:00
Jianbai Ye | cff815391f | add openchat-3.6-8B support | 2024-05-27 20:42:08 +08:00
hiyouga | 08564838bd | fix full/freeze tuning for mllm | 2024-05-27 20:37:57 +08:00
hoshi-hiyouga | 838f2fb3e4 | Merge pull request #3835 from BUAADreamer/main; fix some features in llava-style training | 2024-05-27 20:23:45 +08:00
hiyouga | e626e26446 | support Aya23 | 2024-05-27 20:23:24 +08:00
BUAADreamer | ea2afd429e | Merge branch 'hiyouga:main' into main | 2024-05-27 19:00:48 +08:00
BUAADreamer | 57eb13b75d | add regex of only tune lm and mm_proj | 2024-05-27 18:59:00 +08:00
hiyouga | efa4b196ca | add phi-3 7b/14b, mistral v0.3 models | 2024-05-27 18:20:16 +08:00
hiyouga | 5581cb2e4e | update readme | 2024-05-27 18:14:02 +08:00
BUAADreamer | 4bc7c10c00 | Merge branch 'hiyouga:main' into main | 2024-05-27 11:54:01 +08:00
hiyouga | cb63b32986 | support SimPO #3900 | 2024-05-26 23:46:33 +08:00
BUAADreamer | 60170a1da4 | Merge branch 'hiyouga:main' into main | 2024-05-25 14:18:49 +08:00
hiyouga | 063f91cc80 | fix #3853 | 2024-05-24 23:29:45 +08:00
BUAADreamer | 047a06a1e5 | Merge branch 'hiyouga:main' into main | 2024-05-24 09:50:00 +08:00
hiyouga | 3a023bca2a | refactor data preprocessing, fix mllm rlhf | 2024-05-24 04:08:25 +08:00
hiyouga | de0e67aff1 | fix paligemma sft; requires transformers>=4.41.1 | 2024-05-24 00:23:40 +08:00
hiyouga | 67ebc7b388 | fix oom issues in export | 2024-05-23 23:32:45 +08:00
BUAADreamer | 8d53ec2b5f | Merge branch 'hiyouga:main' into main | 2024-05-21 22:18:20 +08:00
hiyouga | 7134fb02bb | fix paligemma sft | 2024-05-21 20:03:09 +08:00
hiyouga | 335501e228 | fix #3847 | 2024-05-21 17:53:06 +08:00
BUAADreamer | 29a6d5bdb8 | support pretraining of llava | 2024-05-21 08:57:14 +08:00
hiyouga | 2a67457e39 | support paligemma | 2024-05-21 00:01:22 +08:00
hiyouga | e55c85ac72 | fix paligemma data preprocess | 2024-05-20 23:51:32 +08:00
hiyouga | 542229abb3 | fix paligemma inference | 2024-05-20 23:36:43 +08:00
hiyouga | 9b0f4d7602 | add kto to webui | 2024-05-20 21:20:25 +08:00
hiyouga | d52fae2fa8 | fix chat engines; do not use pop(key, default) since api assigns None to dict values | 2024-05-20 00:36:43 +08:00
hoshi-hiyouga | aa0bca49e9 | Merge pull request #3812 from ycjcl868/feat/chat-support-system-prompt; feat: cli chat support system_message | 2024-05-20 00:31:32 +08:00
hoshi-hiyouga | a0e8d3d159 | Update vllm_engine.py | 2024-05-20 00:31:04 +08:00
hoshi-hiyouga | a943a1034b | Update hf_engine.py | 2024-05-20 00:30:45 +08:00
hoshi-hiyouga | a1fa7aa63b | Update generating_args.py | 2024-05-20 00:29:31 +08:00
hoshi-hiyouga | 896c656185 | Update chat_model.py | 2024-05-20 00:29:12 +08:00
hiyouga | 10573e1639 | fix jinja template | 2024-05-19 23:38:30 +08:00
ycjcl868 | a08ba254c8 | feat: cli chat support system_message | 2024-05-19 23:17:46 +08:00
hiyouga | 31a0564d4f | fix zero2 high ram usage | 2024-05-19 21:53:54 +08:00
hiyouga | 70214b71b1 | fix hf gen args | 2024-05-19 19:39:32 +08:00
hiyouga | 8ee8ac6eba | fix envs | 2024-05-19 18:27:18 +08:00
hiyouga | 1ebc890a5f | fix #3807 | 2024-05-19 17:07:57 +08:00
hiyouga | 3c2a992caa | safe output path in webui | 2024-05-18 22:42:28 +08:00
hiyouga | d43822fcc2 | fix jetmoe z3 block | 2024-05-18 22:28:45 +08:00
hiyouga | a851056229 | improve data process logger | 2024-05-18 22:02:42 +08:00
hiyouga | 0edc16769f | fix #3803 | 2024-05-18 16:13:14 +08:00
hiyouga | c450ee87a3 | improve KTO impl., replace datasets | 2024-05-18 03:44:56 +08:00
hoshi-hiyouga | 33a354548e | Merge pull request #3785 from enji-zhou/feature/add_kto; add kto | 2024-05-18 03:07:18 +08:00
hoshi-hiyouga | 9646727453 | Update model_args.py | 2024-05-17 16:16:41 +08:00
juejuezi | b20d62ba3c | feat: pass the max_lora_rank parameter to vLLM backend | 2024-05-17 16:07:39 +08:00
hiyouga | 8af9817605 | add deepseek v2 lite model | 2024-05-17 13:25:36 +08:00
enji.zhou | db1d5a4f51 | add kto | 2024-05-17 13:09:17 +08:00