1664 Commits

Author SHA1 Message Date
hiyouga
4807c11db8 support SimPO #3900
Former-commit-id: cb63b32986c43f97994211ec34dc5928fc3bb9d7
2024-05-26 23:46:33 +08:00
BUAADreamer
3eaf371a22 Merge branch 'hiyouga:main' into main
Former-commit-id: 60170a1da42a395cf440bbd3825c4e295c31ac38
2024-05-25 14:18:49 +08:00
hiyouga
e5d2ef4434 fix #3853
Former-commit-id: 063f91cc80193853d17c55fe092fb33683f5d39c
2024-05-24 23:29:45 +08:00
seanzhang-zhichen
9c8d79fbe3 Merge branch 'main' into add_dataset_sample_num
Former-commit-id: 27cb51f7f86f97ae231abfdcb0114ff245d7af9c
2024-05-24 15:57:47 +08:00
BUAADreamer
119af92620 Merge branch 'hiyouga:main' into main
Former-commit-id: 047a06a1e5336625acac14b3533ab3ef2b9905be
2024-05-24 09:50:00 +08:00
hiyouga
3e729798df refactor data preprocessing, fix mllm rlhf
Former-commit-id: 3a023bca2a502810a436cfba7708df164754ea62
2024-05-24 04:08:25 +08:00
hoshi-hiyouga
77b5779746 Merge pull request #3876 from dongdongqiang2018/main
added adapted to 910B image

Former-commit-id: a506f3628ba67cc2f7829c8a7da5a044fa8c9164
2024-05-24 01:54:30 +08:00
hiyouga
d3490aceb7 fix paligemma sft
requires transformers>=4.41.1


Former-commit-id: de0e67aff13f191fd899ad717ec349a6bdb14f2a
2024-05-24 00:23:40 +08:00
hiyouga
6d8ef03741 fix oom issues in export
Former-commit-id: 67ebc7b388c61b9d880c02d7fd217c29299fdf43
2024-05-23 23:32:45 +08:00
donggang
3f52df0ca9 adapted to 910B image
Former-commit-id: 2f68a71fc005f40041a85740513f77512e386669
2024-05-23 09:48:22 +00:00
BUAADreamer
d8a27e40e2 Merge branch 'hiyouga:main' into main
Former-commit-id: 8d53ec2b5f37e7b43da8d3e787f68fc1bb15997a
2024-05-21 22:18:20 +08:00
hiyouga
4ddc1c9c16 fix paligemma sft
Former-commit-id: 7134fb02bbdc9421f6c314ae176d5786a8cd768d
2024-05-21 20:03:09 +08:00
hiyouga
a8480baa11 Update README_zh.md
Former-commit-id: 4d647ddba5934b4d9f594c472aa6b46865bb525a
2024-05-21 18:30:59 +08:00
hiyouga
eabaf0def8 update wechat
Former-commit-id: 2670f6fb3ddabc4da3b0ed49baa40950744e75d7
2024-05-21 18:22:32 +08:00
hiyouga
11f79ea20e fix #3847
Former-commit-id: 335501e228b7c2c2e3041533ddd996127b958f23
2024-05-21 17:53:06 +08:00
hiyouga
c03be5fe63 Update wechat.jpg
Former-commit-id: 789e73b0f4c7a71afde488de5e8c3cfccccdddfe
2024-05-21 17:09:43 +08:00
BUAADreamer
071d674065 support pretraining of llava
Former-commit-id: 29a6d5bdb8610be8f796eed65eede9ba7b503527
2024-05-21 08:57:14 +08:00
hiyouga
cce3892f91 support paligemma
Former-commit-id: 2a67457e3944d5e528286cb7203857c13078c484
2024-05-21 00:01:22 +08:00
hiyouga
a935c5105d fix paligemma data preprocess
Former-commit-id: e55c85ac72f4938738dbce576f83b47a1fea88ae
2024-05-20 23:51:32 +08:00
hiyouga
446c681b58 fix paligemma inference
Former-commit-id: 542229abb3aba2032d4c52a878c0fd35ba299691
2024-05-20 23:36:43 +08:00
hiyouga
7f6c37c68e fix #3818
Former-commit-id: 7262679666bf70816911ff2434c7c7ccbca26378
2024-05-20 21:43:19 +08:00
hiyouga
5351e3945b add kto to webui
Former-commit-id: 9b0f4d7602f70031cc1a5fd66616cbe6bb4d78d3
2024-05-20 21:20:25 +08:00
zhangzc
4b90f04c1f fix conflict
Former-commit-id: d956041640d9abc5e59919a227d27270fb513a7e
2024-05-20 17:10:01 +08:00
hiyouga
864da49139 fix chat engines
do not use pop(key, default) since api assigns None to dict values


Former-commit-id: d52fae2fa866afeb6156dc98388ce5cc6d5eca77
2024-05-20 00:36:43 +08:00
hoshi-hiyouga
6955042c10 Merge pull request #3812 from ycjcl868/feat/chat-support-system-prompt
feat: cli chat support system_message
Former-commit-id: aa0bca49e9940e96a75f61f31c69580052f6ae1d
2024-05-20 00:31:32 +08:00
hoshi-hiyouga
02fdf903e8 Update vllm_engine.py
Former-commit-id: a0e8d3d159444a73a5ff07af3815cd2aaee0b056
2024-05-20 00:31:04 +08:00
hoshi-hiyouga
30b2ec7025 Update hf_engine.py
Former-commit-id: a943a1034b0033e2fae72e3d272817e3adb03fd1
2024-05-20 00:30:45 +08:00
hoshi-hiyouga
a710d97748 Update generating_args.py
Former-commit-id: a1fa7aa63b9b3fade3de6bd27395c1b94068b6d2
2024-05-20 00:29:31 +08:00
hoshi-hiyouga
b293939c24 Update chat_model.py
Former-commit-id: 896c656185e772c2c9ba9e6108de7ceec84ecc85
2024-05-20 00:29:12 +08:00
hiyouga
0e57bb201c fix jinja template
Former-commit-id: 10573e1639e7a71813927a8bfff3b036c21064c3
2024-05-19 23:38:30 +08:00
ycjcl868
b28f9ecaa0 feat: cli chat support system_message
Former-commit-id: a08ba254c8b62bff49b77be3740022105ae9dbb5
2024-05-19 23:17:46 +08:00
hiyouga
8d4a5ebf6e fix zero2 high ram usage
Former-commit-id: 31a0564d4f4886db03250f2c6daee6e042dc3eb4
2024-05-19 21:53:54 +08:00
hiyouga
5f48c282d3 fix hf gen args
Former-commit-id: 70214b71b18c75be59576153951beff1ce9ff8f6
2024-05-19 19:39:32 +08:00
hiyouga
32a65e89e5 fix envs
Former-commit-id: 8ee8ac6eba3e2916cf7c6401a0e9d803b3ff88ba
2024-05-19 18:27:18 +08:00
hiyouga
df4aec7e72 fix #3807
Former-commit-id: 1ebc890a5ff7b034c112bc9cf5cd8a6936613572
2024-05-19 17:07:57 +08:00
hiyouga
62ddab4b3a update readme
Former-commit-id: 2bec28e328ee80026417f3d055f6d6038479fbfa
2024-05-18 23:09:03 +08:00
hiyouga
02f716907e safe output path in webui
Former-commit-id: 3c2a992caa45c9121cd1ccac3de9bbcf1a9b058b
2024-05-18 22:42:28 +08:00
hiyouga
7130efff54 fix jetmoe z3 block
Former-commit-id: d43822fcc220806b9eb7cbf9336ef42a0e6b2a51
2024-05-18 22:28:45 +08:00
hiyouga
519d2511ae improve data process logger
Former-commit-id: a851056229f37391023627180b5712ed64ae3528
2024-05-18 22:02:42 +08:00
hiyouga
c53e626c9a update data readme
Former-commit-id: ca48f90f1eb9828300635bdaee6c10d6cc632d3d
2024-05-18 21:37:38 +08:00
hiyouga
68c07d3e1e update data readme
Former-commit-id: 18cbf8561d6c3fdceac47991ed16d35471823187
2024-05-18 21:15:20 +08:00
hiyouga
1e867c0fa0 fix #3803
Former-commit-id: 0edc16769f7e84b74e5fc6a1382e284632567c4c
2024-05-18 16:13:14 +08:00
hoshi-hiyouga
9fba1bb649 Merge pull request #3799 from hiyouga/dev
improve KTO impl, replace datasets

Former-commit-id: 73d4a8e655a02d4d1db9fb18d0cc718a5a07a143
2024-05-18 03:49:13 +08:00
hiyouga
13d7b48efe improve KTO impl., replace datasets
Former-commit-id: c450ee87a35ff9235f9b695b0de2e042b2971178
2024-05-18 03:44:56 +08:00
hoshi-hiyouga
97469892c3 Merge pull request #3785 from enji-zhou/feature/add_kto
add kto

Former-commit-id: 33a354548e78a7f7f51d63f80974920827d30252
2024-05-18 03:07:18 +08:00
hoshi-hiyouga
2d1583faba Merge pull request #3794 from jue-jue-zi/main
feat: pass the `max_lora_rank` parameter to vLLM backend
Former-commit-id: d7ff49f245cd34668cbe43366e5f1890876da5e7
2024-05-17 16:17:30 +08:00
hoshi-hiyouga
e4a2accf4a Update model_args.py
Former-commit-id: 964672745389e35580a7010b0f010bd5ee08d542
2024-05-17 16:16:41 +08:00
juejuezi
20326affde feat: pass the max_lora_rank parameter to vLLM backend
Former-commit-id: b20d62ba3ccc5c02529d19e22b7adcfe8b88c326
2024-05-17 16:07:39 +08:00
hiyouga
9af3dce3c8 add deepseek v2 lite model
Former-commit-id: 8af98176055b6fc28d16b03207b5abaa7de6104a
2024-05-17 13:25:36 +08:00
enji.zhou
03956053b8 add kto
Former-commit-id: db1d5a4f51faae61fe18666057353747b01f5b8d
2024-05-17 13:09:17 +08:00