641 Commits

Author SHA1 Message Date
hiyouga
e5d2ef4434 fix #3853
Former-commit-id: 063f91cc80193853d17c55fe092fb33683f5d39c
2024-05-24 23:29:45 +08:00
seanzhang-zhichen
9c8d79fbe3 Merge branch 'main' into add_dataset_sample_num
Former-commit-id: 27cb51f7f86f97ae231abfdcb0114ff245d7af9c
2024-05-24 15:57:47 +08:00
BUAADreamer
119af92620 Merge branch 'hiyouga:main' into main
Former-commit-id: 047a06a1e5336625acac14b3533ab3ef2b9905be
2024-05-24 09:50:00 +08:00
hiyouga
3e729798df refactor data preprocessing, fix mllm rlhf
Former-commit-id: 3a023bca2a502810a436cfba7708df164754ea62
2024-05-24 04:08:25 +08:00
hiyouga
d3490aceb7 fix paligemma sft
requires transformers>=4.41.1


Former-commit-id: de0e67aff13f191fd899ad717ec349a6bdb14f2a
2024-05-24 00:23:40 +08:00
hiyouga
6d8ef03741 fix oom issues in export
Former-commit-id: 67ebc7b388c61b9d880c02d7fd217c29299fdf43
2024-05-23 23:32:45 +08:00
BUAADreamer
d8a27e40e2 Merge branch 'hiyouga:main' into main
Former-commit-id: 8d53ec2b5f37e7b43da8d3e787f68fc1bb15997a
2024-05-21 22:18:20 +08:00
hiyouga
4ddc1c9c16 fix paligemma sft
Former-commit-id: 7134fb02bbdc9421f6c314ae176d5786a8cd768d
2024-05-21 20:03:09 +08:00
hiyouga
11f79ea20e fix #3847
Former-commit-id: 335501e228b7c2c2e3041533ddd996127b958f23
2024-05-21 17:53:06 +08:00
BUAADreamer
071d674065 support pretraining of llava
Former-commit-id: 29a6d5bdb8610be8f796eed65eede9ba7b503527
2024-05-21 08:57:14 +08:00
hiyouga
cce3892f91 support paligemma
Former-commit-id: 2a67457e3944d5e528286cb7203857c13078c484
2024-05-21 00:01:22 +08:00
hiyouga
a935c5105d fix paligemma data preprocess
Former-commit-id: e55c85ac72f4938738dbce576f83b47a1fea88ae
2024-05-20 23:51:32 +08:00
hiyouga
446c681b58 fix paligemma inference
Former-commit-id: 542229abb3aba2032d4c52a878c0fd35ba299691
2024-05-20 23:36:43 +08:00
hiyouga
5351e3945b add kto to webui
Former-commit-id: 9b0f4d7602f70031cc1a5fd66616cbe6bb4d78d3
2024-05-20 21:20:25 +08:00
zhangzc
4b90f04c1f fix conflict
Former-commit-id: d956041640d9abc5e59919a227d27270fb513a7e
2024-05-20 17:10:01 +08:00
hiyouga
864da49139 fix chat engines
do not use pop(key, default) since api assigns None to dict values


Former-commit-id: d52fae2fa866afeb6156dc98388ce5cc6d5eca77
2024-05-20 00:36:43 +08:00
hoshi-hiyouga
6955042c10 Merge pull request #3812 from ycjcl868/feat/chat-support-system-prompt
feat: cli chat support system_message
Former-commit-id: aa0bca49e9940e96a75f61f31c69580052f6ae1d
2024-05-20 00:31:32 +08:00
hoshi-hiyouga
02fdf903e8 Update vllm_engine.py
Former-commit-id: a0e8d3d159444a73a5ff07af3815cd2aaee0b056
2024-05-20 00:31:04 +08:00
hoshi-hiyouga
30b2ec7025 Update hf_engine.py
Former-commit-id: a943a1034b0033e2fae72e3d272817e3adb03fd1
2024-05-20 00:30:45 +08:00
hoshi-hiyouga
a710d97748 Update generating_args.py
Former-commit-id: a1fa7aa63b9b3fade3de6bd27395c1b94068b6d2
2024-05-20 00:29:31 +08:00
hoshi-hiyouga
b293939c24 Update chat_model.py
Former-commit-id: 896c656185e772c2c9ba9e6108de7ceec84ecc85
2024-05-20 00:29:12 +08:00
hiyouga
0e57bb201c fix jinja template
Former-commit-id: 10573e1639e7a71813927a8bfff3b036c21064c3
2024-05-19 23:38:30 +08:00
ycjcl868
b28f9ecaa0 feat: cli chat support system_message
Former-commit-id: a08ba254c8b62bff49b77be3740022105ae9dbb5
2024-05-19 23:17:46 +08:00
hiyouga
8d4a5ebf6e fix zero2 high ram usage
Former-commit-id: 31a0564d4f4886db03250f2c6daee6e042dc3eb4
2024-05-19 21:53:54 +08:00
hiyouga
5f48c282d3 fix hf gen args
Former-commit-id: 70214b71b18c75be59576153951beff1ce9ff8f6
2024-05-19 19:39:32 +08:00
hiyouga
32a65e89e5 fix envs
Former-commit-id: 8ee8ac6eba3e2916cf7c6401a0e9d803b3ff88ba
2024-05-19 18:27:18 +08:00
hiyouga
df4aec7e72 fix #3807
Former-commit-id: 1ebc890a5ff7b034c112bc9cf5cd8a6936613572
2024-05-19 17:07:57 +08:00
hiyouga
02f716907e safe output path in webui
Former-commit-id: 3c2a992caa45c9121cd1ccac3de9bbcf1a9b058b
2024-05-18 22:42:28 +08:00
hiyouga
7130efff54 fix jetmoe z3 block
Former-commit-id: d43822fcc220806b9eb7cbf9336ef42a0e6b2a51
2024-05-18 22:28:45 +08:00
hiyouga
519d2511ae improve data process logger
Former-commit-id: a851056229f37391023627180b5712ed64ae3528
2024-05-18 22:02:42 +08:00
hiyouga
1e867c0fa0 fix #3803
Former-commit-id: 0edc16769f7e84b74e5fc6a1382e284632567c4c
2024-05-18 16:13:14 +08:00
hiyouga
13d7b48efe improve KTO impl., replace datasets
Former-commit-id: c450ee87a35ff9235f9b695b0de2e042b2971178
2024-05-18 03:44:56 +08:00
hoshi-hiyouga
97469892c3 Merge pull request #3785 from enji-zhou/feature/add_kto
add kto

Former-commit-id: 33a354548e78a7f7f51d63f80974920827d30252
2024-05-18 03:07:18 +08:00
hoshi-hiyouga
e4a2accf4a Update model_args.py
Former-commit-id: 964672745389e35580a7010b0f010bd5ee08d542
2024-05-17 16:16:41 +08:00
juejuezi
20326affde feat: pass the max_lora_rank parameter to vLLM backend
Former-commit-id: b20d62ba3ccc5c02529d19e22b7adcfe8b88c326
2024-05-17 16:07:39 +08:00
hiyouga
9af3dce3c8 add deepseek v2 lite model
Former-commit-id: 8af98176055b6fc28d16b03207b5abaa7de6104a
2024-05-17 13:25:36 +08:00
enji.zhou
03956053b8 add kto
Former-commit-id: db1d5a4f51faae61fe18666057353747b01f5b8d
2024-05-17 13:09:17 +08:00
hiyouga
780a1f5a4e better dtype handle in loading
Former-commit-id: d9f190ff1ea1cc4dd061e8b03d429caea037bca4
2024-05-17 02:14:56 +08:00
hiyouga
f4bf49e891 enable inbrowser in webui
Former-commit-id: 694a05fd044bbbad107ca8fed5494460c78e1981
2024-05-17 00:08:56 +08:00
hiyouga
22f71c152a add falcon 11b
Former-commit-id: d77bed4091a6a8fea682b39d3261e1e93dfe093f
2024-05-17 00:08:33 +08:00
hiyouga
cae823ddf0 rename package
Former-commit-id: 308edbc4260d45907b4a9d3a45ec21d83e48aacb
2024-05-16 18:39:08 +08:00