hiyouga
|
d13b8bee8a
|
fix jetmoe z3 block
Former-commit-id: cb00a14d905395c4b8fadb955f0424a4c56668de
|
2024-05-18 22:28:45 +08:00 |
|
hiyouga
|
0aa072a155
|
improve data process logger
Former-commit-id: 33d0b012b56dbafc9fff87b821c2d1bf1409dbb5
|
2024-05-18 22:02:42 +08:00 |
|
hiyouga
|
9c1c59e481
|
fix #3803
Former-commit-id: 1ef12c95059d14a1717c82ce04e529e7ad6435ed
|
2024-05-18 16:13:14 +08:00 |
|
hiyouga
|
2bff90719b
|
improve KTO impl., replace datasets
Former-commit-id: e56a57ddcf061de6e4acc8679f7dbf0b68364986
|
2024-05-18 03:44:56 +08:00 |
|
hoshi-hiyouga
|
e4570e28a8
|
Merge pull request #3785 from enji-zhou/feature/add_kto
add kto
Former-commit-id: f60faa23e23022fd855dac6b1ecbd21e095bccb5
|
2024-05-18 03:07:18 +08:00 |
|
hoshi-hiyouga
|
0fd1a05cec
|
Update model_args.py
Former-commit-id: f40a2fe5334865763e4d513292d359317b7a091b
|
2024-05-17 16:16:41 +08:00 |
|
juejuezi
|
6373d307ec
|
feat: pass the max_lora_rank parameter to vLLM backend
Former-commit-id: a8756d839405ecb5deabe885cf11d1a61564deee
|
2024-05-17 16:07:39 +08:00 |
|
hiyouga
|
a32c3a50fc
|
add deepseek v2 lite model
Former-commit-id: 5e864e6b721d8b891b1cc2ca2dcac41babb9eaaf
|
2024-05-17 13:25:36 +08:00 |
|
enji.zhou
|
66b5634ebf
|
add kto
Former-commit-id: ec51986cf70b0bdd79b8141e45916670fb97a08e
|
2024-05-17 13:09:17 +08:00 |
|
hiyouga
|
969e605c7e
|
better dtype handle in loading
Former-commit-id: 663f0577dd61a1a31191db2c6fbb0c7cea533b21
|
2024-05-17 02:14:56 +08:00 |
|
hiyouga
|
45329d9e3c
|
enable inbrowser in webui
Former-commit-id: 71fdeedb64b2339eb1c740d670b87e0c03dada68
|
2024-05-17 00:08:56 +08:00 |
|
hiyouga
|
6481321470
|
add falcon 11b
Former-commit-id: 897acc725edc204fad393cc9616828431b4fa768
|
2024-05-17 00:08:33 +08:00 |
|
hiyouga
|
dfa686b617
|
rename package
Former-commit-id: a07ff0c083558cfe6f474d13027642d3052fee08
|
2024-05-16 18:39:08 +08:00 |
|