hiyouga
|
af7748139a
|
bump versions
transformers 4.37.2->4.41.2
datasets 2.14.3->2.16.0
accelerate 0.27.2->0.30.1
peft 0.10.0->0.11.1
trl 0.8.1->0.8.6
Former-commit-id: 876bc92865
|
2024-06-03 18:29:38 +08:00 |
|
hiyouga
|
820404946e
|
better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui
Former-commit-id: 8070871732
|
2024-05-29 23:55:38 +08:00 |
|
hiyouga
|
a71a6a05c3
|
update readme
Former-commit-id: 89ca832740
|
2024-05-29 18:39:11 +08:00 |
|
hzhaoy
|
ce1be3da4b
|
add TeleChat-12B/TeleChat-12B-v2 models
Former-commit-id: 0dd632fe9e
|
2024-05-29 15:00:37 +08:00 |
|
hiyouga
|
3ea8f5e6b9
|
support DDP in webui
Former-commit-id: 7c016b22aa
|
2024-05-28 19:24:22 +08:00 |
|
hiyouga
|
0706dbf7e6
|
tiny fix
Former-commit-id: c1fdf81df6
|
2024-05-27 20:54:26 +08:00 |
|
hoshi-hiyouga
|
ad3ca3f556
|
Merge pull request #3921 from gusye1234/main
Add openchat-3.6-8B support
Former-commit-id: 87ea0a8bcd
|
2024-05-27 20:52:37 +08:00 |
|
Jianbai Ye
|
d2c1df7f3d
|
add openchat-3.6-8B support
Former-commit-id: cff815391f
|
2024-05-27 20:42:08 +08:00 |
|
hiyouga
|
fc5a6b5c4e
|
support Aya23
Former-commit-id: e626e26446
|
2024-05-27 20:23:24 +08:00 |
|
hiyouga
|
51a1097c64
|
add phi-3 7b/14b, mistral v0.3 models
Former-commit-id: efa4b196ca
|
2024-05-27 18:20:16 +08:00 |
|
hiyouga
|
df33548b39
|
update readme
Former-commit-id: 5581cb2e4e
|
2024-05-27 18:14:02 +08:00 |
|
hiyouga
|
4807c11db8
|
support SimPO #3900
Former-commit-id: cb63b32986
|
2024-05-26 23:46:33 +08:00 |
|
hiyouga
|
11f79ea20e
|
fix #3847
Former-commit-id: 335501e228
|
2024-05-21 17:53:06 +08:00 |
|
hiyouga
|
cce3892f91
|
support paligemma
Former-commit-id: 2a67457e39
|
2024-05-21 00:01:22 +08:00 |
|
hiyouga
|
446c681b58
|
fix paligemma inference
Former-commit-id: 542229abb3
|
2024-05-20 23:36:43 +08:00 |
|
hiyouga
|
32a65e89e5
|
fix envs
Former-commit-id: 8ee8ac6eba
|
2024-05-19 18:27:18 +08:00 |
|
hoshi-hiyouga
|
97469892c3
|
Merge pull request #3785 from enji-zhou/feature/add_kto
add kto
Former-commit-id: 33a354548e
|
2024-05-18 03:07:18 +08:00 |
|
hiyouga
|
9af3dce3c8
|
add deepseek v2 lite model
Former-commit-id: 8af9817605
|
2024-05-17 13:25:36 +08:00 |
|
enji.zhou
|
03956053b8
|
add kto
Former-commit-id: db1d5a4f51
|
2024-05-17 13:09:17 +08:00 |
|
hiyouga
|
22f71c152a
|
add falcon 11b
Former-commit-id: d77bed4091
|
2024-05-17 00:08:33 +08:00 |
|
hiyouga
|
cae823ddf0
|
rename package
Former-commit-id: 308edbc426
|
2024-05-16 18:39:08 +08:00 |
|