hiyouga
|
cfe1e24471
|
support solar 10.7B #1907
Former-commit-id: ecf9b35c612e5514dd25b0d15835d28447a7437e
|
2024-01-14 00:30:30 +08:00 |
|
hiyouga
|
880055bc90
|
support deepseek moe
Former-commit-id: 07fbb32496b9b81c4cfe67cb9a15a6b2c43852c3
|
2024-01-14 00:14:49 +08:00 |
|
hiyouga
|
ad99bd0a14
|
fix phi modules
Former-commit-id: 68d7e925ec51b6ee277513de8f61ac18a8378b98
|
2024-01-13 23:12:47 +08:00 |
|
hiyouga
|
c5f099138d
|
fix #2147
Former-commit-id: 49445a03cd46af4e7036cf444cd041dfab2d8941
|
2024-01-12 03:30:56 +08:00 |
|
hiyouga
|
6e64e02f71
|
fix #2164
Former-commit-id: abe23bb4aca4fa571ebafc329ec9a9d457e37d41
|
2024-01-12 00:27:57 +08:00 |
|
hiyouga
|
73cab9d9d4
|
fix #2161
Former-commit-id: 9acd5a2b678cd07f8e3b48eca76c4cbacb559e37
|
2024-01-11 17:04:13 +08:00 |
|
hiyouga
|
64246d42d2
|
improve web ui
Former-commit-id: 5c0148c018b12b52bc5748acfd6ad43836f2edb5
|
2024-01-10 12:37:45 +08:00 |
|
hiyouga
|
6fa6d4532e
|
improve model export
Former-commit-id: d1b795aac1fccbcb8a9ec2057065c33b46ce1a5a
|
2024-01-09 22:26:24 +08:00 |
|
hiyouga
|
92b9956c06
|
modify weight name
Former-commit-id: 3f3c528fa8056dc1952ea5293bad7e55187983ff
|
2024-01-09 20:22:47 +08:00 |
|
hiyouga
|
4d6669c268
|
fix #1789
Former-commit-id: d86455f685fa531e651333e00b4fe54d895cf2e4
|
2024-01-09 18:31:27 +08:00 |
|
hiyouga
|
89f4ae51f9
|
fix #2127
Former-commit-id: 5a1aa33fa9b546ab520f0ba4cb9d996b87eb71ca
|
2024-01-09 14:49:13 +08:00 |
|
hiyouga
|
af0659f573
|
fix #2125
Former-commit-id: 46a22f4daeafac5b0a695212d060960ff53af613
|
2024-01-08 21:42:25 +08:00 |
|
hiyouga
|
0bef890000
|
fix api server
Former-commit-id: cedd80ba56c0090487f65f4b1227e5615943997f
|
2024-01-07 17:14:42 +08:00 |
|
hiyouga
|
75fe1404b1
|
improve model export
Former-commit-id: 31255147a566a23ce1a48402662d14af8ac267ab
|
2024-01-05 18:51:49 +08:00 |
|
hiyouga
|
b460c9372f
|
fix #2098
Former-commit-id: e62d9158cffbf1044396597ddaf15b1c0bc5f954
|
2024-01-05 17:11:26 +08:00 |
|
hiyouga
|
c3e574ceaa
|
fix qwen template
Former-commit-id: c1923e0daa02b49ac07e96ce29877729acc78d31
|
2024-01-05 16:14:56 +08:00 |
|
hiyouga
|
04ae80a52e
|
fix #2081
Former-commit-id: ec4b539b6c0be11e15d273025c414b694bbd6c9a
|
2024-01-04 23:19:08 +08:00 |
|
hiyouga
|
a7ff095399
|
fix #2090
Former-commit-id: 13ec720990a88b01f7f5e2a99a87f95128dc3537
|
2024-01-04 23:05:08 +08:00 |
|
hiyouga
|
a655dcebaf
|
fix #2067
Former-commit-id: 6cfdeea5261fd5bf6f91ba2bb3efb921a2f3e866
|
2024-01-04 22:53:03 +08:00 |
|
hiyouga
|
8c74851b70
|
fix dispatch
Former-commit-id: deda82638716506dc690902c51276bb1eb0ddd5e
|
2024-01-03 16:33:16 +08:00 |
|
hiyouga
|
7168392a51
|
fix valuehead patch
Former-commit-id: d9cb98362b58b28ae0ee207e7c07e75e5d810876
|
2024-01-03 16:19:23 +08:00 |
|
hiyouga
|
ccc5b324fe
|
fix rm server
Former-commit-id: 81bc1638682a9fd01518f9f25250a6b584d2a9e6
|
2024-01-03 15:30:46 +08:00 |
|
hiyouga
|
e85c205a81
|
fix #2014
Former-commit-id: 077f6bf64e50f01f62aa4a957438bedc4e7925b3
|
2023-12-29 15:17:22 +08:00 |
|
hiyouga
|
7e225be16e
|
add yuan model
Former-commit-id: 6a0377e2e51633bd5fb10fa8628e554565c5ee3e
|
2023-12-29 13:50:24 +08:00 |
|
hiyouga
|
ebb32e85f8
|
fix version
Former-commit-id: dd7500b65d0d548441eece101b60d51fa619cc0f
|
2023-12-29 04:53:36 +08:00 |
|
hiyouga
|
90d279f39f
|
fix args
Former-commit-id: ff18f327a3dc96d9677ef32841e8f29ab2eeb7ef
|
2023-12-28 18:47:19 +08:00 |
|
hiyouga
|
af3f5b6e16
|
fix export format
Former-commit-id: 7c82bd396b9e6ff395850ad544d95cbf1b7557cd
|
2023-12-28 18:40:46 +08:00 |
|
hiyouga
|
53d7c5109f
|
fix ppo trainer
Former-commit-id: ca5b5823b03822ef899405d233a82396be997f44
|
2023-12-28 18:09:28 +08:00 |
|
hiyouga
|
c33fbea469
|
fix bug
Former-commit-id: b06faa1be3f5aa5e0fa31aa31314c213c36c3442
|
2023-12-24 19:20:12 +08:00 |
|
hiyouga
|
921f593632
|
update loader
Former-commit-id: 080d8eab858217ca58bffe719d5ffde7579c5bda
|
2023-12-24 19:10:23 +08:00 |
|
hiyouga
|
940403720a
|
update patcher
Former-commit-id: d6d7b6670847ce4ea10353c5b126214542b45c2b
|
2023-12-23 15:24:27 +08:00 |
|
hiyouga
|
f869e44fe5
|
fix #1909
Former-commit-id: 3e93c33af9f80e28c9f30af9b7ba20757358afb4
|
2023-12-23 14:42:20 +08:00 |
|
hiyouga
|
306a70c7ba
|
fix unsloth dtype
Former-commit-id: fd22e6546ce5f38a6a075cf894aafc3d206b2fcd
|
2023-12-23 01:59:49 +08:00 |
|
hiyouga
|
d358d955e5
|
fix dpo trainer
Former-commit-id: c160dd7cd86e296e32775ace2e4258a473449c41
|
2023-12-23 01:51:55 +08:00 |
|
hiyouga
|
0fdd6074c3
|
llama board: add unsloth
Former-commit-id: 9477e6f28808ae9deadada1f6cf679a29542c271
|
2023-12-23 00:35:53 +08:00 |
|
hiyouga
|
6faf9c35a9
|
support unsloth
Former-commit-id: b857f00234b90b785d82ca7cdb29af3d948b1a7b
|
2023-12-23 00:14:33 +08:00 |
|
ShaneTian
|
d05febe5de
|
Fix slow model initialization in bfloat16 dtype.
Former-commit-id: cf2e2f6f9b7f09b1e2faf6fbc413e3f62e3846c7
|
2023-12-22 16:27:28 +08:00 |
|
hiyouga
|
67f7034a21
|
fix param type
Former-commit-id: 11b99f344416ade1cdac52e11ba7f36fcf689221
|
2023-12-21 17:33:01 +08:00 |
|
hiyouga
|
79f301a2c6
|
fix ds zero3 check
Former-commit-id: 7f50705b1d821d287bd854211319f697f57b25db
|
2023-12-21 01:19:22 +08:00 |
|
hiyouga
|
31cbc67986
|
match version
Former-commit-id: 16db52522584a8e084d4db2a7c253c8b88f27371
|
2023-12-20 22:17:35 +08:00 |
|
hiyouga
|
acf5241845
|
fix stop words
Former-commit-id: 6ce6cac9fa8f0af33697e824cf93a9a80cdbd064
|
2023-12-20 19:06:43 +08:00 |
|
hiyouga
|
2bce99b82f
|
fix yi template #1895
Former-commit-id: 05b4fa1e2b13a15ee261a151ac8cd0a2ebdf5edc
|
2023-12-20 18:58:16 +08:00 |
|
hiyouga
|
3c330869ef
|
improve quantization
Former-commit-id: 4dde60017ad8208dfea0b2bb61df6a14a35d03e0
|
2023-12-20 18:27:16 +08:00 |
|
hiyouga
|
dba1af4841
|
add max_memory for gptq #1923
Former-commit-id: 9afc42c8b999fbbc206d9a467ca5795b27a10096
|
2023-12-20 18:15:17 +08:00 |
|
hiyouga
|
2b1e52dcc9
|
fix #1073 #1462 #1735 #1908
Former-commit-id: cd8e2535aa66931b24b96e76c2b56ce703a579b1
|
2023-12-20 17:15:40 +08:00 |
|
hiyouga
|
b5238e945a
|
optimize data loading logic
Former-commit-id: 58f669b384582ac90e85de835f1f44f7003f9ec0
|
2023-12-20 16:15:41 +08:00 |
|
hiyouga
|
afc0f29704
|
fix #1909
Former-commit-id: f563e8d28dfa48a60cbe3d295b20f9cf58de296d
|
2023-12-20 16:11:07 +08:00 |
|
hiyouga
|
de0bb1d2da
|
fix mixtral inference #1821
Former-commit-id: 612f9fd19cbd29e8b1785a1576a9668e7dcd264c
|
2023-12-20 15:11:15 +08:00 |
|
hiyouga
|
cc16ece283
|
fix #1900
Former-commit-id: 4c35214396f873588562606b084740b6581188d9
|
2023-12-19 17:21:46 +08:00 |
|
hiyouga
|
4b27cf5460
|
add codegeex template
Former-commit-id: a8222722b8097158f1c92e3729f41d411eff3926
|
2023-12-18 19:52:35 +08:00 |
|