hiyouga | fdf5d3ac77 | fix bugs in webui (Former-commit-id: 4befa74ea630d90e4d7a1f7d7c34d39257717ec1) | 2023-10-15 03:41:58 +08:00
hiyouga | 66bb8ccb28 | refactor webui (Former-commit-id: 813ecd8e51949c21ab6fbaa51cc2b1a84ee07952) | 2023-10-15 03:06:21 +08:00
hiyouga | 8ee069d7c3 | fix loading dtype (Former-commit-id: d54a356128f7e335c12089702cf3de7f5b4baf16) | 2023-10-14 20:15:24 +08:00
hiyouga | 0c40610fb5 | fix #1176 #1177 (Former-commit-id: 5627a2b57c270a78095a32083e2dc7aa02162875) | 2023-10-14 20:00:17 +08:00
hiyouga | 7fea444d48 | fix #1184 (Former-commit-id: 5b069a967823e659dbc70b0d50361b3ad248087e) | 2023-10-14 19:20:11 +08:00
hiyouga | 08bb6baf76 | fix webui (Former-commit-id: a0fe43aac968d9f6ca4724b8d718b45c03063b91) | 2023-10-13 16:27:59 +08:00
hiyouga | 19582ee70f | fix ppo args (Former-commit-id: 0f12899951808f53a482082eb116bda309775930) | 2023-10-11 23:40:50 +08:00
hiyouga | f1a8fcf917 | refactor model_dtype, fix PPO trainer (Former-commit-id: 3e17ee5afbcb823a7c9a2f91864b3750cd79edb4) | 2023-10-11 23:16:01 +08:00
hiyouga | 04008ea0a4 | add averaging in evaluation (Former-commit-id: b39d6e0b8658e1c69bbaf6bcb6cfaa8f7af30110) | 2023-10-10 23:16:31 +08:00
hiyouga | c678dc262d | fix aquila template, repair sft packing mechanism (Former-commit-id: 8c82cfa5dd4bec957426b5bf176d242c77552ab0) | 2023-10-10 18:49:55 +08:00
hiyouga | 5bc37ad230 | tiny fix (Former-commit-id: 31ccd3329ac634b239c43d60bd955cd95670df16) | 2023-10-10 17:41:13 +08:00
hiyouga | 7bdcd9d507 | fix flash shift short attention (Former-commit-id: e44ad23eafa39b3ac0400b6f97cd440106a87f44) | 2023-10-09 17:54:48 +08:00
hiyouga | b3dfd77356 | fix webui args (Former-commit-id: 64aa75c8cd7c84ab4a0f1dbaf4763765ba973f54) | 2023-10-09 17:13:57 +08:00
hiyouga | 0c1e00574d | fix shift short attention (Former-commit-id: 9a49cce8e6f6b222f74a07bdab40efee6a77b0f1) | 2023-10-09 17:07:46 +08:00
hiyouga | b1bc191c45 | update webui #1086 (Former-commit-id: 65a48bc398f18f71f5f2659b2070e3b9593af243) | 2023-10-09 14:50:14 +08:00
hiyouga | de8a0d689c | fix #1097 (Former-commit-id: c5b8796322d9d48e815038f9fecf0ce39036a4ee) | 2023-10-08 22:29:26 +08:00
hiyouga | 536c32d8d4 | add llamafy_qwen.py (Former-commit-id: 6cdc91543c022edcc98076488f06e809fde9bad7) | 2023-10-08 22:05:36 +08:00
hiyouga | 025bc4bf5c | fix #1068 #1074 (Former-commit-id: 26c6bfd21de06cc56be9a58e2ef69045ea70cc14) | 2023-09-28 14:39:16 +08:00
hiyouga | 571f091232 | fix bug in packed sft dataset (Former-commit-id: 51d26b2af6612e65a91c576da5270028da27b322) | 2023-09-28 01:16:46 +08:00
hiyouga | 51c6c09f02 | tiny fix (Former-commit-id: 35b355b76d2a8f8adf3750a905224e52d03d218f) | 2023-09-28 01:03:04 +08:00
hiyouga | d231f97335 | tiny fix (Former-commit-id: 7451b2ae7e58d0f1857f01a037672a8c53b1bd0d) | 2023-09-28 01:02:11 +08:00
hiyouga | 2bacc9789a | fix #1064 (Former-commit-id: fd4660aa72d981d7efdad465f24a59358626c975) | 2023-09-28 00:53:29 +08:00
hiyouga | b7c28d0378 | fix bug in pretraining (Former-commit-id: 18a2d90bd6e7c3e1e3513e6f9d895e4048b35b04) | 2023-09-28 00:45:20 +08:00
hiyouga | 4617413bde | fix layer norm dtype (Former-commit-id: 67af21961b68d9b54d07b09e444c7140869f26da) | 2023-09-28 00:25:55 +08:00
hiyouga | d1d5ecb403 | fix #1026 (Former-commit-id: d0940d0dbd03d4bbcc955304566b0d5507edf9e6) | 2023-09-27 22:57:09 +08:00
hiyouga | 57dafc37cc | fix #424 (Former-commit-id: daaf89f1126112a73b9f115b0f5617a8cd974a3e) | 2023-09-27 22:49:43 +08:00
hiyouga | 287aea8d97 | fix #1032 (Former-commit-id: 1235b2da5a79ffefd1342054ea8e7dabf47398c1) | 2023-09-27 22:42:16 +08:00
hiyouga | 867e513e18 | refactor finetuning Args (Former-commit-id: be425a70a4c8f051717cf1e4464dbd79dae4c0b5) | 2023-09-27 22:28:06 +08:00
hiyouga | d6f5a3cae9 | support LongLoRA (Former-commit-id: 0832ed37e7947d699f17375648a52f80752c2b6b) | 2023-09-27 21:55:50 +08:00
hiyouga | 889a24ccfa | add CMMLU, update eval script (Former-commit-id: 47f31f06a946eefa5a972e4a566cf3ce05e1e111) | 2023-09-23 21:10:17 +08:00
hiyouga | 7a684600a9 | update evaluate (Former-commit-id: 288137a76ed1528faa39b467da22f6468ba368ee) | 2023-09-23 11:55:31 +08:00
hiyouga | 64414c68e9 | move file (Former-commit-id: 8711ca9b5421f971ee4cb2fada23832f1021577c) | 2023-09-23 11:52:12 +08:00
hiyouga | e4542e1d45 | add MMLU and C-Eval script (Former-commit-id: 3403f876127b4b99c5e3edb2834cc3b9a3a0063f) | 2023-09-23 00:34:17 +08:00
hiyouga | eb41bc4aae | fix #1000 (Former-commit-id: 85de2d0a99e4a81fae890a963ccbb5c6142d52d4) | 2023-09-22 15:00:48 +08:00
hiyouga | 6360d65dbf | fix webui (Former-commit-id: e28485b476816c1bd6c34f7ff9efaa9e3fb85176) | 2023-09-21 19:55:38 +08:00
hiyouga | dd6e9b3cc1 | tiny fix (Former-commit-id: d24ea58c1a44b94227f4cb60f13fc1dd79997d01) | 2023-09-21 19:52:06 +08:00
hiyouga | d8ab75ee44 | fix #944 (Former-commit-id: 032245647848aaa4167086636b6c985268c5fee3) | 2023-09-21 19:51:02 +08:00
hiyouga | 62ac4527b4 | tiny fix (Former-commit-id: 1a7ddd8c1d20dc251f53923bd0ab9f3f1031dd21) | 2023-09-21 15:25:29 +08:00
statelesshz | b07a921f3e | support export model on Ascend NPU (Former-commit-id: 50f94e6d9d62c848db7a3db85fa999d67ddd9f04) | 2023-09-20 10:26:02 +08:00
hiyouga | ebc7b6ebc2 | fix webui (Former-commit-id: 2aa06a5a74d98ec25ed6e1e39df11230670f5bad) | 2023-09-19 18:35:21 +08:00
hiyouga | c19a5a4575 | fix error info (Former-commit-id: b90ed220c5e94086d2b73045eff2440ff1b58c5c) | 2023-09-19 18:30:23 +08:00
hiyouga | 037584adfc | add tests.cal_flops.py (Former-commit-id: 47a119db6c6e937f6ed96f70e3cda6031b9fbd0d) | 2023-09-16 23:40:41 +08:00
hiyouga | 2256514acb | fix #913 (Former-commit-id: d67c11d69277292648dd9889a7321345e2c0c437) | 2023-09-15 20:58:28 +08:00
hiyouga | 5c1708266e | fix #896 (Former-commit-id: 4b70d623d817460de4732749110622e4a1b51958) | 2023-09-14 18:37:34 +08:00
hiyouga | b46a0af117 | fix #887 (Former-commit-id: e131bc03e05ccae3c6ad8bb42ccf2cdcc2cf3cea) | 2023-09-14 17:56:58 +08:00
mmbwf | 0c3e98488a | Update utils.py. Fix parameters load error. (Former-commit-id: 112850364c7fdb53e3a38d42861404fc519108ce) | 2023-09-14 15:38:04 +08:00
hiyouga | 1b762cf22c | fix ppo save model (Former-commit-id: 300ca6d904524f46cb520056e1319a1e9a13d169) | 2023-09-12 16:25:29 +08:00
hiyouga | df0b527c1e | fix #762 #814 (Former-commit-id: 9a30ee5009040afbc524dbac0dad99904b2adf5f) | 2023-09-12 16:10:10 +08:00
hiyouga | 5b55347952 | tiny fix (Former-commit-id: d8ea0691f84c971e6860526714fc9873c350b064) | 2023-09-11 18:27:08 +08:00
hiyouga | 02a08422d3 | Release v0.1.8 (Former-commit-id: d9666411375964d334d0a93ec162b27e05f70d49) | 2023-09-11 17:31:34 +08:00