hiyouga
|
c6dfbfa62c
|
refactor evaluation, upgrade trl to 074
Former-commit-id: ed09ebe2c1926ffdb0520b3866f7fd03a9aed046
|
2023-11-13 22:20:35 +08:00 |
|
hiyouga
|
441d9ae0ef
|
use seed in evaluate.py
Former-commit-id: ab5cac1dfa681933f3266827f80068ce798b4c56
|
2023-11-06 18:17:51 +08:00 |
|
hiyouga
|
ffa58ecc4d
|
fix tokenizer padding side in evaluate.py
Former-commit-id: bcb43ff8ba1946c1f7e7865c9d0fb47ba276935d
|
2023-10-21 00:30:04 +08:00 |
|
hiyouga
|
e79f0755a6
|
fix #1232
Former-commit-id: 49975755d47344e362145c52548fdda8783f2c0c
|
2023-10-20 23:28:52 +08:00 |
|
hiyouga
|
04008ea0a4
|
add averaging in evaluation
Former-commit-id: b39d6e0b8658e1c69bbaf6bcb6cfaa8f7af30110
|
2023-10-10 23:16:31 +08:00 |
|
hiyouga
|
889a24ccfa
|
add CMMLU, update eval script
Former-commit-id: 47f31f06a946eefa5a972e4a566cf3ce05e1e111
|
2023-09-23 21:10:17 +08:00 |
|
hiyouga
|
7a684600a9
|
update evaluate
Former-commit-id: 288137a76ed1528faa39b467da22f6468ba368ee
|
2023-09-23 11:55:31 +08:00 |
|
hiyouga
|
64414c68e9
|
move file
Former-commit-id: 8711ca9b5421f971ee4cb2fada23832f1021577c
|
2023-09-23 11:52:12 +08:00 |
|