hiyouga | 64fc9ba678 | refactor evaluation, upgrade trl to 074; Former-commit-id: ed09ebe2c1926ffdb0520b3866f7fd03a9aed046 | 2023-11-13 22:20:35 +08:00 |
hiyouga | 21ac46e439 | use seed in evaluate.py; Former-commit-id: ab5cac1dfa681933f3266827f80068ce798b4c56 | 2023-11-06 18:17:51 +08:00 |
hiyouga | c0658711ca | fix tokenizer padding side in evaluate.py; Former-commit-id: bcb43ff8ba1946c1f7e7865c9d0fb47ba276935d | 2023-10-21 00:30:04 +08:00 |
hiyouga | d602f06882 | fix #1232; Former-commit-id: 49975755d47344e362145c52548fdda8783f2c0c | 2023-10-20 23:28:52 +08:00 |
hiyouga | a2d08ce961 | add averaging in evaluation; Former-commit-id: b39d6e0b8658e1c69bbaf6bcb6cfaa8f7af30110 | 2023-10-10 23:16:31 +08:00 |
hiyouga | 73c48d0463 | add CMMLU, update eval script; Former-commit-id: 47f31f06a946eefa5a972e4a566cf3ce05e1e111 | 2023-09-23 21:10:17 +08:00 |
hiyouga | f7cecd20e3 | update evaluate; Former-commit-id: 288137a76ed1528faa39b467da22f6468ba368ee | 2023-09-23 11:55:31 +08:00 |
hiyouga | 2bc64a7636 | move file; Former-commit-id: 8711ca9b5421f971ee4cb2fada23832f1021577c | 2023-09-23 11:52:12 +08:00 |