hiyouga
|
1cb75c81d7
|
fix #1550
Former-commit-id: c12acd21a5a500892ed739c79327ccd39fddad5b
|
2023-11-17 17:23:13 +08:00 |
|
hiyouga
|
05478a0a2d
|
fix bug in web ui
Former-commit-id: a598f145ec903dd2b2c984d951b6c450b142ece5
|
2023-11-16 15:21:24 +08:00 |
|
hiyouga
|
e83b0cbafb
|
fix rlhf callback
Former-commit-id: f5485452d660caef56474cb7dc37abbe4f34599e
|
2023-11-16 03:26:19 +08:00 |
|
hiyouga
|
685d0c975a
|
support full-parameter PPO
Former-commit-id: 4af967d69475e1c9fdf1a7983cd6b83bd431abff
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
5a206d54c9
|
disentangle model from tuner and rename modules
Former-commit-id: 02cbf91e7e424f8379c1fed01b82a5f7a83b6947
|
2023-11-15 16:29:09 +08:00 |
|