hiyouga
|
48d6d925f7
|
fix #1558
Former-commit-id: 1740131d63
|
2023-11-19 14:15:47 +08:00 |
|
hiyouga
|
0ed0b8f9c5
|
fix bug in freeze tuning
Former-commit-id: ff52b1779c
|
2023-11-16 14:25:11 +08:00 |
|
hiyouga
|
678052a7ef
|
fix rlhf callback
Former-commit-id: 1817ffc86f
|
2023-11-16 03:26:19 +08:00 |
|
hiyouga
|
b71da932eb
|
fix bug in PPO training
Former-commit-id: 856522a3df
|
2023-11-16 02:32:54 +08:00 |
|
hiyouga
|
eb5a852dd5
|
fix import bug
Former-commit-id: 35b91ea34c
|
2023-11-16 02:27:03 +08:00 |
|
hiyouga
|
f441932bd1
|
support full-parameter PPO
Former-commit-id: ce78303600
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
06a4820836
|
disentangle model from tuner and rename modules
Former-commit-id: 4736344eb1
|
2023-11-15 16:29:09 +08:00 |
|