8 Commits

Author SHA1 Message Date
hiyouga
9b8b6623ac add logits processor
Former-commit-id: f6f4b1554ae1e8849b437d705ffa34ce7ebd56bb
2023-06-03 16:34:54 +08:00
hiyouga
ec48d06b9e remove unused code
Former-commit-id: f5c784dd75e152edb162ea02bf601d279f9de893
2023-06-03 00:10:54 +08:00
hiyouga
09997a25d3 fix layer norm name in PPO
Former-commit-id: 3ffc2efb997b717b3efad92a507584276e4bdfa1
2023-06-02 17:30:01 +08:00
hiyouga
e9ab06678f alter rewards data type
Former-commit-id: 3eb7eb2d37525da50fe401ab7c59532e6e1ef984
2023-06-02 14:19:51 +08:00
hiyouga
693c049eac support BLOOM models
Former-commit-id: 1314b6ea39a01aa8ac325e1d875ac013d43aec45
2023-05-31 16:54:06 +08:00
hiyouga
83fc73c580 tiny fix
Former-commit-id: 08f7e0862b9df353a0e4d8274617c1a5e6fa6619
2023-05-28 21:48:33 +08:00
hiyouga
1fc551e1be use fp16 model, add logcallback
Former-commit-id: bea275d51338b49ce855eec0178e759607265e3d
2023-05-28 21:30:28 +08:00
hiyouga
17024ebc1a Initial commit
Former-commit-id: 5ca8e1d63727e7bcb8cab16542c763c47e48184a
2023-05-28 18:09:04 +08:00