10 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| hiyouga | 8849bee763 | recover logging (Former-commit-id: 13d1f0709c774bb5ec5fb2b4e3c66b2f1226afd2) | 2023-06-06 21:36:37 +08:00 |
| hiyouga | 794c2fd506 | support distributed quantized training (Former-commit-id: 4eb17bcf6c8ac51a3ec8cc5459064d1b35c82634) | 2023-06-06 17:39:41 +08:00 |
| hiyouga | 42c9c8de39 | add logits processor (Former-commit-id: dca27b4412e8e41cadcd623582222e1c216db78b) | 2023-06-03 16:34:54 +08:00 |
| hiyouga | 1392242958 | remove unused code (Former-commit-id: ed6161fa6a5f23dfacc52a2a77ddaeeb4adf8443) | 2023-06-03 00:10:54 +08:00 |
| hiyouga | 6ab22a0181 | fix layer norm name in PPO (Former-commit-id: e3aaef7d4a37e4aa388a9158c382db8239843a5e) | 2023-06-02 17:30:01 +08:00 |
| hiyouga | 4003ddcc3b | alter rewards data type (Former-commit-id: 50d9a20f8103fcfb92a3e2a5e6f0055d27b29d53) | 2023-06-02 14:19:51 +08:00 |
| hiyouga | 3bfb086399 | support BLOOM models (Former-commit-id: 740a5daf5634f70a61b41fa8a31ee4a587fa03f3) | 2023-05-31 16:54:06 +08:00 |
| hiyouga | b1517e7a0e | tiny fix (Former-commit-id: 166c837b95d42513f7b977b189822b5c7980606d) | 2023-05-28 21:48:33 +08:00 |
| hiyouga | 87ba09e035 | use fp16 model, add logcallback (Former-commit-id: 0c9fda01e3c61727c939efd9d9398f657a2d69b6) | 2023-05-28 21:30:28 +08:00 |
| hiyouga | 54b8ce7b63 | Initial commit (Former-commit-id: 769c6ab56be0c9d26e9289f61ac54a4068d935c1) | 2023-05-28 18:09:04 +08:00 |