hiyouga | 9b8b6623ac | add logits processor (Former-commit-id: f6f4b1554ae1e8849b437d705ffa34ce7ebd56bb) | 2023-06-03 16:34:54 +08:00 |
hiyouga | ec48d06b9e | remove unused code (Former-commit-id: f5c784dd75e152edb162ea02bf601d279f9de893) | 2023-06-03 00:10:54 +08:00 |
hiyouga | 217b89cf7e | add wechat (Former-commit-id: 0251b1788081a81773d87ca5ca2bac8428489852) | 2023-06-02 21:47:10 +08:00 |
hiyouga | 382afc3822 | tiny fix (Former-commit-id: d479aaf07a0997737da90c00558c42b32109928f) | 2023-06-02 19:02:25 +08:00 |
hiyouga | 09997a25d3 | fix layer norm name in PPO (Former-commit-id: 3ffc2efb997b717b3efad92a507584276e4bdfa1) | 2023-06-02 17:30:01 +08:00 |
hiyouga | 58c8b29913 | fix #1 (Former-commit-id: b8e556e4632a79251a2225075c570337eeafa559) | 2023-06-02 14:25:00 +08:00 |
hiyouga | e9ab06678f | alter rewards data type (Former-commit-id: 3eb7eb2d37525da50fe401ab7c59532e6e1ef984) | 2023-06-02 14:19:51 +08:00 |
hiyouga | 896dbfec16 | fix possibly OOM error (Former-commit-id: 0d590dffb41b0e832d9f87d20a23bcd0acd983aa) | 2023-06-01 23:54:44 +08:00 |
hiyouga | 1512711ca2 | fix bug at inference (Former-commit-id: df9b41af4401006b8040eb53c44dd290b604e0eb) | 2023-05-31 18:11:53 +08:00 |
hiyouga | a79df3500b | update readme (Former-commit-id: 4054e85c664c541f435619baebcd687f80445d4a) | 2023-05-31 16:57:43 +08:00 |
hiyouga | 693c049eac | support BLOOM models (Former-commit-id: 1314b6ea39a01aa8ac325e1d875ac013d43aec45) | 2023-05-31 16:54:06 +08:00 |
hoshi-hiyouga | 7492e8f208 | Merge pull request #1 from mMrBun/main: Support conversation via API. (Former-commit-id: 5e64c7446718845444a14c5643c1bc5819562d60) | 2023-05-30 16:34:00 +08:00 |
hiyouga | 181c776b58 | remove dummy code (Former-commit-id: e6bc89d280945bbf48281107145c40a41d7cbd56) | 2023-05-30 16:28:00 +08:00 |
mMrBun | ef0aceaa50 | Support conversation via API. (Former-commit-id: 57cfe9128ce1781853555b22b502ec85b8b01941) | 2023-05-30 15:00:28 +08:00 |
mMrBun | a18c6c0560 | Support conversation via API. (Former-commit-id: 4d9d0ea083c15fb470ecbb428cb79b6dd48e3e92) | 2023-05-30 14:46:22 +08:00 |
hiyouga | b6ed5176e1 | update readme (Former-commit-id: 64f7fa45a4dbd173a3e5eb66d17044aa1243cc4b) | 2023-05-29 21:54:01 +08:00 |
hiyouga | bda71e579b | update readme (Former-commit-id: eed2773df8081b229d13b1d679bf2913715f23ac) | 2023-05-29 21:53:02 +08:00 |
hiyouga | 33fee45217 | add pre-training script (Former-commit-id: 935d58de2b3a2eadc4f0bed28c3ad7dee32e9fd5) | 2023-05-29 21:37:22 +08:00 |
hiyouga | 304be6dc28 | fix checkpoint loading (Former-commit-id: d31aa5c2c0bcb6a4ef4a62e21693548dd9acaae6) | 2023-05-29 17:43:16 +08:00 |
hiyouga | 35d04a2c05 | tiny fix (Former-commit-id: eae79707d31fd8be2cf4bee4d610557bbd49f6e7) | 2023-05-29 09:42:29 +08:00 |
hiyouga | 83fc73c580 | tiny fix (Former-commit-id: 08f7e0862b9df353a0e4d8274617c1a5e6fa6619) | 2023-05-28 21:48:33 +08:00 |
hiyouga | 1fc551e1be | use fp16 model, add logcallback (Former-commit-id: bea275d51338b49ce855eec0178e759607265e3d) | 2023-05-28 21:30:28 +08:00 |
hiyouga | 17024ebc1a | Initial commit (Former-commit-id: 5ca8e1d63727e7bcb8cab16542c763c47e48184a) | 2023-05-28 18:09:04 +08:00 |