1605 Commits

Author SHA1 Message Date
hiyouga
febe41a481 fix loading valuehead
Former-commit-id: 7872375d7a0c1d8826206631f6717a91ec49f1b3
2023-06-13 11:13:06 +08:00
hiyouga
fdfc22196c fix generating args
Former-commit-id: 52805a8441bd7b324bd89489de60f18f103c8e4c
2023-06-13 01:33:56 +08:00
hiyouga
0da1b7d9ab support RM metrics, add generating Args
Former-commit-id: c461c6190bc124e98dde7f3cf96a59ce40b26fb0
2023-06-12 15:48:48 +08:00
hoshi-hiyouga
43490e7f22 Merge pull request #26 from BUAADreamer/main
add code for reading from multi files in one directory

Former-commit-id: 87bb48eec34f749c55350d337b5ef9710e732151
2023-06-11 19:06:29 +08:00
BUAADreamer
463e0762c4 update json line file to .jsonl
Former-commit-id: 85e7676c3c1422795a047ffa8587bd4063ad7511
2023-06-11 18:59:19 +08:00
BUAADreamer
ac00fcd114 add some
Former-commit-id: 6982a53ed1f6f9fa03e99623b98fff56bf00317e
2023-06-11 18:55:53 +08:00
BUAADreamer
a976cba730 add code for reading from multi files in one directory
Former-commit-id: 9b80cf08b9f0d4aee896b228fb76399e9a7c9d8b
2023-06-10 16:27:30 +08:00
BUAADreamer
2012cb5cbc add code for reading from multi files in one directory
Former-commit-id: b7ebb83a96619e5111b0faa9da9d0feb8d9cdff0
2023-06-10 15:53:47 +08:00
hiyouga
6978c1625a tiny fix
Former-commit-id: c9c795f9c7cd2228410a12af4ec10d3b59be87db
2023-06-07 16:42:31 +08:00
hiyouga
4de7ae365e tiny fix
Former-commit-id: 267703f1db20e5b39c2e80a37e028d908af7ffb1
2023-06-07 16:02:07 +08:00
hiyouga
0092e863c1 tiny fix
Former-commit-id: 4a9bc72d90b65db80b375cd141484abfbb0dcf0d
2023-06-07 12:58:14 +08:00
hiyouga
99964c1013 add templates
Former-commit-id: 1d0686b2cb9edd4a7d320d11e65b50aab0ebd038
2023-06-07 12:40:44 +08:00
hiyouga
5e96a45bec add belle template
Former-commit-id: c489c8ecbaaa511ddc7dc1de685981531eedd38c
2023-06-07 12:30:11 +08:00
hiyouga
f2dda11101 tiny fix
Former-commit-id: 7115bee4310888ec2e5f104e8d2c1f7127fb6ce6
2023-06-07 12:08:39 +08:00
hiyouga
8875f565ad add prompt template class
Former-commit-id: 3d7e3a38d00aa5d9664824093043951af8c3f707
2023-06-07 11:55:25 +08:00
hiyouga
701a1d84c6 fix inference, add prompt template
Former-commit-id: 3940e50c71472b210bbc1b01248bf85a191c4065
2023-06-07 10:52:35 +08:00
hiyouga
0b903fed18 recover logging
Former-commit-id: d74014496e4ccda2de4482075a91747854facddd
2023-06-06 21:36:37 +08:00
hiyouga
5e5db11833 support distributed quantized training
Former-commit-id: 74ff23a4f36f859f791f7b4be6f1877edc68f12f
2023-06-06 17:39:41 +08:00
hiyouga
5bedf2b21a add API demo from #1
Former-commit-id: c955edcef168da44257c5b50d7bc59266d909782
2023-06-05 21:32:18 +08:00
hoshi-hiyouga
a441a411cb Merge pull request #11 from hiyouga/api
Api

Former-commit-id: 9b2f524ea7f3a28f7413b8ce67e585f1596566a5
2023-06-05 20:58:02 +08:00
hiyouga
1df5013e9f fix bug in web demo
Former-commit-id: 01d6d7a910b9845a0ea38632661ce813e5cfe3a2
2023-06-05 17:58:29 +08:00
hiyouga
ae649012de increase max length in cli demo
Former-commit-id: 0113cdb12728419022b5c01c932a5d52e626a200
2023-06-05 16:49:14 +08:00
hiyouga
9b35130c06 implement stream generating
Former-commit-id: 6cc9535975d823ffef7e1686749b69b40347a8ec
2023-06-05 16:43:44 +08:00
hiyouga
2770a2ee58 tiny fix
Former-commit-id: 3c5da617cdab34c6cae038e3a06d0468ae4c6c86
2023-06-05 15:25:22 +08:00
hiyouga
a96cfbee03 tiny fix
Former-commit-id: 5ce3e0056948aded120b63e365a892f9d8c3c840
2023-06-04 16:35:50 +08:00
hiyouga
363e0da084 tiny fix
Former-commit-id: a98ebf62fb82ffe5aaaea6a1ce3d4c60d23a5728
2023-06-04 12:55:40 +08:00
hiyouga
de447e7aeb support QLoRA
Former-commit-id: d89597e28fe9b91246e58c55eeb9082436940481
2023-06-04 00:08:56 +08:00
hiyouga
d2e80fff76 fix int8 inference
Former-commit-id: d05202943e9634526f96d189288f67852d3d1c40
2023-06-03 23:22:05 +08:00
hiyouga
eaf536378c reduce repetition penalty
Former-commit-id: 0381d93fc10ea5346724fe6295caa565a8eb4f61
2023-06-03 21:57:39 +08:00
hiyouga
4e224fac7c fix int8 inference
Former-commit-id: fcf3506bef28504dd679c2210bdc84e5868e05fe
2023-06-03 21:17:47 +08:00
hiyouga
315e2bea67 add ziya prompt template
Former-commit-id: 321e44ac54a91260cf00a4caa1991708814473fc
2023-06-03 19:05:51 +08:00
hiyouga
5389fdacd4 use low_cpu_mem_usage to speed up loading
Former-commit-id: 7891e4c200566a4a47088e93efd1fbebcb46528e
2023-06-03 18:19:01 +08:00
hiyouga
cba25893d3 add logits processor
Former-commit-id: f6f4b1554ae1e8849b437d705ffa34ce7ebd56bb
2023-06-03 16:34:54 +08:00
hiyouga
92acd4f1a5 remove unused code
Former-commit-id: f5c784dd75e152edb162ea02bf601d279f9de893
2023-06-03 00:10:54 +08:00
hiyouga
bc42983d75 add wechat
Former-commit-id: 0251b1788081a81773d87ca5ca2bac8428489852
2023-06-02 21:47:10 +08:00
hiyouga
1e6a013413 tiny fix
Former-commit-id: d479aaf07a0997737da90c00558c42b32109928f
2023-06-02 19:02:25 +08:00
hiyouga
dc7e3f75c9 fix layer norm name in PPO
Former-commit-id: 3ffc2efb997b717b3efad92a507584276e4bdfa1
2023-06-02 17:30:01 +08:00
hiyouga
b5b6767f4b fix #1
Former-commit-id: b8e556e4632a79251a2225075c570337eeafa559
2023-06-02 14:25:00 +08:00
hiyouga
587d0f5311 alter rewards data type
Former-commit-id: 3eb7eb2d37525da50fe401ab7c59532e6e1ef984
2023-06-02 14:19:51 +08:00
hiyouga
7ef5821cba fix possibly OOM error
Former-commit-id: 0d590dffb41b0e832d9f87d20a23bcd0acd983aa
2023-06-01 23:54:44 +08:00
hiyouga
4edd01fc08 fix bug at inference
Former-commit-id: df9b41af4401006b8040eb53c44dd290b604e0eb
2023-05-31 18:11:53 +08:00
hiyouga
3176b86454 update readme
Former-commit-id: 4054e85c664c541f435619baebcd687f80445d4a
2023-05-31 16:57:43 +08:00
hiyouga
a74fcc4149 support BLOOM models
Former-commit-id: 1314b6ea39a01aa8ac325e1d875ac013d43aec45
2023-05-31 16:54:06 +08:00
hoshi-hiyouga
c8a99f2901 Merge pull request #1 from mMrBun/main
Support conversation via API.

Former-commit-id: 5e64c7446718845444a14c5643c1bc5819562d60
2023-05-30 16:34:00 +08:00
hiyouga
f8d03f3aa9 remove dummy code
Former-commit-id: e6bc89d280945bbf48281107145c40a41d7cbd56
2023-05-30 16:28:00 +08:00
mMrBun
35464ed93d Support conversation via API.
Former-commit-id: 57cfe9128ce1781853555b22b502ec85b8b01941
2023-05-30 15:00:28 +08:00
mMrBun
1d8fa08f4b Support conversation via API.
Former-commit-id: 4d9d0ea083c15fb470ecbb428cb79b6dd48e3e92
2023-05-30 14:46:22 +08:00
hiyouga
a02df1f2f8 update readme
Former-commit-id: 64f7fa45a4dbd173a3e5eb66d17044aa1243cc4b
2023-05-29 21:54:01 +08:00
hiyouga
0fa508b904 update readme
Former-commit-id: eed2773df8081b229d13b1d679bf2913715f23ac
2023-05-29 21:53:02 +08:00
hiyouga
bb6f731461 add pre-training script
Former-commit-id: 935d58de2b3a2eadc4f0bed28c3ad7dee32e9fd5
2023-05-29 21:37:22 +08:00