hiyouga
|
e4e36a2d74
|
set use_cache before saving model
Former-commit-id: a6a0161f32f600f3001188ff4c7929c5f13c2a03
|
2023-07-04 23:18:20 +08:00 |
|
hiyouga
|
6df5c4ccef
|
fix seq2seq predictions
Former-commit-id: 045316d62f713311bdabdfb56be442238e03a007
|
2023-07-04 22:56:51 +08:00 |
|
hiyouga
|
d720f67e6c
|
fix typo
Former-commit-id: 23c0d36abe881d9be839d5c647841bdef178307b
|
2023-06-30 10:09:59 +08:00 |
|
hiyouga
|
6290955e84
|
update loading logic
Former-commit-id: f1da17bb0deeb39a29da4dc208951d1ad69bb8ba
|
2023-06-28 12:07:16 +08:00 |
|
hiyouga
|
6b6430489a
|
fix loading best model
Former-commit-id: cf7db6855d353a57344c78d6b56478ffb14ceff2
|
2023-06-28 01:55:12 +08:00 |
|
hiyouga
|
4ae8a20e1d
|
fix RM accuracy
Former-commit-id: 532a385ea60693fdf835e6bc8e240ff8d55ff3a7
|
2023-06-28 01:40:13 +08:00 |
|
hiyouga
|
e19dcc13e3
|
tiny fix
Former-commit-id: 994f2a79831a1dea8425e3eff62f0bc8238b78d6
|
2023-06-27 23:54:24 +08:00 |
|
hiyouga
|
2d22961c7d
|
fix initializing data arguments
Former-commit-id: e6b83c8b87cb93358086121a6f9ccaba5dfa7497
|
2023-06-27 22:50:23 +08:00 |
|
hiyouga
|
640f774d30
|
support save full model, replace BOS token
Former-commit-id: 32e56c290802ba971c08f471b94a33daec85671a
|
2023-06-27 21:40:11 +08:00 |
|
hiyouga
|
33c2b063c6
|
fix decoding in seq2seq
Former-commit-id: 44227f651bf9a6a4741b3e0845cdb5f2ab58ea63
|
2023-06-27 19:33:08 +08:00 |
|
hiyouga
|
a8f580d753
|
fix generation in seq2seq.py
Former-commit-id: f847d196beb6d04e456d64665a10dc9316a869f2
|
2023-06-26 18:07:06 +08:00 |
|
hiyouga
|
3aa1ca66e0
|
support prefixes, loading multiple local files
Former-commit-id: 6672e09836ed0103693a381ece010377bd0ef4f8
|
2023-06-26 15:32:40 +08:00 |
|
hiyouga
|
83346e86af
|
update api
Former-commit-id: a90db46e336a657d5fcf480986bfc68c77ad416b
|
2023-06-26 13:39:57 +08:00 |
|
hiyouga
|
f9332bc329
|
update readme
Former-commit-id: 6b08adc8219caacefa8d7b5a618e33ccd6060eec
|
2023-06-23 00:17:05 +08:00 |
|
hiyouga
|
7daf6c8b8e
|
update API
Former-commit-id: b5c47b0bef022e90e42406e28b6282492419e3fb
|
2023-06-22 20:46:24 +08:00 |
|
hiyouga
|
391bf1c699
|
match api with OpenAI format
Former-commit-id: 9cbe2b98b024393817e86ff8e3ff1636776fa263
|
2023-06-22 20:27:00 +08:00 |
|
Bun
|
810d9e36ea
|
Compatible with OpenAI API.
Former-commit-id: d21d51377bf7834a019efc009f4543b14c438389
|
2023-06-21 14:45:04 +08:00 |
|
hiyouga
|
de2c418637
|
add default template
Former-commit-id: c64fb6b83fdbedd62073417213f0215207ff1311
|
2023-06-16 21:12:17 +08:00 |
|
hiyouga
|
ee22b80ad0
|
fix freeze layers
Former-commit-id: 8a16359c121d543aeea3650612df46fc1bad1428
|
2023-06-16 17:38:21 +08:00 |
|
hiyouga
|
de9da40b18
|
add source prefix
Former-commit-id: 4f0fe959fcd2dded56a95ff3ad620bd381ae17a6
|
2023-06-16 16:32:17 +08:00 |
|
hiyouga
|
3836aadacf
|
support loading lora from hub
Former-commit-id: 0b34c962bc3368dca62b18ad6c27a0293c3affa5
|
2023-06-16 00:02:17 +08:00 |
|
hiyouga
|
194c5d2bee
|
support baichuan model
Former-commit-id: d683042fbcb2ee43b9823262d0a65b64f4cb54cb
|
2023-06-15 16:02:01 +08:00 |
|
hiyouga
|
496846e819
|
fix bug in template vanilla
Former-commit-id: 9b51e44c95af116aec34e7b6495935420f7c6c27
|
2023-06-15 14:36:55 +08:00 |
|
hiyouga
|
c42562d7ae
|
add BOS token in pre-training
Former-commit-id: c57cf5d4a46c57c6f698e5cfd0fd59cce703094d
|
2023-06-15 01:46:17 +08:00 |
|
hiyouga
|
aa1bb8a9a2
|
support multiturn training like FastChat
Former-commit-id: 629cafb1a09924e82d7ea1f9fba318d3f5593196
|
2023-06-14 22:27:39 +08:00 |
|
hiyouga
|
6f655e3916
|
fix loading valuehead
Former-commit-id: 7872375d7a0c1d8826206631f6717a91ec49f1b3
|
2023-06-13 11:13:06 +08:00 |
|
hiyouga
|
6828f07d54
|
fix generating args
Former-commit-id: 52805a8441bd7b324bd89489de60f18f103c8e4c
|
2023-06-13 01:33:56 +08:00 |
|
hiyouga
|
4724ae3492
|
support RM metrics, add generating Args
Former-commit-id: c461c6190bc124e98dde7f3cf96a59ce40b26fb0
|
2023-06-12 15:48:48 +08:00 |
|
BUAADreamer
|
5b93ca6c39
|
add code for reading from multi files in one directory
Former-commit-id: 9b80cf08b9f0d4aee896b228fb76399e9a7c9d8b
|
2023-06-10 16:27:30 +08:00 |
|
BUAADreamer
|
ef6c5ae18a
|
add code for reading from multi files in one directory
Former-commit-id: b7ebb83a96619e5111b0faa9da9d0feb8d9cdff0
|
2023-06-10 15:53:47 +08:00 |
|
hiyouga
|
03c92c79ff
|
tiny fix
Former-commit-id: c9c795f9c7cd2228410a12af4ec10d3b59be87db
|
2023-06-07 16:42:31 +08:00 |
|
hiyouga
|
fc6091e118
|
tiny fix
Former-commit-id: 267703f1db20e5b39c2e80a37e028d908af7ffb1
|
2023-06-07 16:02:07 +08:00 |
|
hiyouga
|
025670b4f6
|
tiny fix
Former-commit-id: 4a9bc72d90b65db80b375cd141484abfbb0dcf0d
|
2023-06-07 12:58:14 +08:00 |
|
hiyouga
|
d6b32dd9ea
|
add templates
Former-commit-id: 1d0686b2cb9edd4a7d320d11e65b50aab0ebd038
|
2023-06-07 12:40:44 +08:00 |
|
hiyouga
|
f57dae4a1a
|
add belle template
Former-commit-id: c489c8ecbaaa511ddc7dc1de685981531eedd38c
|
2023-06-07 12:30:11 +08:00 |
|
hiyouga
|
5e2ec2d104
|
tiny fix
Former-commit-id: 7115bee4310888ec2e5f104e8d2c1f7127fb6ce6
|
2023-06-07 12:08:39 +08:00 |
|
hiyouga
|
b9feb82e4e
|
add prompt template class
Former-commit-id: 3d7e3a38d00aa5d9664824093043951af8c3f707
|
2023-06-07 11:55:25 +08:00 |
|
hiyouga
|
3da427a665
|
fix inference, add prompt template
Former-commit-id: 3940e50c71472b210bbc1b01248bf85a191c4065
|
2023-06-07 10:52:35 +08:00 |
|
hiyouga
|
12094c1db5
|
recover logging
Former-commit-id: d74014496e4ccda2de4482075a91747854facddd
|
2023-06-06 21:36:37 +08:00 |
|
hiyouga
|
bf5ad34196
|
support distributed quantized training
Former-commit-id: 74ff23a4f36f859f791f7b4be6f1877edc68f12f
|
2023-06-06 17:39:41 +08:00 |
|
hiyouga
|
ac6f50dedf
|
add API demo from #1
Former-commit-id: c955edcef168da44257c5b50d7bc59266d909782
|
2023-06-05 21:32:18 +08:00 |
|
hoshi-hiyouga
|
8fd9ef924d
|
Merge pull request #11 from hiyouga/api
Api
Former-commit-id: 9b2f524ea7f3a28f7413b8ce67e585f1596566a5
|
2023-06-05 20:58:02 +08:00 |
|
hiyouga
|
a409e1f42c
|
fix bug in web demo
Former-commit-id: 01d6d7a910b9845a0ea38632661ce813e5cfe3a2
|
2023-06-05 17:58:29 +08:00 |
|
hiyouga
|
3f5869111b
|
increase max length in cli demo
Former-commit-id: 0113cdb12728419022b5c01c932a5d52e626a200
|
2023-06-05 16:49:14 +08:00 |
|
hiyouga
|
f9c51a8340
|
implement stream generating
Former-commit-id: 6cc9535975d823ffef7e1686749b69b40347a8ec
|
2023-06-05 16:43:44 +08:00 |
|
hiyouga
|
a817801c0f
|
tiny fix
Former-commit-id: 3c5da617cdab34c6cae038e3a06d0468ae4c6c86
|
2023-06-05 15:25:22 +08:00 |
|
hiyouga
|
063a83ab4e
|
tiny fix
Former-commit-id: 5ce3e0056948aded120b63e365a892f9d8c3c840
|
2023-06-04 16:35:50 +08:00 |
|
hiyouga
|
eebe71699b
|
tiny fix
Former-commit-id: a98ebf62fb82ffe5aaaea6a1ce3d4c60d23a5728
|
2023-06-04 12:55:40 +08:00 |
|
hiyouga
|
5f44112cf5
|
support QLoRA
Former-commit-id: d89597e28fe9b91246e58c55eeb9082436940481
|
2023-06-04 00:08:56 +08:00 |
|
hiyouga
|
2308d5a179
|
fix int8 inference
Former-commit-id: d05202943e9634526f96d189288f67852d3d1c40
|
2023-06-03 23:22:05 +08:00 |
|