hiyouga
|
0092e863c1
|
tiny fix
Former-commit-id: 4a9bc72d90b65db80b375cd141484abfbb0dcf0d
|
2023-06-07 12:58:14 +08:00 |
|
hiyouga
|
f2dda11101
|
tiny fix
Former-commit-id: 7115bee4310888ec2e5f104e8d2c1f7127fb6ce6
|
2023-06-07 12:08:39 +08:00 |
|
hiyouga
|
8875f565ad
|
add prompt template class
Former-commit-id: 3d7e3a38d00aa5d9664824093043951af8c3f707
|
2023-06-07 11:55:25 +08:00 |
|
hiyouga
|
701a1d84c6
|
fix inference, add prompt template
Former-commit-id: 3940e50c71472b210bbc1b01248bf85a191c4065
|
2023-06-07 10:52:35 +08:00 |
|
hiyouga
|
0b903fed18
|
recover logging
Former-commit-id: d74014496e4ccda2de4482075a91747854facddd
|
2023-06-06 21:36:37 +08:00 |
|
hiyouga
|
5e5db11833
|
support distributed quantized training
Former-commit-id: 74ff23a4f36f859f791f7b4be6f1877edc68f12f
|
2023-06-06 17:39:41 +08:00 |
|
hiyouga
|
2770a2ee58
|
tiny fix
Former-commit-id: 3c5da617cdab34c6cae038e3a06d0468ae4c6c86
|
2023-06-05 15:25:22 +08:00 |
|
hiyouga
|
a96cfbee03
|
tiny fix
Former-commit-id: 5ce3e0056948aded120b63e365a892f9d8c3c840
|
2023-06-04 16:35:50 +08:00 |
|
hiyouga
|
363e0da084
|
tiny fix
Former-commit-id: a98ebf62fb82ffe5aaaea6a1ce3d4c60d23a5728
|
2023-06-04 12:55:40 +08:00 |
|
hiyouga
|
de447e7aeb
|
support QLoRA
Former-commit-id: d89597e28fe9b91246e58c55eeb9082436940481
|
2023-06-04 00:08:56 +08:00 |
|
hiyouga
|
d2e80fff76
|
fix int8 inference
Former-commit-id: d05202943e9634526f96d189288f67852d3d1c40
|
2023-06-03 23:22:05 +08:00 |
|
hiyouga
|
4e224fac7c
|
fix int8 inference
Former-commit-id: fcf3506bef28504dd679c2210bdc84e5868e05fe
|
2023-06-03 21:17:47 +08:00 |
|
hiyouga
|
315e2bea67
|
add ziya prompt template
Former-commit-id: 321e44ac54a91260cf00a4caa1991708814473fc
|
2023-06-03 19:05:51 +08:00 |
|
hiyouga
|
5389fdacd4
|
use low_cpu_mem_usage to speed up loading
Former-commit-id: 7891e4c200566a4a47088e93efd1fbebcb46528e
|
2023-06-03 18:19:01 +08:00 |
|
hiyouga
|
cba25893d3
|
add logits processor
Former-commit-id: f6f4b1554ae1e8849b437d705ffa34ce7ebd56bb
|
2023-06-03 16:34:54 +08:00 |
|
hiyouga
|
587d0f5311
|
alter rewards data type
Former-commit-id: 3eb7eb2d37525da50fe401ab7c59532e6e1ef984
|
2023-06-02 14:19:51 +08:00 |
|
hiyouga
|
7ef5821cba
|
fix possibly OOM error
Former-commit-id: 0d590dffb41b0e832d9f87d20a23bcd0acd983aa
|
2023-06-01 23:54:44 +08:00 |
|
hiyouga
|
a74fcc4149
|
support BLOOM models
Former-commit-id: 1314b6ea39a01aa8ac325e1d875ac013d43aec45
|
2023-05-31 16:54:06 +08:00 |
|
hiyouga
|
f8d03f3aa9
|
remove dummy code
Former-commit-id: e6bc89d280945bbf48281107145c40a41d7cbd56
|
2023-05-30 16:28:00 +08:00 |
|
hiyouga
|
bb6f731461
|
add pre-training script
Former-commit-id: 935d58de2b3a2eadc4f0bed28c3ad7dee32e9fd5
|
2023-05-29 21:37:22 +08:00 |
|
hiyouga
|
6f89f64c73
|
fix checkpoint loading
Former-commit-id: d31aa5c2c0bcb6a4ef4a62e21693548dd9acaae6
|
2023-05-29 17:43:16 +08:00 |
|
hiyouga
|
e158cd8b32
|
tiny fix
Former-commit-id: eae79707d31fd8be2cf4bee4d610557bbd49f6e7
|
2023-05-29 09:42:29 +08:00 |
|
hiyouga
|
a4384e442c
|
use fp16 model, add logcallback
Former-commit-id: bea275d51338b49ce855eec0178e759607265e3d
|
2023-05-28 21:30:28 +08:00 |
|
hiyouga
|
54574f1dfa
|
Initial commit
Former-commit-id: 5ca8e1d63727e7bcb8cab16542c763c47e48184a
|
2023-05-28 18:09:04 +08:00 |
|