hiyouga
|
8849bee763
|
recover logging
Former-commit-id: 13d1f0709c774bb5ec5fb2b4e3c66b2f1226afd2
|
2023-06-06 21:36:37 +08:00 |
|
hiyouga
|
794c2fd506
|
support distributed quantized training
Former-commit-id: 4eb17bcf6c8ac51a3ec8cc5459064d1b35c82634
|
2023-06-06 17:39:41 +08:00 |
|
hiyouga
|
1a49937351
|
add API demo from #1
Former-commit-id: 3d8d5ee5d54102dd73856fac3a80922ea3104a06
|
2023-06-05 21:32:18 +08:00 |
|
hoshi-hiyouga
|
2c34b7f858
|
Merge pull request #11 from hiyouga/api
Api
Former-commit-id: 06e1b120e1a8a399fc1f8b435667bd4d5418d75f
|
2023-06-05 20:58:02 +08:00 |
|
hiyouga
|
dc4d9e514e
|
fix bug in web demo
Former-commit-id: a38d57ddd7fcbd2eb373e79f7236f8d2411c52d5
|
2023-06-05 17:58:29 +08:00 |
|
hiyouga
|
666fe30708
|
increase max length in cli demo
Former-commit-id: 56eb99106aa69eef8dab6b3518779db3373e639b
|
2023-06-05 16:49:14 +08:00 |
|
hiyouga
|
e92ac44cd2
|
implement stream generating
Former-commit-id: fe1d9308163699b7c4dd791915788855b2e6854f
|
2023-06-05 16:43:44 +08:00 |
|
hiyouga
|
e94cf814ff
|
tiny fix
Former-commit-id: 44298c12355082740857ba650bf44a18d4d3b40d
|
2023-06-05 15:25:22 +08:00 |
|
hiyouga
|
b982a9df83
|
tiny fix
Former-commit-id: 38b83533a48e2f5a5817b8fc53a37554c42fd932
|
2023-06-04 16:35:50 +08:00 |
|
hiyouga
|
95a6f1759b
|
tiny fix
Former-commit-id: eac9921e5cc7be2b686731f28b19983a07009128
|
2023-06-04 12:55:40 +08:00 |
|
hiyouga
|
4d4636c48e
|
support QLoRA
Former-commit-id: 3b9eee8cd26cfeef945155815175831dec98eb20
|
2023-06-04 00:08:56 +08:00 |
|
hiyouga
|
08d6079140
|
fix int8 inference
Former-commit-id: 1bd13d7ca197edaa9a1143b061249b4fa6003b97
|
2023-06-03 23:22:05 +08:00 |
|
hiyouga
|
3bf4b20d0b
|
reduce repetition penalty
Former-commit-id: 926291940de4b59a40489e6a509fdc0135c8616d
|
2023-06-03 21:57:39 +08:00 |
|
hiyouga
|
4200d5d558
|
fix int8 inference
Former-commit-id: 0f69a0c19ebe05ba6b2d66b56826b6df100e9f32
|
2023-06-03 21:17:47 +08:00 |
|
hiyouga
|
342ee89d28
|
add ziya prompt template
Former-commit-id: de09ee1315759a085e4fcf20e94963293c881aae
|
2023-06-03 19:05:51 +08:00 |
|
hiyouga
|
c7d71dd8af
|
use low_cpu_mem_usage to speed up loading
Former-commit-id: 771f454ff1deee4929927c58feab7dcd3b854f9c
|
2023-06-03 18:19:01 +08:00 |
|
hiyouga
|
42c9c8de39
|
add logits processor
Former-commit-id: dca27b4412e8e41cadcd623582222e1c216db78b
|
2023-06-03 16:34:54 +08:00 |
|
hiyouga
|
1392242958
|
remove unused code
Former-commit-id: ed6161fa6a5f23dfacc52a2a77ddaeeb4adf8443
|
2023-06-03 00:10:54 +08:00 |
|
hiyouga
|
0f6efcc77a
|
add wechat
Former-commit-id: 72a85ccc39e19a800cf638407fd3b53976a5af15
|
2023-06-02 21:47:10 +08:00 |
|
hiyouga
|
36790c4e32
|
tiny fix
Former-commit-id: b8a034807e97730bc95a5f0bf0ca0763d7c6824c
|
2023-06-02 19:02:25 +08:00 |
|
hiyouga
|
6ab22a0181
|
fix layer norm name in PPO
Former-commit-id: e3aaef7d4a37e4aa388a9158c382db8239843a5e
|
2023-06-02 17:30:01 +08:00 |
|
hiyouga
|
b0e9a673be
|
fix #1
Former-commit-id: bd565af3706d74712e50884797ee2e2056a49809
|
2023-06-02 14:25:00 +08:00 |
|
hiyouga
|
4003ddcc3b
|
alter rewards data type
Former-commit-id: 50d9a20f8103fcfb92a3e2a5e6f0055d27b29d53
|
2023-06-02 14:19:51 +08:00 |
|
hiyouga
|
7e1be4c21a
|
fix possibly OOM error
Former-commit-id: e6126244c161dc87b0d4d45b8976c02fc9933545
|
2023-06-01 23:54:44 +08:00 |
|
hiyouga
|
3f3b475412
|
fix bug at inference
Former-commit-id: fd709eacff09d52636541cba009a1ded6aac22dc
|
2023-05-31 18:11:53 +08:00 |
|
hiyouga
|
79011936c9
|
update readme
Former-commit-id: 38ca4292280df93c2037cf7075f5b0fe5e994a5f
|
2023-05-31 16:57:43 +08:00 |
|
hiyouga
|
3bfb086399
|
support BLOOM models
Former-commit-id: 740a5daf5634f70a61b41fa8a31ee4a587fa03f3
|
2023-05-31 16:54:06 +08:00 |
|
hoshi-hiyouga
|
1a5eacc98a
|
Merge pull request #1 from mMrBun/main
Support conversation via API.
Former-commit-id: c36620ece4b65301d38704a0482730c3e6a9445b
|
2023-05-30 16:34:00 +08:00 |
|
hiyouga
|
ddb456bbcb
|
remove dummy code
Former-commit-id: a72492e6490c44a7edccd572da73c47d6f278cc7
|
2023-05-30 16:28:00 +08:00 |
|
mMrBun
|
bc2e530e16
|
Support conversation via API.
Former-commit-id: 748b804bac8e2c972246bc7f7d50884fe105a7fe
|
2023-05-30 15:00:28 +08:00 |
|
mMrBun
|
21ef968922
|
Support conversation via API.
Former-commit-id: e82168243060ba77ff458a5b1e8479468b8e7265
|
2023-05-30 14:46:22 +08:00 |
|
hiyouga
|
6ba162ef6c
|
update readme
Former-commit-id: 6ccdfb400118774b78dbee5a7aee5969fb9f480c
|
2023-05-29 21:54:01 +08:00 |
|
hiyouga
|
e6a1469467
|
update readme
Former-commit-id: 7698f9aa9a4ea579f1eefaac637675043581e9c8
|
2023-05-29 21:53:02 +08:00 |
|
hiyouga
|
4c7c96e656
|
add pre-training script
Former-commit-id: 8ff96509fa621054368919988a15b50da1891852
|
2023-05-29 21:37:22 +08:00 |
|
hiyouga
|
0ab4419b86
|
fix checkpoint loading
Former-commit-id: c0e5df92d601966444956c65482441bd757fd7a1
|
2023-05-29 17:43:16 +08:00 |
|
hiyouga
|
03338163c2
|
tiny fix
Former-commit-id: ce71cc8b6db5d13b87b7d0302f4176c5c76ac4b2
|
2023-05-29 09:42:29 +08:00 |
|
hiyouga
|
b1517e7a0e
|
tiny fix
Former-commit-id: 166c837b95d42513f7b977b189822b5c7980606d
|
2023-05-28 21:48:33 +08:00 |
|
hiyouga
|
87ba09e035
|
use fp16 model, add logcallback
Former-commit-id: 0c9fda01e3c61727c939efd9d9398f657a2d69b6
|
2023-05-28 21:30:28 +08:00 |
|
hiyouga
|
54b8ce7b63
|
Initial commit
Former-commit-id: 769c6ab56be0c9d26e9289f61ac54a4068d935c1
|
2023-05-28 18:09:04 +08:00 |
|