hiyouga
|
653ce9397e
|
fix freeze layers
Former-commit-id: a6c4b141cd
|
2023-06-16 17:38:21 +08:00 |
|
hiyouga
|
36ea46e85c
|
add source prefix
Former-commit-id: fc4d8155b3
|
2023-06-16 16:32:17 +08:00 |
|
hiyouga
|
c6d56e7109
|
support loading lora from hub
Former-commit-id: 0574b590ef
|
2023-06-16 00:02:17 +08:00 |
|
hiyouga
|
a68808d6d9
|
support baichuan model
Former-commit-id: 0cee6ad67f
|
2023-06-15 16:02:01 +08:00 |
|
hiyouga
|
50494db8d6
|
fix bug in template vanilla
Former-commit-id: c527399424
|
2023-06-15 14:36:55 +08:00 |
|
hiyouga
|
dd1e7ed3cf
|
add BOS token in pre-training
Former-commit-id: d668f8b501
|
2023-06-15 01:46:17 +08:00 |
|
hiyouga
|
3419396945
|
support multiturn training like FastChat
Former-commit-id: b6faf0207d
|
2023-06-14 22:27:39 +08:00 |
|
hiyouga
|
ca90a1e6d9
|
fix loading valuehead
Former-commit-id: 875e8e2349
|
2023-06-13 11:13:06 +08:00 |
|
hiyouga
|
c92bfb158f
|
fix generating args
Former-commit-id: 531a3764d9
|
2023-06-13 01:33:56 +08:00 |
|
hiyouga
|
1fbda5d139
|
support RM metrics, add generating Args
Former-commit-id: cec6524d6b
|
2023-06-12 15:48:48 +08:00 |
|
BUAADreamer
|
b1c6ee9cf5
|
add code for reading from multi files in one directory
Former-commit-id: a2af9df5a9
|
2023-06-10 16:27:30 +08:00 |
|
BUAADreamer
|
53727aee3e
|
add code for reading from multi files in one directory
Former-commit-id: 3dd5f9a874
|
2023-06-10 15:53:47 +08:00 |
|
hiyouga
|
587d7a907f
|
tiny fix
Former-commit-id: 2ba5d69c7f
|
2023-06-07 16:42:31 +08:00 |
|
hiyouga
|
fb9dedcb36
|
tiny fix
Former-commit-id: 16c2860d56
|
2023-06-07 16:02:07 +08:00 |
|
hiyouga
|
37c6234126
|
tiny fix
Former-commit-id: edafb97733
|
2023-06-07 12:58:14 +08:00 |
|
hiyouga
|
c43d3f2460
|
add templates
Former-commit-id: 3875b19a34
|
2023-06-07 12:40:44 +08:00 |
|
hiyouga
|
6e5414bf1d
|
add belle template
Former-commit-id: 17acf3a3eb
|
2023-06-07 12:30:11 +08:00 |
|
hiyouga
|
93f2e35035
|
tiny fix
Former-commit-id: ce43386080
|
2023-06-07 12:08:39 +08:00 |
|
hiyouga
|
ce08b4a7ec
|
add prompt template class
Former-commit-id: 909af8f496
|
2023-06-07 11:55:25 +08:00 |
|
hiyouga
|
4cd43f018b
|
fix inference, add prompt template
Former-commit-id: 5d021d4ad5
|
2023-06-07 10:52:35 +08:00 |
|
hiyouga
|
8849bee763
|
recover logging
Former-commit-id: 13d1f0709c
|
2023-06-06 21:36:37 +08:00 |
|
hiyouga
|
794c2fd506
|
support distributed quantized training
Former-commit-id: 4eb17bcf6c
|
2023-06-06 17:39:41 +08:00 |
|
hiyouga
|
1a49937351
|
add API demo from #1
Former-commit-id: 3d8d5ee5d5
|
2023-06-05 21:32:18 +08:00 |
|
hoshi-hiyouga
|
2c34b7f858
|
Merge pull request #11 from hiyouga/api
Api
Former-commit-id: 06e1b120e1
|
2023-06-05 20:58:02 +08:00 |
|
hiyouga
|
dc4d9e514e
|
fix bug in web demo
Former-commit-id: a38d57ddd7
|
2023-06-05 17:58:29 +08:00 |
|
hiyouga
|
666fe30708
|
increase max length in cli demo
Former-commit-id: 56eb99106a
|
2023-06-05 16:49:14 +08:00 |
|
hiyouga
|
e92ac44cd2
|
implement stream generating
Former-commit-id: fe1d930816
|
2023-06-05 16:43:44 +08:00 |
|
hiyouga
|
e94cf814ff
|
tiny fix
Former-commit-id: 44298c1235
|
2023-06-05 15:25:22 +08:00 |
|
hiyouga
|
b982a9df83
|
tiny fix
Former-commit-id: 38b83533a4
|
2023-06-04 16:35:50 +08:00 |
|
hiyouga
|
95a6f1759b
|
tiny fix
Former-commit-id: eac9921e5c
|
2023-06-04 12:55:40 +08:00 |
|
hiyouga
|
4d4636c48e
|
support QLoRA
Former-commit-id: 3b9eee8cd2
|
2023-06-04 00:08:56 +08:00 |
|
hiyouga
|
08d6079140
|
fix int8 inference
Former-commit-id: 1bd13d7ca1
|
2023-06-03 23:22:05 +08:00 |
|
hiyouga
|
3bf4b20d0b
|
reduce repetition penalty
Former-commit-id: 926291940d
|
2023-06-03 21:57:39 +08:00 |
|
hiyouga
|
4200d5d558
|
fix int8 inference
Former-commit-id: 0f69a0c19e
|
2023-06-03 21:17:47 +08:00 |
|
hiyouga
|
342ee89d28
|
add ziya prompt template
Former-commit-id: de09ee1315
|
2023-06-03 19:05:51 +08:00 |
|
hiyouga
|
c7d71dd8af
|
use low_cpu_mem_usage to speed up loading
Former-commit-id: 771f454ff1
|
2023-06-03 18:19:01 +08:00 |
|
hiyouga
|
42c9c8de39
|
add logits processor
Former-commit-id: dca27b4412
|
2023-06-03 16:34:54 +08:00 |
|
hiyouga
|
1392242958
|
remove unused code
Former-commit-id: ed6161fa6a
|
2023-06-03 00:10:54 +08:00 |
|
hiyouga
|
36790c4e32
|
tiny fix
Former-commit-id: b8a034807e
|
2023-06-02 19:02:25 +08:00 |
|
hiyouga
|
6ab22a0181
|
fix layer norm name in PPO
Former-commit-id: e3aaef7d4a
|
2023-06-02 17:30:01 +08:00 |
|
hiyouga
|
b0e9a673be
|
fix #1
Former-commit-id: bd565af370
|
2023-06-02 14:25:00 +08:00 |
|
hiyouga
|
4003ddcc3b
|
alter rewards data type
Former-commit-id: 50d9a20f81
|
2023-06-02 14:19:51 +08:00 |
|
hiyouga
|
7e1be4c21a
|
fix possibly OOM error
Former-commit-id: e6126244c1
|
2023-06-01 23:54:44 +08:00 |
|
hiyouga
|
3f3b475412
|
fix bug at inference
Former-commit-id: fd709eacff
|
2023-05-31 18:11:53 +08:00 |
|
hiyouga
|
3bfb086399
|
support BLOOM models
Former-commit-id: 740a5daf56
|
2023-05-31 16:54:06 +08:00 |
|
hoshi-hiyouga
|
1a5eacc98a
|
Merge pull request #1 from mMrBun/main
Support conversation via API.
Former-commit-id: c36620ece4
|
2023-05-30 16:34:00 +08:00 |
|
hiyouga
|
ddb456bbcb
|
remove dummy code
Former-commit-id: a72492e649
|
2023-05-30 16:28:00 +08:00 |
|
mMrBun
|
bc2e530e16
|
Support conversation via API.
Former-commit-id: 748b804bac
|
2023-05-30 15:00:28 +08:00 |
|
mMrBun
|
21ef968922
|
Support conversation via API.
Former-commit-id: e821682430
|
2023-05-30 14:46:22 +08:00 |
|
hiyouga
|
4c7c96e656
|
add pre-training script
Former-commit-id: 8ff96509fa
|
2023-05-29 21:37:22 +08:00 |
|