hiyouga
|
c4f1d98a1c
|
fix saving custom code
Former-commit-id: 1e1358431dde1ed774b0e1e48760ca9f0db685ef
|
2023-07-16 18:04:41 +08:00 |
|
hiyouga
|
fd023ad416
|
add custom baichuan-13B code supports left-padding
Former-commit-id: 2c867b9bb1c50cc42acee326f49ceddca0492a55
v0.0.9
|
2023-07-15 22:37:17 +08:00 |
|
hiyouga
|
20e4bf3b5b
|
fix callback
Former-commit-id: 552d773dad0a8e6bcfdd4f4f587e1fe17edcd7b3
|
2023-07-15 22:01:43 +08:00 |
|
hiyouga
|
22e979c482
|
update stream_chat
Former-commit-id: 8528a84e7486351af6b53db91649fa5772a99d13
|
2023-07-15 19:51:02 +08:00 |
|
hiyouga
|
b8b38a9ade
|
create chat model
Former-commit-id: 657cf0f55a7f0886bc837bdd44528971dc5e5caa
|
2023-07-15 19:26:20 +08:00 |
|
hiyouga
|
8ba0996a53
|
Update callbacks.py
Former-commit-id: d640c5545f6b2fcf7840df2b35a80f49a2de4ba5
|
2023-07-15 17:39:16 +08:00 |
|
hiyouga
|
7c0f2ab0a4
|
Update README.md
Former-commit-id: 1e2b7e0c4bc2cbd8d990cef9a85ccd16b971fb84
|
2023-07-15 17:20:39 +08:00 |
|
hiyouga
|
70b5232f9a
|
fix callback
Former-commit-id: 22d9a9c2af6674eb832ae4aee80d679f19b7006f
|
2023-07-15 17:18:16 +08:00 |
|
hiyouga
|
a696148d6b
|
modity code structure
Former-commit-id: f75137661358f9070bc70c341dfa2cc5fd69cf94
|
2023-07-15 16:54:28 +08:00 |
|
hiyouga
|
09b52a3078
|
Update wechat.jpg
Former-commit-id: 2a0f1f8398bd771982d41eb53dd38a0dc8087f05
|
2023-07-14 17:29:43 +08:00 |
|
hiyouga
|
7ea072536b
|
fix eval and pred loss
Former-commit-id: c30db9f1f0db5a6a660cdc60016755241762aae7
|
2023-07-14 13:11:57 +08:00 |
|
hiyouga
|
32ec440d2e
|
fix pretrain
Former-commit-id: a04115ec278e646347a6fec12f562494abd66e51
|
2023-07-13 23:41:54 +08:00 |
|
hiyouga
|
2b0fced03b
|
fix Baichuan-13B
Former-commit-id: 08439d29b2031ffbe77fe581c148e6d94e68bfc4
|
2023-07-13 23:08:45 +08:00 |
|
hoshi-hiyouga
|
cecdf886cb
|
Merge pull request #156 from ZhengJun-AI/main
Support for WebNovel dataset
Former-commit-id: 8cd76ef3c3ac4b92d2ea9b605251a3abbaff90aa
|
2023-07-12 20:11:19 +08:00 |
|
zxbsmk
|
3b15aacf02
|
Support for WebNovel dataset
Former-commit-id: 4955dc9eed33a904e4e2b9d5985b3fda87c3674a
|
2023-07-12 17:29:47 +08:00 |
|
hoshi-hiyouga
|
9c319f3d49
|
Merge pull request #145 from elicassion/patch-1
Fix typo in common.py
Former-commit-id: 894f13e41f31ff49704ee2f633899f9c88035a9c
|
2023-07-12 13:50:39 +08:00 |
|
Jinghuan Shang
|
d42d175670
|
Fix typo in common.py
lastest -> latest
Former-commit-id: dc1e8b7181ba86f028dc22bea9d3c419814c1e19
|
2023-07-11 18:03:53 -04:00 |
|
hiyouga
|
925a026853
|
fix sft encode
Former-commit-id: b2f7cb446591e3722b5be8d250ddfe0caa226384
|
2023-07-11 19:50:33 +08:00 |
|
hiyouga
|
685ba1e02e
|
add baichuan template
Former-commit-id: 1af031c02b3871341cc4d719b5d62cfd02bc126a
|
2023-07-11 18:57:50 +08:00 |
|
hiyouga
|
507726b2f0
|
support Baichuan-13B
Former-commit-id: f936a7af0be98cd886ce9a0611a744033818e20f
|
2023-07-11 16:16:14 +08:00 |
|
hiyouga
|
804cc07db0
|
Update README.md
Former-commit-id: 8447206bbc2ca579e1fc57d6bda76180c7f89c86
|
2023-07-10 23:09:11 +08:00 |
|
hiyouga
|
53b8e444b8
|
Update wechat.jpg
Former-commit-id: 061c324972ac8e20da1445a70332cdd0e347425e
|
2023-07-10 18:41:53 +08:00 |
|
hiyouga
|
e5c7681146
|
Update README.md
Former-commit-id: 4182c7aa8b08c80aac0f7a4fc03164c46d8f322c
|
2023-07-09 14:57:13 +08:00 |
|
hiyouga
|
c812429011
|
update api to match langchain
Former-commit-id: 84a06318d40fb595f3aa6d1141c107ef7710376c
|
2023-07-07 20:35:39 +08:00 |
|
hiyouga
|
28ec96bf19
|
Update README.md
Former-commit-id: 233f20864b69b6483b8fc460df036230c3cbed7d
|
2023-07-07 12:06:28 +08:00 |
|
hiyouga
|
8b4ef062b7
|
support InternLM
Former-commit-id: a2f507c56238d7fb2670edbab52d6b275f245e27
|
2023-07-07 11:02:28 +08:00 |
|
hiyouga
|
50a27509a3
|
fix rouge score
Former-commit-id: caa00d3ac24c989d56887af9d7ca79244e72714a
|
2023-07-06 14:28:34 +08:00 |
|
hiyouga
|
28c1381671
|
update readme
Former-commit-id: 89c623e4bfef760174f013c5de27810cc90e0f6e
|
2023-07-05 23:03:58 +08:00 |
|
hiyouga
|
22fa516bd5
|
fix streaming response in API
Former-commit-id: 4abd2485e10e7198b63a396ce5432ac89c9f2d98
|
2023-07-05 22:42:31 +08:00 |
|
hiyouga
|
bcd7493212
|
fix freeze tuning
Former-commit-id: e6603977f695b71f6a36640256f6653ff513d783
|
2023-07-05 21:18:28 +08:00 |
|
hiyouga
|
265dc1b6a0
|
fix bug in PPO stage
Former-commit-id: a2ba69183b8e72c09242317a34545ab966ea8991
|
2023-07-05 19:14:10 +08:00 |
|
hiyouga
|
7ba52f5b6e
|
fix compute dtype
Former-commit-id: 8e3540c62d748dcecbcb3a29d57abe665e7b31c8
|
2023-07-05 15:13:00 +08:00 |
|
hiyouga
|
c87910ada3
|
support falcon model #72
Former-commit-id: c136f362c1aa75d3374b151188ba4a55d9313a59
|
2023-07-05 15:00:06 +08:00 |
|
hiyouga
|
827ff46008
|
Update wechat.jpg
Former-commit-id: 966b5c70fc88936dd05f667975913577cfa7f2f2
|
2023-07-05 00:22:22 +08:00 |
|
hiyouga
|
9e8854535b
|
fix bleu score
Former-commit-id: cac87fd553500e6fc76430909a469e8b037a46ef
|
2023-07-05 00:11:21 +08:00 |
|
hiyouga
|
023f9367fc
|
set use_cache before saving model
Former-commit-id: 395ed1cf1b811a21fe61dd6b57040653ce98ba49
|
2023-07-04 23:18:20 +08:00 |
|
hiyouga
|
b9d56b2ac5
|
fix seq2seq predictions
Former-commit-id: 65e9ce2cdda3924170774f7b3c6e3fbdbdd87b7f
|
2023-07-04 22:56:51 +08:00 |
|
hoshi-hiyouga
|
ce622cd93e
|
Merge pull request #119 from codemayq/main
add the pre-built version of bitsandbytes library for windows user
Former-commit-id: cb26f7892384d7209b7388f0cb39cc173de84998
|
2023-07-03 19:51:46 +08:00 |
|
codemayq
|
7925ffd3cd
|
add the pre-built version of bitsandbytes library for windows user
Former-commit-id: d3b30ecde31ef36c9cb1326666996aee838a55d2
|
2023-07-03 13:58:10 +08:00 |
|
hiyouga
|
8a1cd612bc
|
Update auto_gptq.py
Former-commit-id: 0db9d2911192194878ef4665b2471a5752b64c65
|
2023-07-02 20:56:11 +08:00 |
|
hiyouga
|
202e8f1e02
|
add autogptq
Former-commit-id: cf6d57fd3ee213276e805813a987c4d40395a38c
|
2023-07-02 20:36:37 +08:00 |
|
hiyouga
|
7a0a6ec28d
|
Update wechat.jpg
Former-commit-id: b8e1f09a2e41cf103f7e3d780709a5d9c93caec5
|
2023-06-30 15:45:20 +08:00 |
|
hiyouga
|
4ec1bff116
|
fix typo
Former-commit-id: 92fa515e977889bf1003f174785fe600db14f70e
|
2023-06-30 10:09:59 +08:00 |
|
hiyouga
|
b064d205c6
|
Update README.md
Former-commit-id: 021b035c1e9b1627c26e80229c841f28edc6ab51
|
2023-06-29 19:36:22 +08:00 |
|
hiyouga
|
133399e905
|
rename evaluate.py
Former-commit-id: f14bd729a850795ac45247e728c8b517509674e7
|
2023-06-29 15:40:39 +08:00 |
|
hiyouga
|
36c409697e
|
Update evaluate.py
Former-commit-id: 23a7266272a79310d5b20c2f9653c5cd16f40a53
|
2023-06-29 15:40:03 +08:00 |
|
hiyouga
|
457b53ec6c
|
Update README.md
Former-commit-id: 70592035b81f4f6f682986fa550bf2f08ac5ec77
|
2023-06-29 15:37:19 +08:00 |
|
hiyouga
|
92070c3d7a
|
add open assistant dataset
Former-commit-id: 3154fec979aba48f54b7afde3740c4990d445a41
|
2023-06-28 23:09:33 +08:00 |
|
hiyouga
|
1ce7b5e0f3
|
update loading logic
Former-commit-id: 4d0fddba213beaa55146b047a78963d1d18185a1
|
2023-06-28 12:07:16 +08:00 |
|
hiyouga
|
2ff577810a
|
fix loading best model
Former-commit-id: 0a46313ccaee91b51bec9f9f92e3111a4a04ce2e
|
2023-06-28 01:55:12 +08:00 |
|