hiyouga
|
a31a609377
|
fix callback
Former-commit-id: 065680cd2a410d7ceab10a4a76588df43e286117
|
2023-07-15 17:18:16 +08:00 |
|
hiyouga
|
6261fb362a
|
modify code structure
Former-commit-id: 0682ed357210897e0b67c4a6eb31a94b3eb929f1
|
2023-07-15 16:54:28 +08:00 |
|
hiyouga
|
fa06b168ab
|
fix eval and pred loss
Former-commit-id: 2a5a8e0eba279de603c2d25e894b6d2921aaae55
|
2023-07-14 13:11:57 +08:00 |
|
hiyouga
|
961e6a9ba4
|
fix pretrain
Former-commit-id: 75b584875fc75c437491a5155ef310248e9b9dd4
|
2023-07-13 23:41:54 +08:00 |
|
hiyouga
|
316a02696f
|
fix Baichuan-13B
Former-commit-id: 6d9d826b3246349454c68f4d13b862da4de986e2
|
2023-07-13 23:08:45 +08:00 |
|
hoshi-hiyouga
|
d57e0a7006
|
Merge pull request #156 from ZhengJun-AI/main
Support for WebNovel dataset
Former-commit-id: f85ec349832705245d98ca869b06b1beca7192b7
|
2023-07-12 20:11:19 +08:00 |
|
zxbsmk
|
994b21b092
|
Support for WebNovel dataset
Former-commit-id: 655162f530784bc9374962c02d8b414872f83b2f
|
2023-07-12 17:29:47 +08:00 |
|
hoshi-hiyouga
|
6ef45d311a
|
Merge pull request #145 from elicassion/patch-1
Fix typo in common.py
Former-commit-id: 3b1f31941027d0b63c48cb8f68865a5993f81c74
|
2023-07-12 13:50:39 +08:00 |
|
Jinghuan Shang
|
30b2092294
|
Fix typo in common.py
lastest -> latest
Former-commit-id: a0a82a56765ffbdc2aeaa8174835494459ac914a
|
2023-07-11 18:03:53 -04:00 |
|
hiyouga
|
8de7a01887
|
fix sft encode
Former-commit-id: 2369a96a3200593421ae9afb06e08e2ac8010bb2
|
2023-07-11 19:50:33 +08:00 |
|
hiyouga
|
cc290a41e6
|
add baichuan template
Former-commit-id: 22273b3f7ce3b4c13e9e3f3677181d1a6a0f9c84
|
2023-07-11 18:57:50 +08:00 |
|
hiyouga
|
1aa0997391
|
support Baichuan-13B
Former-commit-id: f3edfe7d42d5513fb4177be61ec4f88f1edffb1e
|
2023-07-11 16:16:14 +08:00 |
|
hiyouga
|
61988225a8
|
Update README.md
Former-commit-id: 45a392d3d662f99c7cfed6d3bc6420cc98ed60c3
|
2023-07-10 23:09:11 +08:00 |
|
hiyouga
|
62e775cb75
|
Update README.md
Former-commit-id: 6b5eda6e39c1ff8df48418ca0fc734279e1e5abe
|
2023-07-09 14:57:13 +08:00 |
|
hiyouga
|
bc436066c8
|
update api to match langchain
Former-commit-id: 0016cf6525b151a6e7262967399d67033569b7eb
|
2023-07-07 20:35:39 +08:00 |
|
hiyouga
|
9da7840005
|
Update README.md
Former-commit-id: 3c2b547dab03f3675ec7224f9f558392dc81cac4
|
2023-07-07 12:06:28 +08:00 |
|
hiyouga
|
113cdaf1cb
|
support InternLM
Former-commit-id: a454ef7d57d9c06302d51464cfe39f6d0c48c5a8
|
2023-07-07 11:02:28 +08:00 |
|
hiyouga
|
601b1747d1
|
fix rouge score
Former-commit-id: b84b03a2a5ca72ce0ba71d9a9c3db1a687283fa6
|
2023-07-06 14:28:34 +08:00 |
|
hiyouga
|
e3b779fcb2
|
update readme
Former-commit-id: 38b555c1819bb1fed1c17608ed6110d9572db98c
|
2023-07-05 23:03:58 +08:00 |
|
hiyouga
|
982e76978b
|
fix streaming response in API
Former-commit-id: 72a17ae3b4fac2dc93b04a816f16f863120bc71b
|
2023-07-05 22:42:31 +08:00 |
|
hiyouga
|
d659907f34
|
fix freeze tuning
Former-commit-id: e32a1db967da02f502559df59ec6d1ab4554febf
|
2023-07-05 21:18:28 +08:00 |
|
hiyouga
|
df71d98b37
|
fix bug in PPO stage
Former-commit-id: a27373128d10b4e483d306100d91a55c0b796488
|
2023-07-05 19:14:10 +08:00 |
|
hiyouga
|
4de9ef568a
|
fix compute dtype
Former-commit-id: 5aadbb22730d19570b039462c91df443dbb34b5f
|
2023-07-05 15:13:00 +08:00 |
|
hiyouga
|
f1de82f08e
|
support falcon model #72
Former-commit-id: 72cc3ff0e6de641073de1159196319705f8efe85
|
2023-07-05 15:00:06 +08:00 |
|
hiyouga
|
4b093996a7
|
fix bleu score
Former-commit-id: 6874dce4444e6e6ce9d6125275dbf3dfdfb4fb22
|
2023-07-05 00:11:21 +08:00 |
|
hiyouga
|
e4e36a2d74
|
set use_cache before saving model
Former-commit-id: a6a0161f32f600f3001188ff4c7929c5f13c2a03
|
2023-07-04 23:18:20 +08:00 |
|
hiyouga
|
6df5c4ccef
|
fix seq2seq predictions
Former-commit-id: 045316d62f713311bdabdfb56be442238e03a007
|
2023-07-04 22:56:51 +08:00 |
|
hoshi-hiyouga
|
ecae079e56
|
Merge pull request #119 from codemayq/main
add the pre-built version of bitsandbytes library for windows user
Former-commit-id: f1f0e2277469976f8666370645a83b2665a267ec
|
2023-07-03 19:51:46 +08:00 |
|
codemayq
|
77a2b60bc6
|
add the pre-built version of bitsandbytes library for windows user
Former-commit-id: 1cb8429647972147634433e81e3d8df5bded936a
|
2023-07-03 13:58:10 +08:00 |
|
hiyouga
|
6a83f4f793
|
Update auto_gptq.py
Former-commit-id: 84ebfd87692031a6736b86dd8a35f635f14c1c97
|
2023-07-02 20:56:11 +08:00 |
|
hiyouga
|
2537481c34
|
add autogptq
Former-commit-id: 43321557c272862d9c6531fc48a4569cfc88e4e7
|
2023-07-02 20:36:37 +08:00 |
|
hiyouga
|
d720f67e6c
|
fix typo
Former-commit-id: 23c0d36abe881d9be839d5c647841bdef178307b
|
2023-06-30 10:09:59 +08:00 |
|
hiyouga
|
40ab36456c
|
Update README.md
Former-commit-id: f3891213f50ef3da84d4a06ce71ec0806a7a4866
|
2023-06-29 19:36:22 +08:00 |
|
hiyouga
|
52652257c9
|
rename evaluate.py
Former-commit-id: 29d3e62b11964914c636f27beb422b10752cd3ae
|
2023-06-29 15:40:39 +08:00 |
|
hiyouga
|
0dd38a41b6
|
Update evaluate.py
Former-commit-id: 54a48d30b768b0421e760e6eab41fdb12e78995f
|
2023-06-29 15:40:03 +08:00 |
|
hiyouga
|
f39c71b02d
|
Update README.md
Former-commit-id: 5b3f485af23e0a3804106c1cadc90ba02a8675e6
|
2023-06-29 15:37:19 +08:00 |
|
hiyouga
|
90fa2dd935
|
add open assistant dataset
Former-commit-id: 1694cf3078d04a14bce96da04b9d8c52176b1044
|
2023-06-28 23:09:33 +08:00 |
|
hiyouga
|
6290955e84
|
update loading logic
Former-commit-id: f1da17bb0deeb39a29da4dc208951d1ad69bb8ba
|
2023-06-28 12:07:16 +08:00 |
|
hiyouga
|
6b6430489a
|
fix loading best model
Former-commit-id: cf7db6855d353a57344c78d6b56478ffb14ceff2
|
2023-06-28 01:55:12 +08:00 |
|
hiyouga
|
4ae8a20e1d
|
fix RM accuracy
Former-commit-id: 532a385ea60693fdf835e6bc8e240ff8d55ff3a7
|
2023-06-28 01:40:13 +08:00 |
|
hiyouga
|
eca15bf252
|
add star history
Former-commit-id: 08f0dbcac3b72165e8993398111cc2b84006fbe5
|
2023-06-27 23:56:29 +08:00 |
|
hiyouga
|
e19dcc13e3
|
tiny fix
Former-commit-id: 994f2a79831a1dea8425e3eff62f0bc8238b78d6
|
2023-06-27 23:54:24 +08:00 |
|
hiyouga
|
2d22961c7d
|
fix initializing data arguments
Former-commit-id: e6b83c8b87cb93358086121a6f9ccaba5dfa7497
|
2023-06-27 22:50:23 +08:00 |
|
hiyouga
|
640f774d30
|
support save full model, replace BOS token
Former-commit-id: 32e56c290802ba971c08f471b94a33daec85671a
|
2023-06-27 21:40:11 +08:00 |
|
hiyouga
|
33c2b063c6
|
fix decoding in seq2seq
Former-commit-id: 44227f651bf9a6a4741b3e0845cdb5f2ab58ea63
|
2023-06-27 19:33:08 +08:00 |
|
hiyouga
|
a7e53dcfef
|
Update evaluate.py
Former-commit-id: 40201845b707c8b888b17744622ad78b4fb08a09
|
2023-06-26 23:41:33 +08:00 |
|
hiyouga
|
fe7ca5cb63
|
Create evaluate.py
Former-commit-id: 45a6742f00d17e504b54468d8eefdeb2aa877a29
|
2023-06-26 23:30:18 +08:00 |
|
hoshi-hiyouga
|
0ff82b1304
|
Merge pull request #86 from Jingsong-Yan/main
Update README.md with baichuan-7b-rtx3090
Former-commit-id: c3592307df08f088cf5b71bbc072b58c6c3b491d
|
2023-06-26 20:14:40 +08:00 |
|
Jingsong-Yan
|
d2de3f9e41
|
Update README.md with baichuan-7b-rtx3090
Add a description of the baichuan-7b-rtx3090 branch to the Changelog
Former-commit-id: acdada61a773857e27b8a930cb22eb075eff6fa4
|
2023-06-26 19:45:41 +08:00 |
|
hiyouga
|
d5260ea860
|
Merge branch 'main' of https://github.com/hiyouga/LLaMA-Efficient-Tuning
Former-commit-id: 9d0391ad8ba9e794808154c6c5700db96c664f89
|
2023-06-26 18:07:09 +08:00 |
|