2773 Commits

Author SHA1 Message Date
hiyouga
a31a609377 fix callback
Former-commit-id: 065680cd2a410d7ceab10a4a76588df43e286117
2023-07-15 17:18:16 +08:00
hiyouga
6261fb362a modify code structure
Former-commit-id: 0682ed357210897e0b67c4a6eb31a94b3eb929f1
2023-07-15 16:54:28 +08:00
hiyouga
fa06b168ab fix eval and pred loss
Former-commit-id: 2a5a8e0eba279de603c2d25e894b6d2921aaae55
2023-07-14 13:11:57 +08:00
hiyouga
961e6a9ba4 fix pretrain
Former-commit-id: 75b584875fc75c437491a5155ef310248e9b9dd4
2023-07-13 23:41:54 +08:00
hiyouga
316a02696f fix Baichuan-13B
Former-commit-id: 6d9d826b3246349454c68f4d13b862da4de986e2
2023-07-13 23:08:45 +08:00
hoshi-hiyouga
d57e0a7006 Merge pull request #156 from ZhengJun-AI/main
Support for WebNovel dataset

Former-commit-id: f85ec349832705245d98ca869b06b1beca7192b7
2023-07-12 20:11:19 +08:00
zxbsmk
994b21b092 Support for WebNovel dataset
Former-commit-id: 655162f530784bc9374962c02d8b414872f83b2f
2023-07-12 17:29:47 +08:00
hoshi-hiyouga
6ef45d311a Merge pull request #145 from elicassion/patch-1
Fix typo in common.py

Former-commit-id: 3b1f31941027d0b63c48cb8f68865a5993f81c74
2023-07-12 13:50:39 +08:00
Jinghuan Shang
30b2092294 Fix typo in common.py
lastest -> latest

Former-commit-id: a0a82a56765ffbdc2aeaa8174835494459ac914a
2023-07-11 18:03:53 -04:00
hiyouga
8de7a01887 fix sft encode
Former-commit-id: 2369a96a3200593421ae9afb06e08e2ac8010bb2
2023-07-11 19:50:33 +08:00
hiyouga
cc290a41e6 add baichuan template
Former-commit-id: 22273b3f7ce3b4c13e9e3f3677181d1a6a0f9c84
2023-07-11 18:57:50 +08:00
hiyouga
1aa0997391 support Baichuan-13B
Former-commit-id: f3edfe7d42d5513fb4177be61ec4f88f1edffb1e
2023-07-11 16:16:14 +08:00
hiyouga
61988225a8 Update README.md
Former-commit-id: 45a392d3d662f99c7cfed6d3bc6420cc98ed60c3
2023-07-10 23:09:11 +08:00
hiyouga
62e775cb75 Update README.md
Former-commit-id: 6b5eda6e39c1ff8df48418ca0fc734279e1e5abe
2023-07-09 14:57:13 +08:00
hiyouga
bc436066c8 update api to match langchain
Former-commit-id: 0016cf6525b151a6e7262967399d67033569b7eb
2023-07-07 20:35:39 +08:00
hiyouga
9da7840005 Update README.md
Former-commit-id: 3c2b547dab03f3675ec7224f9f558392dc81cac4
2023-07-07 12:06:28 +08:00
hiyouga
113cdaf1cb support InternLM
Former-commit-id: a454ef7d57d9c06302d51464cfe39f6d0c48c5a8
2023-07-07 11:02:28 +08:00
hiyouga
601b1747d1 fix rouge score
Former-commit-id: b84b03a2a5ca72ce0ba71d9a9c3db1a687283fa6
2023-07-06 14:28:34 +08:00
hiyouga
e3b779fcb2 update readme
Former-commit-id: 38b555c1819bb1fed1c17608ed6110d9572db98c
2023-07-05 23:03:58 +08:00
hiyouga
982e76978b fix streaming response in API
Former-commit-id: 72a17ae3b4fac2dc93b04a816f16f863120bc71b
2023-07-05 22:42:31 +08:00
hiyouga
d659907f34 fix freeze tuning
Former-commit-id: e32a1db967da02f502559df59ec6d1ab4554febf
2023-07-05 21:18:28 +08:00
hiyouga
df71d98b37 fix bug in PPO stage
Former-commit-id: a27373128d10b4e483d306100d91a55c0b796488
2023-07-05 19:14:10 +08:00
hiyouga
4de9ef568a fix compute dtype
Former-commit-id: 5aadbb22730d19570b039462c91df443dbb34b5f
2023-07-05 15:13:00 +08:00
hiyouga
f1de82f08e support falcon model #72
Former-commit-id: 72cc3ff0e6de641073de1159196319705f8efe85
2023-07-05 15:00:06 +08:00
hiyouga
4b093996a7 fix bleu score
Former-commit-id: 6874dce4444e6e6ce9d6125275dbf3dfdfb4fb22
2023-07-05 00:11:21 +08:00
hiyouga
e4e36a2d74 set use_cache before saving model
Former-commit-id: a6a0161f32f600f3001188ff4c7929c5f13c2a03
2023-07-04 23:18:20 +08:00
hiyouga
6df5c4ccef fix seq2seq predictions
Former-commit-id: 045316d62f713311bdabdfb56be442238e03a007
2023-07-04 22:56:51 +08:00
hoshi-hiyouga
ecae079e56 Merge pull request #119 from codemayq/main
add the pre-built version of bitsandbytes library for windows user

Former-commit-id: f1f0e2277469976f8666370645a83b2665a267ec
2023-07-03 19:51:46 +08:00
codemayq
77a2b60bc6 add the pre-built version of bitsandbytes library for windows user
Former-commit-id: 1cb8429647972147634433e81e3d8df5bded936a
2023-07-03 13:58:10 +08:00
hiyouga
6a83f4f793 Update auto_gptq.py
Former-commit-id: 84ebfd87692031a6736b86dd8a35f635f14c1c97
2023-07-02 20:56:11 +08:00
hiyouga
2537481c34 add autogptq
Former-commit-id: 43321557c272862d9c6531fc48a4569cfc88e4e7
2023-07-02 20:36:37 +08:00
hiyouga
d720f67e6c fix typo
Former-commit-id: 23c0d36abe881d9be839d5c647841bdef178307b
2023-06-30 10:09:59 +08:00
hiyouga
40ab36456c Update README.md
Former-commit-id: f3891213f50ef3da84d4a06ce71ec0806a7a4866
2023-06-29 19:36:22 +08:00
hiyouga
52652257c9 rename evaluate.py
Former-commit-id: 29d3e62b11964914c636f27beb422b10752cd3ae
2023-06-29 15:40:39 +08:00
hiyouga
0dd38a41b6 Update evaluate.py
Former-commit-id: 54a48d30b768b0421e760e6eab41fdb12e78995f
2023-06-29 15:40:03 +08:00
hiyouga
f39c71b02d Update README.md
Former-commit-id: 5b3f485af23e0a3804106c1cadc90ba02a8675e6
2023-06-29 15:37:19 +08:00
hiyouga
90fa2dd935 add open assistant dataset
Former-commit-id: 1694cf3078d04a14bce96da04b9d8c52176b1044
2023-06-28 23:09:33 +08:00
hiyouga
6290955e84 update loading logic
Former-commit-id: f1da17bb0deeb39a29da4dc208951d1ad69bb8ba
2023-06-28 12:07:16 +08:00
hiyouga
6b6430489a fix loading best model
Former-commit-id: cf7db6855d353a57344c78d6b56478ffb14ceff2
2023-06-28 01:55:12 +08:00
hiyouga
4ae8a20e1d fix RM accuracy
Former-commit-id: 532a385ea60693fdf835e6bc8e240ff8d55ff3a7
2023-06-28 01:40:13 +08:00
hiyouga
eca15bf252 add star history
Former-commit-id: 08f0dbcac3b72165e8993398111cc2b84006fbe5
2023-06-27 23:56:29 +08:00
hiyouga
e19dcc13e3 tiny fix
Former-commit-id: 994f2a79831a1dea8425e3eff62f0bc8238b78d6
2023-06-27 23:54:24 +08:00
hiyouga
2d22961c7d fix initializing data arguments
Former-commit-id: e6b83c8b87cb93358086121a6f9ccaba5dfa7497
2023-06-27 22:50:23 +08:00
hiyouga
640f774d30 support save full model, replace BOS token
Former-commit-id: 32e56c290802ba971c08f471b94a33daec85671a
2023-06-27 21:40:11 +08:00
hiyouga
33c2b063c6 fix decoding in seq2seq
Former-commit-id: 44227f651bf9a6a4741b3e0845cdb5f2ab58ea63
2023-06-27 19:33:08 +08:00
hiyouga
a7e53dcfef Update evaluate.py
Former-commit-id: 40201845b707c8b888b17744622ad78b4fb08a09
2023-06-26 23:41:33 +08:00
hiyouga
fe7ca5cb63 Create evaluate.py
Former-commit-id: 45a6742f00d17e504b54468d8eefdeb2aa877a29
2023-06-26 23:30:18 +08:00
hoshi-hiyouga
0ff82b1304 Merge pull request #86 from Jingsong-Yan/main
Update README.md with baichuan-7b-rtx3090

Former-commit-id: c3592307df08f088cf5b71bbc072b58c6c3b491d
2023-06-26 20:14:40 +08:00
Jingsong-Yan
d2de3f9e41 Update README.md with baichuan-7b-rtx3090
Add a description of the baichuan-7b-rtx3090 branch to the Changelog

Former-commit-id: acdada61a773857e27b8a930cb22eb075eff6fa4
2023-06-26 19:45:41 +08:00
hiyouga
d5260ea860 Merge branch 'main' of https://github.com/hiyouga/LLaMA-Efficient-Tuning
Former-commit-id: 9d0391ad8ba9e794808154c6c5700db96c664f89
2023-06-26 18:07:09 +08:00