Commit Graph

955 Commits

Author SHA1 Message Date
hiyouga
9e8854535b fix bleu score
Former-commit-id: cac87fd553
2023-07-05 00:11:21 +08:00
hiyouga
023f9367fc set use_cache before saving model
Former-commit-id: 395ed1cf1b
2023-07-04 23:18:20 +08:00
hiyouga
b9d56b2ac5 fix seq2seq predictions
Former-commit-id: 65e9ce2cdd
2023-07-04 22:56:51 +08:00
hoshi-hiyouga
ce622cd93e Merge pull request #119 from codemayq/main
add the pre-built version of bitsandbytes library for windows user

Former-commit-id: cb26f78923
2023-07-03 19:51:46 +08:00
codemayq
7925ffd3cd add the pre-built version of bitsandbytes library for windows user
Former-commit-id: d3b30ecde3
2023-07-03 13:58:10 +08:00
hiyouga
8a1cd612bc Update auto_gptq.py
Former-commit-id: 0db9d29111
2023-07-02 20:56:11 +08:00
hiyouga
202e8f1e02 add autogptq
Former-commit-id: cf6d57fd3e
2023-07-02 20:36:37 +08:00
hiyouga
7a0a6ec28d Update wechat.jpg
Former-commit-id: b8e1f09a2e
2023-06-30 15:45:20 +08:00
hiyouga
4ec1bff116 fix typo
Former-commit-id: 92fa515e97
2023-06-30 10:09:59 +08:00
hiyouga
b064d205c6 Update README.md
Former-commit-id: 021b035c1e
2023-06-29 19:36:22 +08:00
hiyouga
133399e905 rename evaluate.py
Former-commit-id: f14bd729a8
2023-06-29 15:40:39 +08:00
hiyouga
36c409697e Update evaluate.py
Former-commit-id: 23a7266272
2023-06-29 15:40:03 +08:00
hiyouga
457b53ec6c Update README.md
Former-commit-id: 70592035b8
2023-06-29 15:37:19 +08:00
hiyouga
92070c3d7a add open assistant dataset
Former-commit-id: 3154fec979
2023-06-28 23:09:33 +08:00
hiyouga
1ce7b5e0f3 update loading logic
Former-commit-id: 4d0fddba21
2023-06-28 12:07:16 +08:00
hiyouga
2ff577810a fix loading best model
Former-commit-id: 0a46313cca
2023-06-28 01:55:12 +08:00
hiyouga
c3cd2067b2 fix RM accuracy
Former-commit-id: 7826a8ca77
2023-06-28 01:40:13 +08:00
hiyouga
204541b56c add star history
Former-commit-id: 9cb1af71f3
2023-06-27 23:56:29 +08:00
hiyouga
c5ec4eaef5 tiny fix
Former-commit-id: 450910c1db
2023-06-27 23:54:24 +08:00
hiyouga
e877b8d55b fix initializing data arguments
Former-commit-id: 18f87c1b25
2023-06-27 22:50:23 +08:00
hiyouga
307f5866e9 support save full model, replace BOS token
Former-commit-id: 2e01abfda5
2023-06-27 21:40:11 +08:00
hiyouga
86a7b2dc5d fix decoding in seq2seq
Former-commit-id: 1c732e2537
2023-06-27 19:33:08 +08:00
hiyouga
afc98d2691 Update wechat.jpg
Former-commit-id: 33f2141507
2023-06-27 16:41:09 +08:00
hiyouga
7244f1fdcc Update evaluate.py
Former-commit-id: 4f3772b342
2023-06-26 23:41:33 +08:00
hiyouga
36666b0b77 Create evaluate.py
Former-commit-id: 5a0a9daf74
2023-06-26 23:30:18 +08:00
hoshi-hiyouga
c324f16bfc Merge pull request #86 from Jingsong-Yan/main
Update README.md with baichuan-7b-rtx3090

Former-commit-id: 907e065454
2023-06-26 20:14:40 +08:00
Jingsong-Yan
e9c1ffc5a6 Update README.md with baichuan-7b-rtx3090
Add a description of the baichuan-7b-rtx3090 branch to the Changelog

Former-commit-id: 90bb5b6f37
2023-06-26 19:45:41 +08:00
hiyouga
50bce79c13 Merge branch 'main' of https://github.com/hiyouga/LLaMA-Efficient-Tuning
Former-commit-id: 993cabdd4c
2023-06-26 18:07:09 +08:00
hiyouga
c145ca4ad6 fix generation in seq2seq.py
Former-commit-id: 1175948029
2023-06-26 18:07:06 +08:00
hoshi-hiyouga
a9beb90abc Merge pull request #84 from wu-yy/patch-1
Update requirements.txt

Former-commit-id: 95b057f5af
2023-06-26 15:39:08 +08:00
蓝鲸123
784065e16a Update requirements.txt
LlamaTokenizer requires the SentencePiece library but it was not found in your environment.

Former-commit-id: e2a16d549e
2023-06-26 15:36:19 +08:00
hiyouga
8f1d99c926 support prefixes, loading multiple local files
Former-commit-id: cec9760eb8
2023-06-26 15:32:40 +08:00
hiyouga
e4a869dc42 update api
Former-commit-id: f030b09924
2023-06-26 13:39:57 +08:00
hiyouga
194f38df8f Update wechat.jpg
Former-commit-id: d21cc71750
2023-06-25 23:41:11 +08:00
hiyouga
cf29a9af35 update readme
Former-commit-id: 0697643358
2023-06-23 00:17:05 +08:00
hiyouga
0c7eb90f6b update API
Former-commit-id: 614d3a996c
2023-06-22 20:46:24 +08:00
hiyouga
620cd2eb7e match api with OpenAI format
Former-commit-id: 76ecb8c222
2023-06-22 20:27:00 +08:00
hoshi-hiyouga
993d005242 Merge pull request #68 from mMrBun/main
Compatible with OpenAI API.

Former-commit-id: 9324940b76
2023-06-22 15:52:34 +08:00
Bun
cd066afa7b Compatible with OpenAI API.
Former-commit-id: 6e4db0903f
2023-06-21 14:45:04 +08:00
hiyouga
45b4588a3d Update wechat.jpg
Former-commit-id: ded5aa3c3d
2023-06-19 19:46:04 +08:00
hiyouga
eeb78bd75c add default template
Former-commit-id: f621f7631a
2023-06-16 21:12:17 +08:00
hiyouga
9155401bf9 add belle multiturn dataset
Former-commit-id: 334d1a6d26
2023-06-16 20:01:16 +08:00
hiyouga
653ce9397e fix freeze layers
Former-commit-id: a6c4b141cd
2023-06-16 17:38:21 +08:00
hiyouga
36ea46e85c add source prefix
Former-commit-id: fc4d8155b3
2023-06-16 16:32:17 +08:00
hiyouga
c6d56e7109 support loading lora from hub
Former-commit-id: 0574b590ef
2023-06-16 00:02:17 +08:00
hiyouga
a68808d6d9 support baichuan model
Former-commit-id: 0cee6ad67f
2023-06-15 16:02:01 +08:00
hiyouga
50494db8d6 fix bug in template vanilla
Former-commit-id: c527399424
2023-06-15 14:36:55 +08:00
hiyouga
64080d185e Update wechat.jpg
Former-commit-id: 0a36658bb6
2023-06-15 13:48:53 +08:00
hiyouga
dd1e7ed3cf add BOS token in pre-training
Former-commit-id: d668f8b501
2023-06-15 01:46:17 +08:00
hiyouga
3419396945 support multiturn training like FastChat
Former-commit-id: b6faf0207d
2023-06-14 22:27:39 +08:00