Commit Graph

2127 Commits

Author SHA1 Message Date
hiyouga
2b0fced03b fix Baichuan-13B
Former-commit-id: 08439d29b2
2023-07-13 23:08:45 +08:00
hoshi-hiyouga
cecdf886cb Merge pull request #156 from ZhengJun-AI/main
Support for WebNovel dataset

Former-commit-id: 8cd76ef3c3
2023-07-12 20:11:19 +08:00
zxbsmk
3b15aacf02 Support for WebNovel dataset
Former-commit-id: 4955dc9eed
2023-07-12 17:29:47 +08:00
hoshi-hiyouga
9c319f3d49 Merge pull request #145 from elicassion/patch-1
Fix typo in common.py

Former-commit-id: 894f13e41f
2023-07-12 13:50:39 +08:00
Jinghuan Shang
d42d175670 Fix typo in common.py
lastest -> latest

Former-commit-id: dc1e8b7181
2023-07-11 18:03:53 -04:00
hiyouga
925a026853 fix sft encode
Former-commit-id: b2f7cb4465
2023-07-11 19:50:33 +08:00
hiyouga
685ba1e02e add baichuan template
Former-commit-id: 1af031c02b
2023-07-11 18:57:50 +08:00
hiyouga
507726b2f0 support Baichuan-13B
Former-commit-id: f936a7af0b
2023-07-11 16:16:14 +08:00
hiyouga
804cc07db0 Update README.md
Former-commit-id: 8447206bbc
2023-07-10 23:09:11 +08:00
hiyouga
53b8e444b8 Update wechat.jpg
Former-commit-id: 061c324972
2023-07-10 18:41:53 +08:00
hiyouga
e5c7681146 Update README.md
Former-commit-id: 4182c7aa8b
2023-07-09 14:57:13 +08:00
hiyouga
c812429011 update api to match langchain
Former-commit-id: 84a06318d4
2023-07-07 20:35:39 +08:00
hiyouga
28ec96bf19 Update README.md
Former-commit-id: 233f20864b
2023-07-07 12:06:28 +08:00
hiyouga
8b4ef062b7 support InternLM
Former-commit-id: a2f507c562
2023-07-07 11:02:28 +08:00
hiyouga
50a27509a3 fix rouge score
Former-commit-id: caa00d3ac2
2023-07-06 14:28:34 +08:00
hiyouga
28c1381671 update readme
Former-commit-id: 89c623e4bf
2023-07-05 23:03:58 +08:00
hiyouga
22fa516bd5 fix streaming response in API
Former-commit-id: 4abd2485e1
2023-07-05 22:42:31 +08:00
hiyouga
bcd7493212 fix freeze tuning
Former-commit-id: e6603977f6
2023-07-05 21:18:28 +08:00
hiyouga
265dc1b6a0 fix bug in PPO stage
Former-commit-id: a2ba69183b
2023-07-05 19:14:10 +08:00
hiyouga
7ba52f5b6e fix compute dtype
Former-commit-id: 8e3540c62d
2023-07-05 15:13:00 +08:00
hiyouga
c87910ada3 support falcon model #72
Former-commit-id: c136f362c1
2023-07-05 15:00:06 +08:00
hiyouga
827ff46008 Update wechat.jpg
Former-commit-id: 966b5c70fc
2023-07-05 00:22:22 +08:00
hiyouga
9e8854535b fix bleu score
Former-commit-id: cac87fd553
2023-07-05 00:11:21 +08:00
hiyouga
023f9367fc set use_cache before saving model
Former-commit-id: 395ed1cf1b
2023-07-04 23:18:20 +08:00
hiyouga
b9d56b2ac5 fix seq2seq predictions
Former-commit-id: 65e9ce2cdd
2023-07-04 22:56:51 +08:00
hoshi-hiyouga
ce622cd93e Merge pull request #119 from codemayq/main
add the pre-built version of bitsandbytes library for windows user

Former-commit-id: cb26f78923
2023-07-03 19:51:46 +08:00
codemayq
7925ffd3cd add the pre-built version of bitsandbytes library for windows user
Former-commit-id: d3b30ecde3
2023-07-03 13:58:10 +08:00
hiyouga
8a1cd612bc Update auto_gptq.py
Former-commit-id: 0db9d29111
2023-07-02 20:56:11 +08:00
hiyouga
202e8f1e02 add autogptq
Former-commit-id: cf6d57fd3e
2023-07-02 20:36:37 +08:00
hiyouga
7a0a6ec28d Update wechat.jpg
Former-commit-id: b8e1f09a2e
2023-06-30 15:45:20 +08:00
hiyouga
4ec1bff116 fix typo
Former-commit-id: 92fa515e97
2023-06-30 10:09:59 +08:00
hiyouga
b064d205c6 Update README.md
Former-commit-id: 021b035c1e
2023-06-29 19:36:22 +08:00
hiyouga
133399e905 rename evaluate.py
Former-commit-id: f14bd729a8
2023-06-29 15:40:39 +08:00
hiyouga
36c409697e Update evaluate.py
Former-commit-id: 23a7266272
2023-06-29 15:40:03 +08:00
hiyouga
457b53ec6c Update README.md
Former-commit-id: 70592035b8
2023-06-29 15:37:19 +08:00
hiyouga
92070c3d7a add open assistant dataset
Former-commit-id: 3154fec979
2023-06-28 23:09:33 +08:00
hiyouga
1ce7b5e0f3 update loading logic
Former-commit-id: 4d0fddba21
2023-06-28 12:07:16 +08:00
hiyouga
2ff577810a fix loading best model
Former-commit-id: 0a46313cca
2023-06-28 01:55:12 +08:00
hiyouga
c3cd2067b2 fix RM accuracy
Former-commit-id: 7826a8ca77
2023-06-28 01:40:13 +08:00
hiyouga
204541b56c add star history
Former-commit-id: 9cb1af71f3
2023-06-27 23:56:29 +08:00
hiyouga
c5ec4eaef5 tiny fix
Former-commit-id: 450910c1db
2023-06-27 23:54:24 +08:00
hiyouga
e877b8d55b fix initializing data arguments
Former-commit-id: 18f87c1b25
2023-06-27 22:50:23 +08:00
hiyouga
307f5866e9 support save full model, replace BOS token
Former-commit-id: 2e01abfda5
2023-06-27 21:40:11 +08:00
hiyouga
86a7b2dc5d fix decoding in seq2seq
Former-commit-id: 1c732e2537
2023-06-27 19:33:08 +08:00
hiyouga
afc98d2691 Update wechat.jpg
Former-commit-id: 33f2141507
2023-06-27 16:41:09 +08:00
hiyouga
7244f1fdcc Update evaluate.py
Former-commit-id: 4f3772b342
2023-06-26 23:41:33 +08:00
hiyouga
36666b0b77 Create evaluate.py
Former-commit-id: 5a0a9daf74
2023-06-26 23:30:18 +08:00
hoshi-hiyouga
c324f16bfc Merge pull request #86 from Jingsong-Yan/main
Update README.md with baichuan-7b-rtx3090

Former-commit-id: 907e065454
2023-06-26 20:14:40 +08:00
Jingsong-Yan
e9c1ffc5a6 Update README.md with baichuan-7b-rtx3090
在 Changelog 中新增 baichuan-7b-rtx3090 分支的描述

Former-commit-id: 90bb5b6f37
2023-06-26 19:45:41 +08:00
hiyouga
50bce79c13 Merge branch 'main' of https://github.com/hiyouga/LLaMA-Efficient-Tuning
Former-commit-id: 993cabdd4c
2023-06-26 18:07:09 +08:00