hiyouga
|
ecfc7d1b50
|
fix #1658
Former-commit-id: 77d1b14fc2d9703d15bbd879f67df037db9fbb28
|
2023-11-28 20:57:24 +08:00 |
|
hiyouga
|
ae1048db6d
|
fix #1659
Former-commit-id: 475a3fa0f4c09d4cfd55ec66271a6d3c9eb5f4d2
|
2023-11-28 20:52:28 +08:00 |
|
hiyouga
|
b015ac35d8
|
support export size setting
Former-commit-id: 859a6ea9425a09d7263f6436d05102df8129c248
|
2023-11-26 18:34:09 +08:00 |
|
hiyouga
|
5f2943dc84
|
support Yi-34B-Chat models
Former-commit-id: ff1c289229ee382d3e76578bbb6a5e299b969ded
|
2023-11-23 19:31:49 +08:00 |
|
hiyouga
|
9697c3e970
|
set version
Former-commit-id: 35c2da3eba064e16b21c20a4cde3355173d5d9fd
|
2023-11-20 22:57:44 +08:00 |
|
hiyouga
|
4966bd7911
|
support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569
Former-commit-id: 9ea93801459b0d271d21a2d730c44abae9106c51
|
2023-11-20 22:52:11 +08:00 |
|
hiyouga
|
f06c4c8f7a
|
update ppo trainer
Former-commit-id: 5021062493ed63ad1f6133cfb543e4e7f528d2cc
|
2023-11-20 21:39:15 +08:00 |
|
hoshi-hiyouga
|
d72f123851
|
Merge pull request #1553 from hannlp/hans
Change the default argument settings for PPO training
Former-commit-id: 48211e3799a16de946360930d3d92f5a40e9d12d
|
2023-11-20 20:32:55 +08:00 |
|
hiyouga
|
a7b1632ace
|
fix value head model resuming
Former-commit-id: 2a36fd5064f028f394ac07c25440fd5e965a07b8
|
2023-11-20 19:01:37 +08:00 |
|
hiyouga
|
682d81caa9
|
fix #1567
Former-commit-id: 99a3f06377d2886c4000ce7e3583b12ca965534d
|
2023-11-20 18:46:36 +08:00 |
|
hiyouga
|
32545bd6d9
|
better data streaming
Former-commit-id: 00baaa990e099d6b75436eaa7a922a07646afa26
|
2023-11-19 23:32:47 +08:00 |
|
hiyouga
|
d1e03512f4
|
fix model card network issue
Former-commit-id: 211b2db5a8290f6b52f0a076de56fcc2b06671d6
|
2023-11-19 23:03:19 +08:00 |
|
hiyouga
|
8d82d7e994
|
fix Mistral template
https://github.com/lm-sys/FastChat/pull/2547
Former-commit-id: bfb94331657f385f4653ddcb8f7b57d1c052804d
|
2023-11-19 16:29:30 +08:00 |
|
hiyouga
|
a53afb27eb
|
fix #1263
Former-commit-id: 065bfaeed490a4e03fb48a5adc0b8af4d835a767
|
2023-11-19 16:05:18 +08:00 |
|
hiyouga
|
48d6d925f7
|
fix #1558
Former-commit-id: 1740131d63d32aefc0370441baf4716ddb5ebcfe
|
2023-11-19 14:15:47 +08:00 |
|
hiyouga
|
112108d564
|
fix evaluator and cached_file in 4.31.0
Former-commit-id: ff6056405dea8e89a95fd3741fd309d3c7679896
|
2023-11-18 19:39:23 +08:00 |
|
hiyouga
|
0d98d1a28c
|
fix quantization
Former-commit-id: ccb0f58e22f55b15531fd0e85f5935b150575bec
|
2023-11-17 22:21:29 +08:00 |
|
hiyouga
|
f9df6c17ed
|
fix #1550
Former-commit-id: 1bbc1be95eedf0796c0b311568dff8c75f87dfbb
|
2023-11-17 17:23:13 +08:00 |
|
Yuchen Han
|
a419122179
|
Update workflow.py
Former-commit-id: eeb5249d0b6ce0816e1fa47afc3a853c7b267cbf
|
2023-11-17 00:16:27 -08:00 |
|
Yuchen Han
|
ec910a87c0
|
Update finetuning_args.py
Former-commit-id: b24635d22b3084ad29217ef55c1dd1fa4f85a1fb
|
2023-11-17 00:15:51 -08:00 |
|
hiyouga
|
d3c4881ccb
|
fix packages
Former-commit-id: 999bc0ed93d15b5d1082c8f706a7d17c95933d93
|
2023-11-17 16:11:48 +08:00 |
|
Shaowen Wang
|
4ea3144554
|
Fix: Change rouge-chinese package name to rouge_chinese
To reproduce:
python:
importlib.util.find_spec('rouge-chinese') -> None
importlib.util.find_spec('rouge_chinese') -> ModuleSpec(name='rouge_chinese'...)
from rouge_chinese import Rouge
print(Rouge.__module__) -> rouge_chinese
Former-commit-id: 397e9489849d80517b0c467852a7092a0a9626f2
|
2023-11-16 20:12:35 -06:00 |
|
hiyouga
|
5de45bf989
|
fix chatglm template
Former-commit-id: ed9f7705efbed0accf4dc5c9dfa9e3e7e15e1174
|
2023-11-16 22:54:15 +08:00 |
|
hiyouga
|
8454e02313
|
fix web ui demo
Former-commit-id: 10ce87e088ae5934c4c3e61e9d0556cd5378308c
|
2023-11-16 18:41:55 +08:00 |
|
hiyouga
|
be0fb659d2
|
fix web ui demo
Former-commit-id: 1c80e9a09ed9fce0809e2b893d946306df0f9f7c
|
2023-11-16 17:12:23 +08:00 |
|
hiyouga
|
11af6c1e39
|
release v0.3.0
Former-commit-id: c4facc03af20d15d5b09ec77dc3742138db68f9d
|
2023-11-16 16:00:11 +08:00 |
|
hiyouga
|
11de514cc6
|
fix css
Former-commit-id: 08f3c114292676699a1921d4395f268a54763428
|
2023-11-16 15:45:38 +08:00 |
|
hiyouga
|
3f53155a90
|
fix bug in web ui
Former-commit-id: 6efa38be46ed536f80fc67002f23862edcb9df8d
|
2023-11-16 15:21:24 +08:00 |
|
hiyouga
|
e4f97615f0
|
update ppo and demo in webui
Former-commit-id: 7537dd434f4c0f0bde06bd8c2ac69bf622772316
|
2023-11-16 14:55:26 +08:00 |
|
hiyouga
|
0ed0b8f9c5
|
fix bug in freeze tuning
Former-commit-id: ff52b1779c909819d0aef83d3f7ea663199cbe54
|
2023-11-16 14:25:11 +08:00 |
|
hiyouga
|
627212e48b
|
tiny fix
Former-commit-id: 83cee2a6049b8287de1b5ebf41b2a0728e235b11
|
2023-11-16 03:27:19 +08:00 |
|
hiyouga
|
678052a7ef
|
fix rlhf callback
Former-commit-id: 1817ffc86fe3463ea91e9359c0e3611979a9d53e
|
2023-11-16 03:26:19 +08:00 |
|
hiyouga
|
b71da932eb
|
fix bug in PPO training
Former-commit-id: 856522a3df4bb9ddfaaa137119eceb9574873950
|
2023-11-16 02:32:54 +08:00 |
|
hiyouga
|
eb5a852dd5
|
fix import bug
Former-commit-id: 35b91ea34caade45dd51813b94da5177b852aa4c
|
2023-11-16 02:27:03 +08:00 |
|
hiyouga
|
f441932bd1
|
support full-parameter PPO
Former-commit-id: ce783036001397a20b0b4c5da2fea6d0c03389d2
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
0c1fab84f1
|
add demo mode for web UI
Former-commit-id: 8350bcf85d5e59b63da46b540c6ad860e8419d9e
|
2023-11-15 23:51:26 +08:00 |
|
hiyouga
|
3e0b76650a
|
update readme and constants
Former-commit-id: 1e19cf242a1f843b590feefbe24b2cc0a17712b5
|
2023-11-15 18:04:37 +08:00 |
|
hiyouga
|
e30290444a
|
support multiple modules in freeze training #1514
Former-commit-id: 4907452d955367ebe987e6deae4fd4213628f2b2
|
2023-11-15 17:08:18 +08:00 |
|
hiyouga
|
4a0be64ae6
|
fix imports
Former-commit-id: bbbce1f516840f722247edd37057d16502ea0557
|
2023-11-15 16:47:45 +08:00 |
|
hiyouga
|
06a4820836
|
disentangle model from tuner and rename modules
Former-commit-id: 4736344eb1595ee023a50d49e8118f4eee46305f
|
2023-11-15 16:29:09 +08:00 |
|
hiyouga
|
8ee48a9c9e
|
fix #1507
Former-commit-id: 2f02f688e15db8bf8f0dad3b7284863998c207a3
|
2023-11-15 16:22:32 +08:00 |
|
hiyouga
|
fffb8ea764
|
add cal_lr.py
Former-commit-id: 42c8fc4fb970775159a68a123d5c7bedb701c8cf
|
2023-11-14 20:58:37 +08:00 |
|
hiyouga
|
8387f3011c
|
fix #1494
Former-commit-id: d125ef55358837d4d76943739afeb6c70a901cd7
|
2023-11-14 18:07:20 +08:00 |
|
hiyouga
|
9176b55fe6
|
fix #1489
Former-commit-id: 3743b7420b823c5a56aef6100121d89b677ffb0e
|
2023-11-14 15:27:05 +08:00 |
|
hiyouga
|
5c4ddebde5
|
support eval remote dataset
Former-commit-id: 2d42be32c1b32b26548ea5af5fc3c810f4d668c1
|
2023-11-14 02:42:30 +08:00 |
|
hiyouga
|
4a767e5593
|
release v0.2.2, fix #1478 #1466
Former-commit-id: 35cc1e28f675889c44f75a0a3194005c7f23631b
|
2023-11-13 23:09:05 +08:00 |
|
hiyouga
|
37db26800c
|
fix #424
Former-commit-id: 87390ae3b70f654d520b9aadb335c9650130a42c
|
2023-11-13 22:42:23 +08:00 |
|
hiyouga
|
125587b187
|
refactor evaluation, upgrade trl to 074
Former-commit-id: 442aefb925c4ff02b98aa30c49c2e01d04f6496a
|
2023-11-13 22:20:35 +08:00 |
|
hiyouga
|
982e0e79c2
|
fix flashattn warning
Former-commit-id: 4bd8e3906d09bf6ec4b8f6b553a347fca9db4f80
|
2023-11-10 18:34:54 +08:00 |
|
hiyouga
|
55e097aaac
|
add todo
Former-commit-id: a0c31c68c4909637b86c90c319c321fd887c4910
|
2023-11-10 14:38:18 +08:00 |
|