hiyouga
|
7f54008d3c
|
update readme
Former-commit-id: 561481a8008fde5a3273558460193864a09866ed
v0.3.2
|
2023-11-21 13:15:46 +08:00 |
|
hiyouga
|
5f5959bc33
|
set version
Former-commit-id: 6b47ad74c7b3099f9b5087c73db4aee42c451297
|
2023-11-20 22:57:44 +08:00 |
|
hiyouga
|
0105cd48f2
|
support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569
Former-commit-id: fdccc6cc9b68890199e9250cabdb996ff2f853b9
|
2023-11-20 22:52:11 +08:00 |
|
hiyouga
|
28258aecd2
|
update ppo trainer
Former-commit-id: caa525a5c6f228b9ad71387d1fe4f1c2ffa2479e
|
2023-11-20 21:39:15 +08:00 |
|
hoshi-hiyouga
|
e585950c54
|
Merge pull request #1553 from hannlp/hans
Change the default argument settings for PPO training
Former-commit-id: 1b64678fa4979485f67c3bb1420dfdff6fcbc6e7
|
2023-11-20 20:32:55 +08:00 |
|
hiyouga
|
bcd661afa6
|
fix value head model resuming
Former-commit-id: ccf0b65d886c09c7c49977c43b0544fe1bfcc258
|
2023-11-20 19:01:37 +08:00 |
|
hiyouga
|
adf2730d1d
|
fix #1567
Former-commit-id: 8c01ffe8d277d49a413571e0669f460c8d0802bf
|
2023-11-20 18:46:36 +08:00 |
|
hiyouga
|
ba2be6371d
|
better data streaming
Former-commit-id: 65ac8e84fd6f22255c587b20382fdf5d8131d015
|
2023-11-19 23:32:47 +08:00 |
|
hiyouga
|
d2ff09a404
|
fix model card network issue
Former-commit-id: 36155cd1893bea036f15c648c06b0047c02dfb4f
|
2023-11-19 23:03:19 +08:00 |
|
hiyouga
|
9f364d3880
|
fix Mistral template
https://github.com/lm-sys/FastChat/pull/2547
Former-commit-id: d426ecdf6e95402fc36893f7e4f17f881e1b957b
|
2023-11-19 16:29:30 +08:00 |
|
hiyouga
|
cfad41b901
|
fix #1263
Former-commit-id: faff5d32621f187ebd3124d7ade04e3fa437c53e
|
2023-11-19 16:05:18 +08:00 |
|
hiyouga
|
6889f044fb
|
fix #1558
Former-commit-id: 263b2b24c8a649b51fa5ae768a24e67def8e0e96
|
2023-11-19 14:15:47 +08:00 |
|
hiyouga
|
3d1ee27ccd
|
fix evaluator and cached_file in 4.31.0
Former-commit-id: 970897da402f604220d45084d492de4dab809ba4
|
2023-11-18 19:39:23 +08:00 |
|
hiyouga
|
775ce62950
|
update benchmark
Former-commit-id: 1cd2ae910e3ffca92978772d000de6fde2f6bb13
|
2023-11-18 11:30:01 +08:00 |
|
hiyouga
|
821a6f2fa6
|
update readme
Former-commit-id: a4d86a4bea1cce2219a54def9dfd3fd732d48e72
|
2023-11-18 11:15:56 +08:00 |
|
hiyouga
|
5197fb2fad
|
add benchmark
Former-commit-id: 85a09cb649be740a47359371499d821ee0d5c81e
|
2023-11-18 11:09:52 +08:00 |
|
hiyouga
|
92abe91d22
|
update dataset
Former-commit-id: a310b22b446118d90dd73906847ed3d01a574b50
|
2023-11-17 23:19:12 +08:00 |
|
hiyouga
|
a7bf0b85d7
|
fix quantization
Former-commit-id: 8268aefe8fba268065e24ffe159a9c49f7c6f3a5
|
2023-11-17 22:21:29 +08:00 |
|
hiyouga
|
5ce5ea84a9
|
fix #1550
Former-commit-id: c12acd21a5a500892ed739c79327ccd39fddad5b
|
2023-11-17 17:23:13 +08:00 |
|
Yuchen Han
|
992be39f90
|
Update README_zh.md
Former-commit-id: 3e8a17c92d700bcafbe6559ea689dc4c0ad0481a
|
2023-11-17 00:18:07 -08:00 |
|
Yuchen Han
|
cab80a3c56
|
Update README.md
Former-commit-id: c1532dc6fe5d5b427011bd5509a2bc44ee16d951
|
2023-11-17 00:17:36 -08:00 |
|
Yuchen Han
|
6af7107938
|
Update workflow.py
Former-commit-id: f70b7ffe6442217a222e0ef797c407f259a13886
|
2023-11-17 00:16:27 -08:00 |
|
Yuchen Han
|
bcd31cf245
|
Update finetuning_args.py
Former-commit-id: 30e3430553f1f7e09cd57ef2c9843b549746c618
|
2023-11-17 00:15:51 -08:00 |
|
hiyouga
|
85c4ccfef9
|
fix packages
Former-commit-id: c93175d18ad9a4b7b61629153acabf8d0c978dfc
|
2023-11-17 16:11:48 +08:00 |
|
hoshi-hiyouga
|
dc0f81aabc
|
Merge #1544 from Outsider565/main, fix #1548
Fix: Change rouge-chinese package name to rouge_chinese
Former-commit-id: c24da51cb5d3f78d54dcbfb31b565fcac4783a76
|
2023-11-17 16:09:42 +08:00 |
|
Shaowen Wang
|
07f934566a
|
Fix: Change rouge-chinese package name to rouge_chinese
To reproduce:
python:
importlib.util.find_spec('rouge-chinese') -> None
importlib.util.find_spec('rouge_chinese') -> ModuleSpec(name='rouge_chinese'...)
from rouge_chinese import Rouge
print(Rouge.__module__) -> rouge_chinese
Former-commit-id: a78b11d944b6cb7dbe2a1d8a24d240e196aa530a
|
2023-11-16 20:12:35 -06:00 |
|
hiyouga
|
77cb18e9e3
|
fix chatglm template
Former-commit-id: 6a4b79c2e0610a17012bf3e72a2b5e8bac060092
|
2023-11-16 22:54:15 +08:00 |
|
hiyouga
|
fccaecf730
|
Update bug-report.yml
Former-commit-id: 92ed2297c78d016113fa7f90cedc0933a0bb2be0
|
2023-11-16 19:37:35 +08:00 |
|
hiyouga
|
53cdfe8f73
|
add issue template
Former-commit-id: 4ca01a6b051043593541403d74e4d464b70e0e4b
|
2023-11-16 19:35:30 +08:00 |
|
hoshi-hiyouga
|
ea03523c6a
|
Update issue templates
Former-commit-id: f967abcfcd052b65745f20e2c760ca45c412b66a
|
2023-11-16 18:56:30 +08:00 |
|
hiyouga
|
caf3cbf8d7
|
fix web ui demo
Former-commit-id: e566a68a27872f730b111078977048755ec74a40
|
2023-11-16 18:41:55 +08:00 |
|
hiyouga
|
da411066c9
|
fix web ui demo
Former-commit-id: 6fead193fe44fec74c2262d8653ed2f6006fac36
|
2023-11-16 17:12:23 +08:00 |
|
hiyouga
|
95d0f77fc2
|
release v0.3.0
Former-commit-id: de7f5b622340ab09ebbe57ad2703e63d06dfdeea
v0.3.0
|
2023-11-16 16:00:11 +08:00 |
|
hiyouga
|
9b2654277b
|
update readme
Former-commit-id: 4018aabc5d1623033d27a8aced25804de79b7e7b
|
2023-11-16 15:58:37 +08:00 |
|
hoshi-hiyouga
|
f1b3bdac3f
|
Merge #1525 from hiyouga/dev, fix #224 #336 #931 #936 #1011
Refactor llmtuner, support full-parameter RLHF
Former-commit-id: 3b92826803dc69471827b4f8204c2c3dc5310619
|
2023-11-16 15:47:13 +08:00 |
|
hiyouga
|
595fdbd95d
|
fix css
Former-commit-id: 7afec127f60257462828298b25a5f6fd9c6f42c5
|
2023-11-16 15:45:38 +08:00 |
|
hiyouga
|
dab9385297
|
fix bug in web ui
Former-commit-id: a598f145ec903dd2b2c984d951b6c450b142ece5
|
2023-11-16 15:21:24 +08:00 |
|
hiyouga
|
df83def566
|
update ppo and demo in webui
Former-commit-id: de7571704c82121db13e3fc907379d2453100191
|
2023-11-16 14:55:26 +08:00 |
|
hiyouga
|
f9d4e37b3c
|
fix bug in freeze tuning
Former-commit-id: f6b436a08421ca17d64abc51497f4aa43729a43b
|
2023-11-16 14:25:11 +08:00 |
|
hiyouga
|
e59a3d71e0
|
tiny fix
Former-commit-id: d65519d8a44b73bbb713741c23465f13c35c83f5
|
2023-11-16 03:27:19 +08:00 |
|
hiyouga
|
de3a84ac59
|
fix rlhf callback
Former-commit-id: f5485452d660caef56474cb7dc37abbe4f34599e
|
2023-11-16 03:26:19 +08:00 |
|
hiyouga
|
e017266b98
|
fix bug in PPO training
Former-commit-id: 2e99f0e53ce6de0acbcab85dd50aef874e8c6336
|
2023-11-16 02:32:54 +08:00 |
|
hiyouga
|
f81a8a5e5c
|
fix import bug
Former-commit-id: 2356029cdd120d5f7bf630b80681ce8c53bff90d
|
2023-11-16 02:27:03 +08:00 |
|
hiyouga
|
7a3a0144a5
|
support full-parameter PPO
Former-commit-id: 4af967d69475e1c9fdf1a7983cd6b83bd431abff
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
8263b2d32d
|
add demo mode for web UI
Former-commit-id: 5ad34f08b4e1505d7933b973497347f126b2e818
|
2023-11-15 23:51:26 +08:00 |
|
hoshi-hiyouga
|
833cd490b8
|
Create CODE_OF_CONDUCT.md
Former-commit-id: 6bee64cdf9c75488033e600fb5b48738daa1ed3b
|
2023-11-15 20:42:15 +08:00 |
|
hiyouga
|
2162c37e41
|
update readme and constants
Former-commit-id: 7d83e3dd9101a4fdd0b589d0c1f7b609c0feecd1
|
2023-11-15 18:04:37 +08:00 |
|
hiyouga
|
b2ac8376e1
|
support multiple modules in freeze training #1514
Former-commit-id: 60abac70dfd778df2ae8b3a2e960ed8b607d7ab6
|
2023-11-15 17:08:18 +08:00 |
|
hiyouga
|
8079584143
|
fix imports
Former-commit-id: 6156f1abef631c675d150dd1cb0325cfc3820c91
|
2023-11-15 16:47:45 +08:00 |
|
hiyouga
|
09a4474e7f
|
disentangle model from tuner and rename modules
Former-commit-id: 02cbf91e7e424f8379c1fed01b82a5f7a83b6947
|
2023-11-15 16:29:09 +08:00 |
|