133 Commits

Author SHA1 Message Date
hoshi-hiyouga
792da85866 Merge pull request #5095 from relic-yuexi/feat-optimizer
Feat optimizer

Former-commit-id: ef482394f0e2820ee8a245f8a6b050a32591b40a
2024-08-09 19:51:33 +08:00
hiyouga
b5146facff follow #5115
Former-commit-id: c87023d539875cd8e622d40212a5627c9c182fb8
2024-08-09 18:03:00 +08:00
“Wzw”
13e5fff97a mask_history args verify valid
Former-commit-id: 2fa1e0b2add60142c178e5e21ebaad7132fa5b00
2024-08-08 10:12:01 +08:00
moontidef
44f7c4dd56 feat: add support for adammini
Former-commit-id: 82bc15dc795f95768b81c25eaaabdc613da30cd8
2024-08-07 10:08:22 +08:00
hiyouga
542658c986 update parser
Former-commit-id: 8f6995081cbdbb2424da586a443e5220a8990faa
2024-07-19 01:36:39 +08:00
hiyouga
34f16cc635 follow #4878 fix #4684
Former-commit-id: 779aae83d253de0a86201ff87543b5d695e28d23
2024-07-18 22:06:12 +08:00
Shiyu Zhang
249adacc4d 仅仅训练最后一轮对话
Former-commit-id: 1e7b396ff2489055574fd3365425d26360d73897
2024-07-18 15:30:25 +08:00
hiyouga
e90fae61f4 support batch_eval_metrics, fix #4826
Former-commit-id: d774b94f124923829b2eae428e25199d503ebfcb
2024-07-17 00:33:00 +08:00
codingma
76046dfda8 1. change the task name format
2. delete split param in data_args.py


Former-commit-id: 645211dc01b5d4db3ccd0e3dce03a53860eded26
2024-07-15 09:55:33 +08:00
hiyouga
22859b8734 allow computing rouge in training
Former-commit-id: 99ab7a8c1c966232faa11b6a42b9740d9a20ace3
2024-07-15 01:16:26 +08:00
hiyouga
14bc7b0551 fix up
Former-commit-id: 29ebcd75d55f70f2891632eba187b643cc3a9e51
2024-07-15 01:04:56 +08:00
hoshi-hiyouga
2b22a7da48 Merge pull request #4691 from codemayq/feature-suppot-eval-dataset
add eval dataset support

Former-commit-id: 15b399a82f45b08fc07d2957884fb7821eba9fd9
2024-07-15 01:00:34 +08:00
hoshi-hiyouga
788dc1c679 Update data_args.py
Former-commit-id: cba673f491c5d97aba62aea03f310bd54fb3fe28
2024-07-15 00:56:03 +08:00
hiyouga
dfd2d912cd fix #4699
slow tokenizer for yi models


Former-commit-id: 88a20ba7972c533d650967a118d612471fe2b2e8
2024-07-14 15:34:22 +08:00
codingma
74f0d02eb8 1. add custom eval dataset support
2. merge load dataset and split dataset function


Former-commit-id: 76f3bbcfc0e11aa41f8f5cbebc60b77b987f7901
2024-07-05 15:52:10 +08:00
hiyouga
7b3c1f29ff fix packing for eager/sdpa attn
Former-commit-id: 6fd6aa4530f81a2ed306eeb2a5167607288b62c6
2024-07-04 01:52:43 +08:00
hiyouga
bfdaadcc40 update packing
Former-commit-id: cce7083024bed4c7429ddc8288d1c9190fde29f5
2024-07-04 01:10:55 +08:00
hiyouga
ff6fc666c1 update hparams
Former-commit-id: 575a02a23d9b41d00ca6291d8a40b5bdb3cbeeec
2024-07-03 23:18:58 +08:00
ancv
7f42932957 move efficient_packing from data_args to model_args
Former-commit-id: e8e13b09423dd08a31a3bde8f85833c6e5d43ee5
2024-07-02 18:37:55 +07:00
hoshi-hiyouga
2452f57cd7 Merge branch 'main' into main
Former-commit-id: e8e6af26514272e29a50649b38182beb4db4ebfa
2024-07-01 21:01:09 +08:00
hiyouga
ca7b65439d fix #4402 #4617
Deprecate reserved_label_len arg


Former-commit-id: 1771251ce3f6887b301dac10f3de7a253c5e5884
2024-07-01 01:19:27 +08:00
hiyouga
b0acd27114 increase pissa_iter for stability
Former-commit-id: 64f4337daca4c914d86a7181dd582508688383cd
2024-06-28 03:18:54 +08:00
hiyouga
835f0578c2 refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0fb9624807d809832f097301982d192
2024-06-28 01:04:24 +08:00
hiyouga
a294ef2fae fix #4549
Former-commit-id: 8ed6b367e26490acab5d2d7b32f0d5dad449d26a
2024-06-28 00:41:58 +08:00
hiyouga
7c488cea57 tiny fix
Former-commit-id: e44a4f07f09bbee55c10ccee91dd858256c36054
2024-06-27 20:14:48 +08:00
hiyouga
d2d9fa4abb support HQQ/EETQ #4113
Former-commit-id: ad144c2265cdee0d23014dbb3d017ea257cb26ed
2024-06-27 00:29:42 +08:00
hiyouga
f3f25ae3b7 lint
Former-commit-id: 555ca8d780a1fbaf42e73450f5eb33048329d921
2024-06-25 02:55:50 +08:00
hiyouga
a225b5a70c tiny fix about badam
Former-commit-id: 095fab58d3692607c9e78747b4218ae1abcf5aaf
2024-06-25 01:54:53 +08:00
hoshi-hiyouga
fe6ef6400c Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam

Former-commit-id: d0f953bf5bdbfd49acc82ff055bd54889241761a
2024-06-25 01:49:13 +08:00
hiyouga
d519c2fde5 tiny fix
Former-commit-id: 41086059b12ecb7827eb390294e315068ff9c2e6
2024-06-25 01:15:19 +08:00
hoshi-hiyouga
709bbc1d92 Merge pull request #4417 from mMrBun/main
Add tool_format parameter to rewrite templates for different function call formats.

Former-commit-id: def6d280db3a9fe468b05503bcd9929c83c6c19b
2024-06-24 23:17:55 +08:00
hiyouga
47651a94a3 fix #4410
Former-commit-id: fca893d73c3d7bbb87a816522f2e1568d3e9c612
2024-06-24 22:34:31 +08:00
hoshi-hiyouga
e74fcdf7b1 Update parser.py
Former-commit-id: e90c424f55b17e4971f8b9d85b6aeac89bb6b98e
2024-06-24 21:37:42 +08:00
stceum
9aa640f27b Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
Former-commit-id: 3ed063f281d1c2563df1b9eb3800543208c9dc16
2024-06-24 20:39:31 +08:00
mMrBun
c0e005e2ea Add tool_format to overwrite tool formatter template
Former-commit-id: 20e2e6fdcb0cd1771906be035745a2d9fcd3e138
2024-06-22 02:13:23 +08:00
ancv
5319447aa5 move configure_packing to llamafactory.model.patcher and fix constants
Former-commit-id: 770f75dc8363bfa284a72159ff8ad25ec9abe4e0
2024-06-21 00:45:06 +07:00
hiyouga
0844750bb9 tiny fix
Former-commit-id: 8d4f5093cfcccfe9df173b4c4f7ec0125aecf198
2024-06-20 22:56:05 +08:00
Jonery
c779899f7b Cleaner integration.
Former-commit-id: 5c2ff1b749a265dd3c979189ec491d8ac911a6f6
2024-06-19 12:29:40 +08:00
hiyouga
5156114981 fix #4357
Former-commit-id: 4bd77d8563aa85230af65caf901214247e214bed
2024-06-18 22:42:45 +08:00
Jonery
c2734108e7 fix typo
Former-commit-id: 8f7c78b64138602406af748b0e15948ebbd2dcb5
2024-06-18 12:39:26 +08:00
Jonery
3a5eacb4cf Support distributed BAdam.
Former-commit-id: 0f72aac8c9227e33ad20d2b1641b1c9faae16a5f
2024-06-18 12:27:47 +08:00
hiyouga
19bf21efba lint
Former-commit-id: 24c160df3d575843e5ad5f1b47246d04430a79f0
2024-06-17 22:35:56 +08:00
Jonery
5d59f6562a Merge remote-tracking branch 'upstream/main'
Former-commit-id: ea1f3ba5e030504e07053484f50f4cbdb37808bc
2024-06-17 18:44:51 +08:00
Jonery
756566342d adapt for badam with ds zero3
Former-commit-id: 33b437277846d4f0b64c13a0bc892ef4f345a21e
2024-06-17 18:18:10 +08:00
hoshi-hiyouga
06bbc29614 Update parser.py
Former-commit-id: 29c1f31baa442e35714b18b7e51896274a828cae
2024-06-16 02:57:00 +08:00
hiyouga
f25b8626bf support pissa
Former-commit-id: 8c1046d78ac6c8f9429b73617e35e1eccb35138f
2024-06-16 01:08:12 +08:00
hiyouga
c0c6b8075a tiny fix
Former-commit-id: 38b6b0f52edeb8ba45aa03b415b3c0c1b0e0c1e4
2024-06-16 01:06:41 +08:00
hiyouga
96b82ccd4d use fixture
Former-commit-id: 80a9e6bf94cf14fa63e6b6cdf7e1ce13722c8b5e
2024-06-15 20:06:17 +08:00
hiyouga
2946153cea add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
2024-06-15 17:54:33 +08:00
hiyouga
fcbfa70c19 disable DP
Former-commit-id: d519b4d76d39b21a21b1d2f6f7ce6b3af9525d03
2024-06-15 04:57:19 +08:00