hoshi-hiyouga
|
792da85866
|
Merge pull request #5095 from relic-yuexi/feat-optimizer
Feat optimizer
Former-commit-id: ef482394f0e2820ee8a245f8a6b050a32591b40a
|
2024-08-09 19:51:33 +08:00 |
|
hiyouga
|
b5146facff
|
follow #5115
Former-commit-id: c87023d539875cd8e622d40212a5627c9c182fb8
|
2024-08-09 18:03:00 +08:00 |
|
“Wzw”
|
13e5fff97a
|
mask_history args verify valid
Former-commit-id: 2fa1e0b2add60142c178e5e21ebaad7132fa5b00
|
2024-08-08 10:12:01 +08:00 |
|
moontidef
|
44f7c4dd56
|
feat: add support for adammini
Former-commit-id: 82bc15dc795f95768b81c25eaaabdc613da30cd8
|
2024-08-07 10:08:22 +08:00 |
|
hiyouga
|
542658c986
|
update parser
Former-commit-id: 8f6995081cbdbb2424da586a443e5220a8990faa
|
2024-07-19 01:36:39 +08:00 |
|
hiyouga
|
34f16cc635
|
follow #4878 fix #4684
Former-commit-id: 779aae83d253de0a86201ff87543b5d695e28d23
|
2024-07-18 22:06:12 +08:00 |
|
Shiyu Zhang
|
249adacc4d
|
仅仅训练最后一轮对话
Former-commit-id: 1e7b396ff2489055574fd3365425d26360d73897
|
2024-07-18 15:30:25 +08:00 |
|
hiyouga
|
e90fae61f4
|
support batch_eval_metrics, fix #4826
Former-commit-id: d774b94f124923829b2eae428e25199d503ebfcb
|
2024-07-17 00:33:00 +08:00 |
|
codingma
|
76046dfda8
|
1. change the task name format
2. delete split param in data_args.py
Former-commit-id: 645211dc01b5d4db3ccd0e3dce03a53860eded26
|
2024-07-15 09:55:33 +08:00 |
|
hiyouga
|
22859b8734
|
allow computing rouge in training
Former-commit-id: 99ab7a8c1c966232faa11b6a42b9740d9a20ace3
|
2024-07-15 01:16:26 +08:00 |
|
hiyouga
|
14bc7b0551
|
fix up
Former-commit-id: 29ebcd75d55f70f2891632eba187b643cc3a9e51
|
2024-07-15 01:04:56 +08:00 |
|
hoshi-hiyouga
|
2b22a7da48
|
Merge pull request #4691 from codemayq/feature-suppot-eval-dataset
add eval dataset support
Former-commit-id: 15b399a82f45b08fc07d2957884fb7821eba9fd9
|
2024-07-15 01:00:34 +08:00 |
|
hoshi-hiyouga
|
788dc1c679
|
Update data_args.py
Former-commit-id: cba673f491c5d97aba62aea03f310bd54fb3fe28
|
2024-07-15 00:56:03 +08:00 |
|
hiyouga
|
dfd2d912cd
|
fix #4699
slow tokenizer for yi models
Former-commit-id: 88a20ba7972c533d650967a118d612471fe2b2e8
|
2024-07-14 15:34:22 +08:00 |
|
codingma
|
74f0d02eb8
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
Former-commit-id: 76f3bbcfc0e11aa41f8f5cbebc60b77b987f7901
|
2024-07-05 15:52:10 +08:00 |
|
hiyouga
|
7b3c1f29ff
|
fix packing for eager/sdpa attn
Former-commit-id: 6fd6aa4530f81a2ed306eeb2a5167607288b62c6
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
bfdaadcc40
|
update packing
Former-commit-id: cce7083024bed4c7429ddc8288d1c9190fde29f5
|
2024-07-04 01:10:55 +08:00 |
|
hiyouga
|
ff6fc666c1
|
update hparams
Former-commit-id: 575a02a23d9b41d00ca6291d8a40b5bdb3cbeeec
|
2024-07-03 23:18:58 +08:00 |
|
ancv
|
7f42932957
|
move efficient_packing from data_args to model_args
Former-commit-id: e8e13b09423dd08a31a3bde8f85833c6e5d43ee5
|
2024-07-02 18:37:55 +07:00 |
|
hoshi-hiyouga
|
2452f57cd7
|
Merge branch 'main' into main
Former-commit-id: e8e6af26514272e29a50649b38182beb4db4ebfa
|
2024-07-01 21:01:09 +08:00 |
|
hiyouga
|
ca7b65439d
|
fix #4402 #4617
Deprecate reserved_label_len arg
Former-commit-id: 1771251ce3f6887b301dac10f3de7a253c5e5884
|
2024-07-01 01:19:27 +08:00 |
|
hiyouga
|
b0acd27114
|
increase pissa_iter for stability
Former-commit-id: 64f4337daca4c914d86a7181dd582508688383cd
|
2024-06-28 03:18:54 +08:00 |
|
hiyouga
|
835f0578c2
|
refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0fb9624807d809832f097301982d192
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
a294ef2fae
|
fix #4549
Former-commit-id: 8ed6b367e26490acab5d2d7b32f0d5dad449d26a
|
2024-06-28 00:41:58 +08:00 |
|
hiyouga
|
7c488cea57
|
tiny fix
Former-commit-id: e44a4f07f09bbee55c10ccee91dd858256c36054
|
2024-06-27 20:14:48 +08:00 |
|
hiyouga
|
d2d9fa4abb
|
support HQQ/EETQ #4113
Former-commit-id: ad144c2265cdee0d23014dbb3d017ea257cb26ed
|
2024-06-27 00:29:42 +08:00 |
|
hiyouga
|
f3f25ae3b7
|
lint
Former-commit-id: 555ca8d780a1fbaf42e73450f5eb33048329d921
|
2024-06-25 02:55:50 +08:00 |
|
hiyouga
|
a225b5a70c
|
tiny fix about badam
Former-commit-id: 095fab58d3692607c9e78747b4218ae1abcf5aaf
|
2024-06-25 01:54:53 +08:00 |
|
hoshi-hiyouga
|
fe6ef6400c
|
Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
Former-commit-id: d0f953bf5bdbfd49acc82ff055bd54889241761a
|
2024-06-25 01:49:13 +08:00 |
|
hiyouga
|
d519c2fde5
|
tiny fix
Former-commit-id: 41086059b12ecb7827eb390294e315068ff9c2e6
|
2024-06-25 01:15:19 +08:00 |
|
hoshi-hiyouga
|
709bbc1d92
|
Merge pull request #4417 from mMrBun/main
Add tool_format parameter to rewrite templates for different function call formats.
Former-commit-id: def6d280db3a9fe468b05503bcd9929c83c6c19b
|
2024-06-24 23:17:55 +08:00 |
|
hiyouga
|
47651a94a3
|
fix #4410
Former-commit-id: fca893d73c3d7bbb87a816522f2e1568d3e9c612
|
2024-06-24 22:34:31 +08:00 |
|
hoshi-hiyouga
|
e74fcdf7b1
|
Update parser.py
Former-commit-id: e90c424f55b17e4971f8b9d85b6aeac89bb6b98e
|
2024-06-24 21:37:42 +08:00 |
|
stceum
|
9aa640f27b
|
Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
Former-commit-id: 3ed063f281d1c2563df1b9eb3800543208c9dc16
|
2024-06-24 20:39:31 +08:00 |
|
mMrBun
|
c0e005e2ea
|
Add tool_format to overwrite tool formatter template
Former-commit-id: 20e2e6fdcb0cd1771906be035745a2d9fcd3e138
|
2024-06-22 02:13:23 +08:00 |
|
ancv
|
5319447aa5
|
move configure_packing to llamafactory.model.patcher and fix constants
Former-commit-id: 770f75dc8363bfa284a72159ff8ad25ec9abe4e0
|
2024-06-21 00:45:06 +07:00 |
|
hiyouga
|
0844750bb9
|
tiny fix
Former-commit-id: 8d4f5093cfcccfe9df173b4c4f7ec0125aecf198
|
2024-06-20 22:56:05 +08:00 |
|
Jonery
|
c779899f7b
|
Cleaner integration.
Former-commit-id: 5c2ff1b749a265dd3c979189ec491d8ac911a6f6
|
2024-06-19 12:29:40 +08:00 |
|
hiyouga
|
5156114981
|
fix #4357
Former-commit-id: 4bd77d8563aa85230af65caf901214247e214bed
|
2024-06-18 22:42:45 +08:00 |
|
Jonery
|
c2734108e7
|
fix typo
Former-commit-id: 8f7c78b64138602406af748b0e15948ebbd2dcb5
|
2024-06-18 12:39:26 +08:00 |
|
Jonery
|
3a5eacb4cf
|
Support distributed BAdam.
Former-commit-id: 0f72aac8c9227e33ad20d2b1641b1c9faae16a5f
|
2024-06-18 12:27:47 +08:00 |
|
hiyouga
|
19bf21efba
|
lint
Former-commit-id: 24c160df3d575843e5ad5f1b47246d04430a79f0
|
2024-06-17 22:35:56 +08:00 |
|
Jonery
|
5d59f6562a
|
Merge remote-tracking branch 'upstream/main'
Former-commit-id: ea1f3ba5e030504e07053484f50f4cbdb37808bc
|
2024-06-17 18:44:51 +08:00 |
|
Jonery
|
756566342d
|
adapt for badam with ds zero3
Former-commit-id: 33b437277846d4f0b64c13a0bc892ef4f345a21e
|
2024-06-17 18:18:10 +08:00 |
|
hoshi-hiyouga
|
06bbc29614
|
Update parser.py
Former-commit-id: 29c1f31baa442e35714b18b7e51896274a828cae
|
2024-06-16 02:57:00 +08:00 |
|
hiyouga
|
f25b8626bf
|
support pissa
Former-commit-id: 8c1046d78ac6c8f9429b73617e35e1eccb35138f
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
c0c6b8075a
|
tiny fix
Former-commit-id: 38b6b0f52edeb8ba45aa03b415b3c0c1b0e0c1e4
|
2024-06-16 01:06:41 +08:00 |
|
hiyouga
|
96b82ccd4d
|
use fixture
Former-commit-id: 80a9e6bf94cf14fa63e6b6cdf7e1ce13722c8b5e
|
2024-06-15 20:06:17 +08:00 |
|
hiyouga
|
2946153cea
|
add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
fcbfa70c19
|
disable DP
Former-commit-id: d519b4d76d39b21a21b1d2f6f7ce6b3af9525d03
|
2024-06-15 04:57:19 +08:00 |
|