hiyouga
|
542658c986
|
update parser
Former-commit-id: 8f6995081c
|
2024-07-19 01:36:39 +08:00 |
|
hiyouga
|
34f16cc635
|
follow #4878 fix #4684
Former-commit-id: 779aae83d2
|
2024-07-18 22:06:12 +08:00 |
|
Shiyu Zhang
|
249adacc4d
|
仅仅训练最后一轮对话
Former-commit-id: 1e7b396ff2
|
2024-07-18 15:30:25 +08:00 |
|
hiyouga
|
e90fae61f4
|
support batch_eval_metrics, fix #4826
Former-commit-id: d774b94f12
|
2024-07-17 00:33:00 +08:00 |
|
codingma
|
76046dfda8
|
1. change the task name format
2. delete split param in data_args.py
Former-commit-id: 645211dc01
|
2024-07-15 09:55:33 +08:00 |
|
hiyouga
|
22859b8734
|
allow computing rouge in training
Former-commit-id: 99ab7a8c1c
|
2024-07-15 01:16:26 +08:00 |
|
hiyouga
|
14bc7b0551
|
fix up
Former-commit-id: 29ebcd75d5
|
2024-07-15 01:04:56 +08:00 |
|
hoshi-hiyouga
|
2b22a7da48
|
Merge pull request #4691 from codemayq/feature-suppot-eval-dataset
add eval dataset support
Former-commit-id: 15b399a82f
|
2024-07-15 01:00:34 +08:00 |
|
hoshi-hiyouga
|
788dc1c679
|
Update data_args.py
Former-commit-id: cba673f491
|
2024-07-15 00:56:03 +08:00 |
|
hiyouga
|
dfd2d912cd
|
fix #4699
slow tokenizer for yi models
Former-commit-id: 88a20ba797
|
2024-07-14 15:34:22 +08:00 |
|
codingma
|
74f0d02eb8
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
Former-commit-id: 76f3bbcfc0
|
2024-07-05 15:52:10 +08:00 |
|
hiyouga
|
7b3c1f29ff
|
fix packing for eager/sdpa attn
Former-commit-id: 6fd6aa4530
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
bfdaadcc40
|
update packing
Former-commit-id: cce7083024
|
2024-07-04 01:10:55 +08:00 |
|
hiyouga
|
ff6fc666c1
|
update hparams
Former-commit-id: 575a02a23d
|
2024-07-03 23:18:58 +08:00 |
|
ancv
|
7f42932957
|
move efficient_packing from data_args to model_args
Former-commit-id: e8e13b0942
|
2024-07-02 18:37:55 +07:00 |
|
hoshi-hiyouga
|
2452f57cd7
|
Merge branch 'main' into main
Former-commit-id: e8e6af2651
|
2024-07-01 21:01:09 +08:00 |
|
hiyouga
|
ca7b65439d
|
fix #4402 #4617
Deprecate reserved_label_len arg
Former-commit-id: 1771251ce3
|
2024-07-01 01:19:27 +08:00 |
|
hiyouga
|
b0acd27114
|
increase pissa_iter for stability
Former-commit-id: 64f4337dac
|
2024-06-28 03:18:54 +08:00 |
|
hiyouga
|
835f0578c2
|
refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
a294ef2fae
|
fix #4549
Former-commit-id: 8ed6b367e2
|
2024-06-28 00:41:58 +08:00 |
|
hiyouga
|
7c488cea57
|
tiny fix
Former-commit-id: e44a4f07f0
|
2024-06-27 20:14:48 +08:00 |
|
hiyouga
|
d2d9fa4abb
|
support HQQ/EETQ #4113
Former-commit-id: ad144c2265
|
2024-06-27 00:29:42 +08:00 |
|
hiyouga
|
f3f25ae3b7
|
lint
Former-commit-id: 555ca8d780
|
2024-06-25 02:55:50 +08:00 |
|
hiyouga
|
a225b5a70c
|
tiny fix about badam
Former-commit-id: 095fab58d3
|
2024-06-25 01:54:53 +08:00 |
|
hoshi-hiyouga
|
fe6ef6400c
|
Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
Former-commit-id: d0f953bf5b
|
2024-06-25 01:49:13 +08:00 |
|
hiyouga
|
d519c2fde5
|
tiny fix
Former-commit-id: 41086059b1
|
2024-06-25 01:15:19 +08:00 |
|
hoshi-hiyouga
|
709bbc1d92
|
Merge pull request #4417 from mMrBun/main
Add tool_format parameter to rewrite templates for different function call formats.
Former-commit-id: def6d280db
|
2024-06-24 23:17:55 +08:00 |
|
hiyouga
|
47651a94a3
|
fix #4410
Former-commit-id: fca893d73c
|
2024-06-24 22:34:31 +08:00 |
|
hoshi-hiyouga
|
e74fcdf7b1
|
Update parser.py
Former-commit-id: e90c424f55
|
2024-06-24 21:37:42 +08:00 |
|
stceum
|
9aa640f27b
|
Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
Former-commit-id: 3ed063f281
|
2024-06-24 20:39:31 +08:00 |
|
mMrBun
|
c0e005e2ea
|
Add tool_format to overwrite tool formatter template
Former-commit-id: 20e2e6fdcb
|
2024-06-22 02:13:23 +08:00 |
|
ancv
|
5319447aa5
|
move configure_packing to llamafactory.model.patcher and fix constants
Former-commit-id: 770f75dc83
|
2024-06-21 00:45:06 +07:00 |
|
hiyouga
|
0844750bb9
|
tiny fix
Former-commit-id: 8d4f5093cf
|
2024-06-20 22:56:05 +08:00 |
|
Jonery
|
c779899f7b
|
Cleaner integration.
Former-commit-id: 5c2ff1b749
|
2024-06-19 12:29:40 +08:00 |
|
hiyouga
|
5156114981
|
fix #4357
Former-commit-id: 4bd77d8563
|
2024-06-18 22:42:45 +08:00 |
|
Jonery
|
c2734108e7
|
fix typo
Former-commit-id: 8f7c78b641
|
2024-06-18 12:39:26 +08:00 |
|
Jonery
|
3a5eacb4cf
|
Support distributed BAdam.
Former-commit-id: 0f72aac8c9
|
2024-06-18 12:27:47 +08:00 |
|
hiyouga
|
19bf21efba
|
lint
Former-commit-id: 24c160df3d
|
2024-06-17 22:35:56 +08:00 |
|
Jonery
|
5d59f6562a
|
Merge remote-tracking branch 'upstream/main'
Former-commit-id: ea1f3ba5e0
|
2024-06-17 18:44:51 +08:00 |
|
Jonery
|
756566342d
|
adapt for badam with ds zero3
Former-commit-id: 33b4372778
|
2024-06-17 18:18:10 +08:00 |
|
hoshi-hiyouga
|
06bbc29614
|
Update parser.py
Former-commit-id: 29c1f31baa
|
2024-06-16 02:57:00 +08:00 |
|
hiyouga
|
f25b8626bf
|
support pissa
Former-commit-id: 8c1046d78a
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
c0c6b8075a
|
tiny fix
Former-commit-id: 38b6b0f52e
|
2024-06-16 01:06:41 +08:00 |
|
hiyouga
|
96b82ccd4d
|
use fixture
Former-commit-id: 80a9e6bf94
|
2024-06-15 20:06:17 +08:00 |
|
hiyouga
|
2946153cea
|
add license
Former-commit-id: d87108daa6
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
fcbfa70c19
|
disable DP
Former-commit-id: d519b4d76d
|
2024-06-15 04:57:19 +08:00 |
|
hiyouga
|
a3f4925c2c
|
add test cases
Former-commit-id: b27269bd2b
|
2024-06-15 04:05:54 +08:00 |
|
hiyouga
|
99ce085415
|
fix lint
Former-commit-id: 713fde4259
|
2024-06-13 00:48:44 +08:00 |
|
ancv
|
045eb155a2
|
implement efficient packing without cross-contamination attention
Former-commit-id: b2c367bc61
|
2024-06-12 11:56:01 +07:00 |
|
hiyouga
|
5834651c4a
|
fix #4198
Former-commit-id: 89f2bd8c8c
|
2024-06-11 15:38:38 +08:00 |
|