Commit Graph

92 Commits

Author SHA1 Message Date
hiyouga
47ea97fb1b lazy image load 2024-09-04 02:27:08 +08:00
hiyouga
59d2b31e96 fix #5334 2024-09-03 19:09:42 +08:00
hiyouga
8e49940746 add rlhf-v dataset 2024-09-01 22:57:41 +08:00
hiyouga
a025c3df61 remove visual_inputs, fix qlora 2024-08-31 00:24:51 +08:00
hiyouga
3382317e32 refactor mm training 2024-08-30 02:14:31 +08:00
hiyouga
a7dd7d325e update liger kernel 2024-08-29 20:46:08 +08:00
hiyouga
aa1afdc756 fix #5292 2024-08-29 20:37:47 +08:00
hiyouga
72bc8f0111 support liger kernel 2024-08-27 11:20:14 +08:00
hiyouga
e2a28f51c6 add adam_mini to readme 2024-08-09 20:02:03 +08:00
hoshi-hiyouga
ef482394f0 Merge pull request #5095 from relic-yuexi/feat-optimizer
Feat optimizer
2024-08-09 19:51:33 +08:00
hiyouga
c87023d539 follow #5115 2024-08-09 18:03:00 +08:00
“Wzw”
2fa1e0b2ad mask_history args verify valid 2024-08-08 10:12:01 +08:00
moontidef
82bc15dc79 feat: add support for adammini 2024-08-07 10:08:22 +08:00
hiyouga
8f6995081c update parser 2024-07-19 01:36:39 +08:00
hiyouga
779aae83d2 follow #4878 fix #4684 2024-07-18 22:06:12 +08:00
Shiyu Zhang
1e7b396ff2 仅仅训练最后一轮对话 2024-07-18 15:30:25 +08:00
hiyouga
d774b94f12 support batch_eval_metrics, fix #4826 2024-07-17 00:33:00 +08:00
codingma
645211dc01 1. change the task name format
2. delete split param in data_args.py
2024-07-15 09:55:33 +08:00
hiyouga
99ab7a8c1c allow computing rouge in training 2024-07-15 01:16:26 +08:00
hiyouga
29ebcd75d5 fix up 2024-07-15 01:04:56 +08:00
hoshi-hiyouga
15b399a82f Merge pull request #4691 from codemayq/feature-suppot-eval-dataset
add eval dataset support
2024-07-15 01:00:34 +08:00
hoshi-hiyouga
cba673f491 Update data_args.py 2024-07-15 00:56:03 +08:00
hiyouga
88a20ba797 fix #4699
slow tokenizer for yi models
2024-07-14 15:34:22 +08:00
codingma
76f3bbcfc0 1. add custom eval dataset support
2. merge load dataset and split dataset function
2024-07-05 15:52:10 +08:00
hiyouga
6fd6aa4530 fix packing for eager/sdpa attn 2024-07-04 01:52:43 +08:00
hiyouga
cce7083024 update packing 2024-07-04 01:10:55 +08:00
hiyouga
575a02a23d update hparams 2024-07-03 23:18:58 +08:00
ancv
e8e13b0942 move efficient_packing from data_args to model_args 2024-07-02 18:37:55 +07:00
hoshi-hiyouga
e8e6af2651 Merge branch 'main' into main 2024-07-01 21:01:09 +08:00
hiyouga
1771251ce3 fix #4402 #4617
Deprecate reserved_label_len arg
2024-07-01 01:19:27 +08:00
hiyouga
64f4337dac increase pissa_iter for stability 2024-06-28 03:18:54 +08:00
hiyouga
8baf3b22b0 refactor pissa, improve llamaboard 2024-06-28 01:04:24 +08:00
hiyouga
8ed6b367e2 fix #4549 2024-06-28 00:41:58 +08:00
hiyouga
e44a4f07f0 tiny fix 2024-06-27 20:14:48 +08:00
hiyouga
ad144c2265 support HQQ/EETQ #4113 2024-06-27 00:29:42 +08:00
hiyouga
555ca8d780 lint 2024-06-25 02:55:50 +08:00
hiyouga
095fab58d3 tiny fix about badam 2024-06-25 01:54:53 +08:00
hoshi-hiyouga
d0f953bf5b Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
2024-06-25 01:49:13 +08:00
hiyouga
41086059b1 tiny fix 2024-06-25 01:15:19 +08:00
hoshi-hiyouga
def6d280db Merge pull request #4417 from mMrBun/main
Add tool_format parameter to rewrite templates for different function call formats.
2024-06-24 23:17:55 +08:00
hiyouga
fca893d73c fix #4410 2024-06-24 22:34:31 +08:00
hoshi-hiyouga
e90c424f55 Update parser.py 2024-06-24 21:37:42 +08:00
stceum
3ed063f281 Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this. 2024-06-24 20:39:31 +08:00
mMrBun
20e2e6fdcb Add tool_format to overwrite tool formatter template 2024-06-22 02:13:23 +08:00
ancv
770f75dc83 move configure_packing to llamafactory.model.patcher and fix constants 2024-06-21 00:45:06 +07:00
hiyouga
8d4f5093cf tiny fix 2024-06-20 22:56:05 +08:00
Jonery
5c2ff1b749 Cleaner integration. 2024-06-19 12:29:40 +08:00
hiyouga
4bd77d8563 fix #4357 2024-06-18 22:42:45 +08:00
Jonery
8f7c78b641 fix typo 2024-06-18 12:39:26 +08:00
Jonery
0f72aac8c9 Support distributed BAdam. 2024-06-18 12:27:47 +08:00