54 Commits

Author SHA1 Message Date
Liuww
f91a9a250a fix: Repair the issue where quantization failed after merging the adapter. 2024-07-24 14:31:29 +08:00
hiyouga
c333e2f49d tiny fix 2024-07-22 00:06:03 +08:00
hiyouga
779aae83d2 follow #4878 fix #4684 2024-07-18 22:06:12 +08:00
Shiyu Zhang
1e7b396ff2 Train only on the last turn of the conversation 2024-07-18 15:30:25 +08:00
hiyouga
29ebcd75d5 fix up 2024-07-15 01:04:56 +08:00
hiyouga
6b48308ef9 fix #4792 2024-07-13 22:07:58 +08:00
hoshi-hiyouga
555194e150 Merge pull request #4700 from marko1616/patch-1
Fix Windows command preview
2024-07-10 13:51:50 +08:00
hiyouga
51942acee8 fix #4731 2024-07-10 11:32:36 +08:00
hiyouga
a15782cb9f fix #4705 2024-07-07 13:10:06 +08:00
marko1616
e0562521bb Update utils.py
On Windows, a multiline command should look like:
command --arg1 xxx `
--arg2 xxx `
2024-07-06 20:40:13 +08:00
hiyouga
7f770f6895 update ui 2024-07-03 23:13:49 +08:00
hoshi-hiyouga
e8e6af2651 Merge branch 'main' into main 2024-07-01 21:01:09 +08:00
hiyouga
4d35e218b1 bf16 by default, gemma2 attns
Gemma2 finetuning cannot work until merging https://github.com/huggingface/transformers/pull/31674
2024-06-28 06:00:26 +08:00
hiyouga
8baf3b22b0 refactor pissa, improve llamaboard 2024-06-28 01:04:24 +08:00
hiyouga
f17c9dfd84 tiny fix 2024-06-27 00:46:41 +08:00
hiyouga
29c710da3a tiny fix 2024-06-27 00:36:04 +08:00
hiyouga
ad144c2265 support HQQ/EETQ #4113 2024-06-27 00:29:42 +08:00
hiyouga
41086059b1 tiny fix 2024-06-25 01:15:19 +08:00
hiyouga
fca893d73c fix #4410 2024-06-24 22:34:31 +08:00
hiyouga
8d4f5093cf tiny fix 2024-06-20 22:56:05 +08:00
hiyouga
f22d8f9ca4 improve llamaboard 2024-06-19 23:46:03 +08:00
hiyouga
3f84411b5d fix llamaboard abort 2024-06-19 23:22:28 +08:00
hiyouga
cd75b1fe9d fix tool formatter, allow parallel function #4362 2024-06-19 03:23:51 +08:00
hiyouga
8c1046d78a support pissa 2024-06-16 01:08:12 +08:00
hiyouga
d87108daa6 add license 2024-06-15 17:54:33 +08:00
hiyouga
9092f963db fix #4292 2024-06-15 04:47:13 +08:00
hiyouga
c94e6c9411 add quant check in webui export tab 2024-06-13 03:19:18 +08:00
ancv
b2c367bc61 implement efficient packing without cross-contamination attention 2024-06-12 11:56:01 +07:00
hiyouga
06e5d136a4 add resume args in webui 2024-06-08 00:22:16 +08:00
hiyouga
8bf9da659c fix #4137 2024-06-07 19:16:06 +08:00
hiyouga
74f96efef9 rename files 2024-06-07 00:09:06 +08:00
hiyouga
cae4737907 lora modules: all by default 2024-06-06 03:53:28 +08:00
hiyouga
7daf8366db lint 2024-06-06 03:33:44 +08:00
hoshi-hiyouga
f2580ad403 Merge pull request #4066 from injet-zhou/main
add throughput entry to training log
2024-06-06 03:32:04 +08:00
hoshi-hiyouga
ca459f67eb Merge pull request #4080 from MengqingCao/npu
Add npu option for model exporting
2024-06-06 03:15:44 +08:00
hoshi-hiyouga
feaee36c46 Update export.py 2024-06-06 03:14:46 +08:00
hoshi-hiyouga
0e740aa463 Merge pull request #4053 from hzhaoy/feature/add_select_config_file
Support selecting saved configuration files
2024-06-06 03:06:03 +08:00
hiyouga
dc4a00dd63 update train hparams 2024-06-06 01:49:20 +08:00
MengqingCao
2c03052662 modify export_device option 2024-06-05 09:37:36 +00:00
MengqingCao
07045c876a add npu for model export 2024-06-05 07:06:40 +00:00
faddddeout
b2f0459542 add throughput entry to log 2024-06-04 11:04:29 +00:00
hzhaoy
b27c4cfcb3 add: support selecting saved configuration files and loading training parameters 2024-06-04 10:33:43 +08:00
hiyouga
2187518762 fix abort in webui DDP mode 2024-06-04 00:10:24 +08:00
hoshi-hiyouga
ae18e1e251 Merge pull request #3987 from injet-zhou/main
Fix can't interrupt training when using multiple GPUs in webui
2024-06-04 00:04:07 +08:00
hiyouga
79784ebeb6 fix #4043 2024-06-03 23:30:37 +08:00
faddddeout
b13d03946e fix can't interrupt training when using multiple GPUs in webui 2024-05-30 08:39:21 +00:00
hiyouga
8070871732 better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui
2024-05-29 23:55:38 +08:00
hiyouga
e4b420c146 add ds config to webui 2024-05-29 01:13:17 +08:00
hiyouga
7c016b22aa support DDP in webui 2024-05-28 19:24:22 +08:00
hiyouga
cb63b32986 support SimPO #3900 2024-05-26 23:46:33 +08:00