Commit Graph

  • 639a7f6796 support image input in api #3971 #4061 hiyouga 2024-06-06 02:29:55 +08:00
  • 35379c7c0e update train hparams hiyouga 2024-06-06 01:49:20 +08:00
  • d992f5353f fix setup hiyouga 2024-06-06 01:39:02 +08:00
  • 875eef45f3 add llamafactory-cli env hiyouga 2024-06-06 01:28:14 +08:00
  • 556a4aa972 fix #4090 hiyouga 2024-06-06 00:50:32 +08:00
  • 8dc1969111 modify export_device option MengqingCao 2024-06-05 09:37:36 +00:00
  • b74c229498 fix #4079 hiyouga 2024-06-05 16:56:54 +08:00
  • 3dbca466fd update readme hiyouga 2024-06-05 16:32:32 +08:00
  • ce6f7fdb82 fix #4077 MengqingCao 2024-06-05 08:03:30 +00:00
  • 7528bc1bc0 support glm-4 hiyouga 2024-06-05 15:16:38 +08:00
  • 9dd5f7d642 add npu for model export MengqingCao 2024-06-05 07:06:40 +00:00
  • 99ecb0daaf add throughput entry to log faddddeout 2024-06-04 11:04:29 +00:00
  • 39d8d7995a add: support selecting saved configuration files and loading training parameters hzhaoy 2024-06-04 10:33:43 +08:00
  • 2ac2cde03e tiny fix hiyouga 2024-06-04 00:31:10 +08:00
  • aa6c3766de fix #3873 hiyouga 2024-06-04 00:21:50 +08:00
  • f4f5d7e3ce fix #3992 hiyouga 2024-06-04 00:17:36 +08:00
  • efbf6018d3 fix abort in webui DDP mode hiyouga 2024-06-04 00:10:24 +08:00
  • 1090bb8bf3 Merge pull request #3987 from injet-zhou/main hoshi-hiyouga 2024-06-04 00:04:07 +08:00
  • 26bc79f971 fix #4043 hiyouga 2024-06-03 23:30:37 +08:00
  • 4c1f015eca remove gc warnings in DPO&KTO hiyouga 2024-06-03 22:53:54 +08:00
  • 0655a183d3 Merge pull request #4045 from enji-zhou/feature/add_kto hoshi-hiyouga 2024-06-03 22:09:25 +08:00
  • 7754024e9b Update trainer.py hoshi-hiyouga 2024-06-03 22:08:38 +08:00
  • b4913569a8 fix KTO Trainer Sampler enji.zhou 2024-06-03 21:32:38 +08:00
  • eae9f09ca8 Merge pull request #4006 from Uminosachi/scheduler-kwargs hoshi-hiyouga 2024-06-03 19:27:53 +08:00
  • 8264e5ceaa update placeholder in issue template hiyouga 2024-06-03 19:24:10 +08:00
  • b76f319e45 Merge pull request #4011 from statelesshz/issue-template hoshi-hiyouga 2024-06-03 19:20:43 +08:00
  • 82d744716a fix #4005 #4013 hiyouga 2024-06-03 19:12:29 +08:00
  • 1a3764ab8f Merge pull request #4007 from xu-song/patch-3 hoshi-hiyouga 2024-06-03 18:54:37 +08:00
  • d2ede9d393 fix #4022 hiyouga 2024-06-03 18:38:36 +08:00
  • 5690f513fc bump versions hiyouga 2024-06-03 18:29:38 +08:00
  • 123a845209 fix data loader hint hiyouga 2024-06-03 18:28:27 +08:00
  • b1b7d735b3 remove empty line ylfeng 2024-05-31 21:43:08 +08:00
  • 230c69f7ce fix eos ylfeng 2024-05-31 21:40:41 +08:00
  • bfc43558ef supervised packing with greedy knapsack algorithm ylfeng 2024-05-31 15:33:54 +08:00
  • f2ae2cc04d Update model_args.py Xu Song 2024-05-31 14:35:48 +08:00
  • 6e9c03f958 Update bug-report.yml statelesshz 2024-05-31 13:18:18 +08:00
  • 2696f614a7 Set scheduler_specific_kwargs to get_scheduler Uminosachi 2024-05-31 13:45:39 +09:00
  • 070b944895 update readme hiyouga 2024-05-30 16:40:17 +08:00
  • f5f091d390 fix cann't interrupt training when using multi GPUs in webui faddddeout 2024-05-30 08:39:21 +00:00
  • 14ab14a0e6 fix #3837 hiyouga 2024-05-30 00:52:26 +08:00
  • 4f7c850115 Merge pull request #3829 from seanzhang-zhichen/add_dataset_sample_num hoshi-hiyouga 2024-05-30 00:25:45 +08:00
  • 391eca66cf Update loader.py hoshi-hiyouga 2024-05-30 00:20:20 +08:00
  • a67199246d Update loader.py hoshi-hiyouga 2024-05-30 00:17:21 +08:00
  • 5f67fdaac9 Update loader.py hoshi-hiyouga 2024-05-30 00:12:12 +08:00
  • 05e6fe4287 Update parser.py hoshi-hiyouga 2024-05-30 00:05:20 +08:00
  • 91cc571e6e Update README_zh.md hoshi-hiyouga 2024-05-30 00:04:47 +08:00
  • 890926e60c Update README.md hoshi-hiyouga 2024-05-30 00:04:26 +08:00
  • 87aa332583 better llamaboard hiyouga 2024-05-29 23:55:38 +08:00
  • f90c4ca672 fix cohere system hiyouga 2024-05-29 20:58:23 +08:00
  • a922e85a5c fix #3965 hiyouga 2024-05-29 20:55:51 +08:00
  • 9a65820592 update readme hiyouga 2024-05-29 18:39:11 +08:00
  • f4e16ae373 Merge pull request #3930 from MengqingCao/npu hoshi-hiyouga 2024-05-29 18:33:38 +08:00
  • e2cfd34da0 update torch-npu version MengqingCao 2024-05-29 10:05:11 +00:00
  • 668dea9706 update cann kernels url MengqingCao 2024-05-29 09:53:31 +00:00
  • 084be442f2 Merge pull request #3958 from hzhaoy/add_telechat_12b_support hoshi-hiyouga 2024-05-29 17:20:53 +08:00
  • 29cb4a1327 add TeleChat-12B/TeleChat-12B-v2 models hzhaoy 2024-05-29 15:00:37 +08:00
  • 81a61134b8 fix hf chat engine hiyouga 2024-05-29 01:20:07 +08:00
  • cb1a49aa02 add ds config to webui hiyouga 2024-05-29 01:13:17 +08:00
  • 351b4efc6c 10x generate in ppo w/ zero3 hiyouga 2024-05-29 00:23:23 +08:00
  • 9b551309de update dpo, kto trainer hiyouga 2024-05-29 00:14:29 +08:00
  • 9fed4a2ef4 clean kto trainer hiyouga 2024-05-28 21:43:26 +08:00
  • bceac4f554 bump vllm version to 0.4.1 hiyouga 2024-05-28 21:27:27 +08:00
  • ae3a88d3a7 update readme hiyouga 2024-05-28 19:35:52 +08:00
  • 9138a7a5ba support DDP in webui hiyouga 2024-05-28 19:24:22 +08:00
  • 9912b43fcc update readme hiyouga 2024-05-28 16:41:34 +08:00
  • 5ac37555a4 update readme hiyouga 2024-05-28 16:19:56 +08:00
  • 34bdc730a6 fix #3931 hiyouga 2024-05-28 13:44:22 +08:00
  • e45a9d70fc add Ascend npu doc and dependency MengqingCao 2024-05-28 01:33:54 +00:00
  • 232b36059c Merge pull request #3925 from Yimi81/feat-fix-yi-template hoshi-hiyouga 2024-05-27 22:59:32 +08:00
  • d9fbd675d5 fix yi template Yimi81 2024-05-27 13:11:25 +00:00
  • 0206e7b9de tiny fix hiyouga 2024-05-27 20:54:26 +08:00
  • a886544d3d Merge pull request #3921 from gusye1234/main hoshi-hiyouga 2024-05-27 20:52:37 +08:00
  • 8c9b929bb0 Update template.py hoshi-hiyouga 2024-05-27 20:51:56 +08:00
  • 1bb1ae834e Update template.py hoshi-hiyouga 2024-05-27 20:51:26 +08:00
  • 0d9e364a90 add openchat-3.6-8B support Jianbai Ye 2024-05-27 20:42:08 +08:00
  • 3b28c003dd fix full/freeze tuning for mllm hiyouga 2024-05-27 20:37:57 +08:00
  • 48ff9fb150 Merge pull request #3835 from BUAADreamer/main hoshi-hiyouga 2024-05-27 20:23:45 +08:00
  • c43bc74fe6 support Aya23 hiyouga 2024-05-27 20:23:24 +08:00
  • eaf9cc2195 Merge branch 'hiyouga:main' into main BUAADreamer 2024-05-27 20:10:58 +08:00
  • 4bd276f58f add llava 1k datasets hiyouga 2024-05-27 19:57:33 +08:00
  • f8cf0d5e5d update dpo examples hiyouga 2024-05-27 19:56:04 +08:00
  • 79bc60db33 Merge branch 'hiyouga:main' into main BUAADreamer 2024-05-27 19:00:48 +08:00
  • dc7c54067e add only tune lm and mm_proj BUAADreamer 2024-05-27 19:00:15 +08:00
  • 932f0d5c20 add regex of only tune lm and mm_proj BUAADreamer 2024-05-27 18:59:00 +08:00
  • 9670f5e41a add phi-3 7b/14b, mistral v0.3 models hiyouga 2024-05-27 18:20:16 +08:00
  • 97a23e1cbe update readme hiyouga 2024-05-27 18:14:02 +08:00
  • 11fcd055ec Merge branch 'hiyouga:main' into main BUAADreamer 2024-05-27 11:54:01 +08:00
  • b0d9966663 support SimPO #3900 hiyouga 2024-05-26 23:46:33 +08:00
  • 5c51ab7e1f Merge branch 'hiyouga:main' into main BUAADreamer 2024-05-25 14:18:49 +08:00
  • 26f293d587 fix #3853 hiyouga 2024-05-24 23:29:45 +08:00
  • a3b52fd380 Merge branch 'main' into add_dataset_sample_num seanzhang-zhichen 2024-05-24 15:57:47 +08:00
  • 27d8706d6d Merge branch 'hiyouga:main' into main BUAADreamer 2024-05-24 09:50:00 +08:00
  • bf59383783 refactor data preprocessing, fix mllm rlhf hiyouga 2024-05-24 04:08:25 +08:00
  • 1078611259 Merge pull request #3876 from dongdongqiang2018/main hoshi-hiyouga 2024-05-24 01:54:30 +08:00
  • e6fc0ac8fe fix paligemma sft hiyouga 2024-05-24 00:23:40 +08:00
  • 554ca3d8dc fix oom issues in export hiyouga 2024-05-23 23:32:45 +08:00
  • 86dfdf956d adapted to 910B image donggang 2024-05-23 09:48:22 +00:00
  • c0e4475485 Merge branch 'hiyouga:main' into main BUAADreamer 2024-05-21 22:18:20 +08:00
  • 2b65f8bd5c fix paligemma sft hiyouga 2024-05-21 20:03:09 +08:00
  • 09e78272c2 Update README_zh.md hiyouga 2024-05-21 18:30:59 +08:00