Commit Graph

  • da34411bf2 Update test_template.py hoshi-hiyouga 2024-07-15 22:52:25 +08:00
  • 1891b64072 fix #4820 hiyouga 2024-07-15 22:32:07 +08:00
  • a14069acf8 add IN_GITHUB_ACTIONS codingma 2024-07-15 10:28:07 +08:00
  • 0ea708c226 1. change the task name format 2. delete split param in data_args.py codingma 2024-07-15 09:55:33 +08:00
  • cb474c7b11 allow computing rouge in training hiyouga 2024-07-15 01:16:26 +08:00
  • e4d11a117b fix up hiyouga 2024-07-15 01:04:56 +08:00
  • 68365045b4 Merge pull request #4691 from codemayq/feature-suppot-eval-dataset hoshi-hiyouga 2024-07-15 01:00:34 +08:00
  • 502555b65d Update data_args.py hoshi-hiyouga 2024-07-15 00:56:03 +08:00
  • 0bc52c0aae Update preprocess.py hoshi-hiyouga 2024-07-15 00:55:36 +08:00
  • 6bf2663b8e Update parser.py hoshi-hiyouga 2024-07-15 00:55:21 +08:00
  • d337de668e Update data_utils.py hoshi-hiyouga 2024-07-15 00:54:34 +08:00
  • ec372f91e9 Update loader.py hoshi-hiyouga 2024-07-15 00:50:06 +08:00
  • 20b1bd8c54 update test template hiyouga 2024-07-15 00:49:34 +08:00
  • ee17741591 Update parser.py hoshi-hiyouga 2024-07-14 23:04:34 +08:00
  • 93a6925ec5 Update README.md hoshi-hiyouga 2024-07-14 21:27:04 +08:00
  • 47405a8e8a add gemma test hiyouga 2024-07-14 18:01:45 +08:00
  • 54ba30c47f fix test hiyouga 2024-07-14 15:44:30 +08:00
  • b92214f78b fix #4699 hiyouga 2024-07-14 15:34:22 +08:00
  • 71e4404c0d tiny fix hiyouga 2024-07-14 10:56:45 +08:00
  • 5ab997d484 fix gemma2 attention hiyouga 2024-07-13 23:33:45 +08:00
  • 6e7048831b update workflows hiyouga 2024-07-13 22:31:15 +08:00
  • 97cd932c19 Merge pull request #4781 from hzhaoy/fix-dockerfile-cuda hoshi-hiyouga 2024-07-13 22:25:32 +08:00
  • dfc7a7d5cd fix #4792 hiyouga 2024-07-13 22:07:58 +08:00
  • 27e13a8371 Merge pull request #4804 from codemayq/fix-examples hoshi-hiyouga 2024-07-13 20:49:13 +08:00
  • bf6ad1fbed Update llava1_5.yaml hoshi-hiyouga 2024-07-13 20:30:06 +08:00
  • bc71380b59 1. fix output_dir in llama3_lora_pretrain.yaml 2. add llava1_5.yaml for inference codingma 2024-07-13 13:16:22 +08:00
  • 137c87ff60 tiny fix hzhaoy 2024-07-12 00:28:44 +08:00
  • 485b8dc18b fix #4780 hzhaoy 2024-07-12 00:25:48 +08:00
  • 875f9078d1 fix #4779 hzhaoy 2024-07-12 00:15:15 +08:00
  • d3bfcbd3af Merge pull request #4700 from marko1616/patch-1 hoshi-hiyouga 2024-07-10 13:51:50 +08:00
  • e36db692e7 Merge pull request #4746 from yzoaim/fix hoshi-hiyouga 2024-07-10 13:32:49 +08:00
  • 460a40756c Update callbacks.py hoshi-hiyouga 2024-07-10 13:32:20 +08:00
  • 18057e14ef fix src/llamafactory/train/callbacks.py -.- 2024-07-10 12:05:51 +08:00
  • 025c8fe302 fix #4731 hiyouga 2024-07-10 11:32:36 +08:00
  • 446129ca7a fix ppo trainer hiyouga 2024-07-10 11:05:45 +08:00
  • 834c4e8ad9 fix #4742 hiyouga 2024-07-09 23:24:24 +08:00
  • 11d961cf3c Merge pull request #4706 from T-Atlas/main hoshi-hiyouga 2024-07-07 15:50:38 +08:00
  • 00b93d8b2f Update packages.py hoshi-hiyouga 2024-07-07 15:48:29 +08:00
  • 281fd5bb89 chore: Update vllm_engine.py to support vllm version >= 0.5.1 Lian Junhong 2024-07-07 15:08:12 +08:00
  • cb10050cb9 fix #4705 hiyouga 2024-07-07 13:10:06 +08:00
  • 2935c4cddb Update utils.py marko1616 2024-07-06 20:40:13 +08:00
  • 0d6ec70c6f add codegeex4, internlm2.5 hiyouga 2024-07-06 16:16:47 +08:00
  • 74777b4ded update pissa example hiyouga 2024-07-06 15:47:32 +08:00
  • 5f2bd04799 1. add custom eval dataset support 2. merge load dataset and split dataset function codingma 2024-07-05 15:52:10 +08:00
  • 9a1a5f9778 fix processors hiyouga 2024-07-05 08:33:22 +08:00
  • edc8aefa59 fix #4683 hiyouga 2024-07-05 00:58:05 +08:00
  • ee1c786a12 fix #4674 hiyouga 2024-07-05 00:41:03 +08:00
  • a3e4f2b716 Merge branch 'main' of https://github.com/hiyouga/LLaMA-Factory hiyouga 2024-07-04 14:23:37 +08:00
  • 6685f1fb9e fix #4677 hiyouga 2024-07-04 14:22:07 +08:00
  • c89ff328f6 Merge pull request #4673 from hzhaoy/main hoshi-hiyouga 2024-07-04 10:40:41 +08:00
  • c6f1bc65c0 tiny fix hzhaoy 2024-07-04 10:20:28 +08:00
  • 0f43c61229 update tests hiyouga 2024-07-04 04:00:12 +08:00
  • 8567dab167 tiny fix hiyouga 2024-07-04 03:47:05 +08:00
  • 0517d7bee5 tiny fix hiyouga 2024-07-04 03:02:23 +08:00
  • 5bc0b9b31c fix data map for packing hiyouga 2024-07-04 03:01:31 +08:00
  • 3d219b91b9 fix packing for eager/sdpa attn hiyouga 2024-07-04 01:52:43 +08:00
  • a90c6306f8 Merge pull request #4224 from chuan298/main hoshi-hiyouga 2024-07-04 01:18:54 +08:00
  • 60558388ec update packing hiyouga 2024-07-04 01:10:55 +08:00
  • b29a7f8cd6 Update packing.py hoshi-hiyouga 2024-07-03 23:36:01 +08:00
  • a1501591e8 update func name hiyouga 2024-07-03 23:29:33 +08:00
  • 1408aa078d update arg name hiyouga 2024-07-03 23:23:24 +08:00
  • 5acaa476d6 update hparams hiyouga 2024-07-03 23:18:58 +08:00
  • 8ac4f87c91 update ui hiyouga 2024-07-03 23:13:49 +08:00
  • 14d3001824 test hiyouga 2024-07-03 23:05:39 +08:00
  • 1ac9389ddc update scripts hiyouga 2024-07-03 20:07:44 +08:00
  • 0b0e27c2f1 fix #4609 hiyouga 2024-07-03 19:45:51 +08:00
  • fd1199cce4 update readme hiyouga 2024-07-03 19:39:05 +08:00
  • 3c9eda8265 Merge pull request #4662 from wzh1994/wzh/readme hoshi-hiyouga 2024-07-03 15:51:02 +08:00
  • 6622cdb43f Update README_zh.md wangzhihong 2024-07-03 14:59:09 +08:00
  • 49c28a7dab add LazyLLM to Projects using LLaMA Factory in README.md wangzhihong 2024-07-03 11:12:20 +08:00
  • a42671c2d7 tiny fix hiyouga 2024-07-03 02:31:50 +08:00
  • f17ab6ad92 tiny fix hiyouga 2024-07-02 23:06:13 +08:00
  • ca548af2a2 remove rlhf support for chatglm2&3 hiyouga 2024-07-02 23:03:17 +08:00
  • 579997688f upcast logits hiyouga 2024-07-02 22:32:05 +08:00
  • e6ba7ef3e6 improve rlhf hiyouga 2024-07-02 22:23:08 +08:00
  • 20fdf177e8 move efficient_packing from data_args to model_args ancv 2024-07-02 18:37:55 +07:00
  • f0b01803ea Update bug-report.yml hiyouga 2024-07-02 19:18:56 +08:00
  • f5c4841ff2 Update bug-report.yml hiyouga 2024-07-02 19:16:12 +08:00
  • 1e01283d81 Merge pull request #4651 from hzhaoy/add-telechat-1b hoshi-hiyouga 2024-07-02 17:56:43 +08:00
  • 2196448c21 add TeleChat-1B hzhaoy 2024-07-02 17:49:04 +08:00
  • 96a81ce89d fix ppo callbacks hiyouga 2024-07-02 17:34:56 +08:00
  • a715490c2a Merge branch 'main' into main hoshi-hiyouga 2024-07-01 21:01:09 +08:00
  • 973cf8e980 tiny fix hiyouga 2024-07-01 05:43:17 +08:00
  • 4357e42391 tiny fix hiyouga 2024-07-01 03:55:20 +08:00
  • 884b49e662 add eval acc hiyouga 2024-07-01 03:51:20 +08:00
  • 38c94d2e9c Update label_issue.yml hiyouga 2024-07-01 01:29:09 +08:00
  • 67d2eb6b2a fix #4402 #4617 hiyouga 2024-07-01 01:19:27 +08:00
  • b670fb57db update readme hiyouga 2024-07-01 00:22:52 +08:00
  • 188b4be64d fix #4398 #4592 hiyouga 2024-06-30 21:28:51 +08:00
  • 889c042ecd update npu docker hiyouga 2024-06-30 21:05:31 +08:00
  • 3c4f8eaa55 loose gemma2 attention hiyouga 2024-06-29 01:42:14 +08:00
  • 6a75d57060 update readme hiyouga 2024-06-28 06:55:19 +08:00
  • fda2cf677b bf16 by default, gemma2 attns hiyouga 2024-06-28 06:00:26 +08:00
  • cfdf5a5a78 increase pissa_iter for stability hiyouga 2024-06-28 03:18:54 +08:00
  • a1437c15f7 fix docker flashattn hiyouga 2024-06-28 01:28:59 +08:00
  • 42e7489713 add Gemma2 models hiyouga 2024-06-28 01:26:50 +08:00
  • 024760f866 update examples hiyouga 2024-06-28 01:17:07 +08:00
  • 46f0189e88 refactor pissa, improve llamaboard hiyouga 2024-06-28 01:04:24 +08:00
  • edc7498111 Merge pull request #4580 from hzhaoy/bugfix-deepspeed-pissa hoshi-hiyouga 2024-06-28 00:46:51 +08:00
  • 9103fdf866 fix #4549 hiyouga 2024-06-28 00:41:58 +08:00