From 650a2a2e01150ffd51e3fcccbe45ba43cfb8a196 Mon Sep 17 00:00:00 2001 From: hiyouga Date: Wed, 27 Sep 2023 21:57:47 +0800 Subject: [PATCH] update readme Former-commit-id: 4eae06146436a21fa63239c3687882b1e9e8ba13 --- README.md | 2 +- README_zh.md | 2 +- 2 files changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 50157e50..e4a96bed 100644 --- a/README.md +++ b/README.md @@ -14,7 +14,7 @@ ## Changelog -[23/09/27] We supported **S^2-Attn** proposed by [LongLoRA](https://github.com/dvlab-research/LongLoRA). Try `--shift_attn` argument to enable shift short attention. +[23/09/27] We supported **$S^2$-Attn** proposed by [LongLoRA](https://github.com/dvlab-research/LongLoRA). Try `--shift_attn` argument to enable shift short attention. [23/09/23] We integrated MMLU, C-Eval and CMMLU benchmarks in this repo. See [this example](#evaluation) to evaluate your models. diff --git a/README_zh.md b/README_zh.md index 2b9d8533..04802e90 100644 --- a/README_zh.md +++ b/README_zh.md @@ -14,7 +14,7 @@ ## 更新日志 -[23/09/27] 我们支持了 [LongLoRA](https://github.com/dvlab-research/LongLoRA) 提出的 **S^2-Attn**。请使用 `--shift_attn` 参数以启用该功能。 +[23/09/27] 我们支持了 [LongLoRA](https://github.com/dvlab-research/LongLoRA) 提出的 **$S^2$-Attn**。请使用 `--shift_attn` 参数以启用该功能。 [23/09/23] 我们在项目中集成了 MMLU、C-Eval 和 CMMLU 评估集。使用方法请参阅[此示例](#模型评估)。