10 Commits

Author SHA1 Message Date
hiyouga
20013e130b fix #5048
Former-commit-id: b7ca6c8dc14f689d0df16684a6121cc0ec24f8ba
2024-08-05 23:48:19 +08:00
hiyouga
12e0e5d0d7 tiny fix
Former-commit-id: d3c01552e0f978f150902175f096f6e3bfb64363
2024-07-14 10:56:45 +08:00
hiyouga
0b26011181 fix gemma2 attention
Former-commit-id: 2f6af73da28c4f8321b625fd09ddec8bd4977b08
2024-07-13 23:33:45 +08:00
hiyouga
de4de5b5ab tiny fix
Former-commit-id: 8c41a0aa6db8bf31200c83b14819d474927268a1
2024-07-01 03:55:20 +08:00
hiyouga
2b006beab1 loose gemma2 attention
Former-commit-id: 2f4b89ace15b7a4d2adf16eeba9feb7de9e25d43
2024-06-29 01:42:14 +08:00
hiyouga
87e60f8bac bf16 by default, gemma2 attns
Gemma2 finetuning cannot work until merging https://github.com/huggingface/transformers/pull/31674
Former-commit-id: 4d35e218b1d60ff24b368ff5bc608be9c85411de
2024-06-28 06:00:26 +08:00
stceum
9aa640f27b Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
Former-commit-id: 3ed063f281d1c2563df1b9eb3800543208c9dc16
2024-06-24 20:39:31 +08:00
hiyouga
2946153cea add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
2024-06-15 17:54:33 +08:00
hiyouga
833aa324c2 clean code
Former-commit-id: 2ed8270112755971e3f2dfd2f29c5939b077330a
2024-06-13 01:58:16 +08:00
hiyouga
8da149ba40 rename files
Former-commit-id: 74f96efef9bcd63f65d0190c901ff9be54ccd350
2024-06-07 00:09:06 +08:00
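
Note on commit 9aa640f27b: the message says a bare off in a YAML file is parsed as False, which is why the value was renamed to disabled. The sketch below illustrates that behaviour under stated assumptions. It is not the project's code and not a real YAML parser: it only mimics the YAML 1.1-style rule that certain unquoted words resolve to booleans. The function names and the some_option key are hypothetical, chosen for illustration.

```python
# Sketch of YAML 1.1-style boolean resolution for plain (unquoted) scalars.
# Assumption: the word lists below follow the YAML 1.1 bool type as commonly
# implemented by parsers. `resolve_plain_scalar`, `parse_line`, and the
# `some_option` key are hypothetical names, not part of any library.
TRUE_WORDS = {"yes", "Yes", "YES", "true", "True", "TRUE", "on", "On", "ON"}
FALSE_WORDS = {"no", "No", "NO", "false", "False", "FALSE", "off", "Off", "OFF"}


def resolve_plain_scalar(text):
    """Return a bool for YAML 1.1 boolean words, else the string unchanged."""
    if text in TRUE_WORDS:
        return True
    if text in FALSE_WORDS:
        return False
    return text


def parse_line(line):
    """Parse one `key: value` line; quoted values always stay strings."""
    key, _, raw = line.partition(":")
    raw = raw.strip()
    if len(raw) >= 2 and raw[0] == raw[-1] and raw[0] in "'\"":
        return key.strip(), raw[1:-1]
    return key.strip(), resolve_plain_scalar(raw)


if __name__ == "__main__":
    for line in ("some_option: off", "some_option: disabled", 'some_option: "off"'):
        print(parse_line(line))
```

In this sketch an unquoted off becomes the boolean False, so code that compares the value against the string "off" never matches. A word outside the boolean list, such as disabled, stays a string, and quoting the value would be an alternative way to keep off as text.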