| Author | Commit | Message | Date |
|---|---|---|---|
| hiyouga | 20013e130b | fix #5048<br>Former-commit-id: b7ca6c8dc14f689d0df16684a6121cc0ec24f8ba | 2024-08-05 23:48:19 +08:00 |
| hiyouga | 12e0e5d0d7 | tiny fix<br>Former-commit-id: d3c01552e0f978f150902175f096f6e3bfb64363 | 2024-07-14 10:56:45 +08:00 |
| hiyouga | 0b26011181 | fix gemma2 attention<br>Former-commit-id: 2f6af73da28c4f8321b625fd09ddec8bd4977b08 | 2024-07-13 23:33:45 +08:00 |
| hiyouga | de4de5b5ab | tiny fix<br>Former-commit-id: 8c41a0aa6db8bf31200c83b14819d474927268a1 | 2024-07-01 03:55:20 +08:00 |
| hiyouga | 2b006beab1 | loose gemma2 attention<br>Former-commit-id: 2f4b89ace15b7a4d2adf16eeba9feb7de9e25d43 | 2024-06-29 01:42:14 +08:00 |
| hiyouga | 87e60f8bac | bf16 by default, gemma2 attns<br>Gemma2 finetuning cannot work until merging https://github.com/huggingface/transformers/pull/31674<br>Former-commit-id: 4d35e218b1d60ff24b368ff5bc608be9c85411de | 2024-06-28 06:00:26 +08:00 |
| stceum | 9aa640f27b | Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.<br>Former-commit-id: 3ed063f281d1c2563df1b9eb3800543208c9dc16 | 2024-06-24 20:39:31 +08:00 |
| hiyouga | 2946153cea | add license<br>Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56 | 2024-06-15 17:54:33 +08:00 |
| hiyouga | 833aa324c2 | clean code<br>Former-commit-id: 2ed8270112755971e3f2dfd2f29c5939b077330a | 2024-06-13 01:58:16 +08:00 |
| hiyouga | 8da149ba40 | rename files<br>Former-commit-id: 74f96efef9bcd63f65d0190c901ff9be54ccd350 | 2024-06-07 00:09:06 +08:00 |
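
Commit 9aa640f27b notes that an unquoted `off` in a YAML file is parsed as `False`, which is why the value was renamed to `disabled`. The sketch below illustrates that behaviour using only the standard library. It is not the project's code: `resolve_scalar`, `parse_flat_mapping`, and the key `attn_mode` are hypothetical names introduced for illustration, and the regular expression is a simplified stand-in for the YAML 1.1 boolean rule that parsers such as PyYAML apply to unquoted scalars.

```python
import re

# Unquoted words that YAML 1.1 style parsers resolve to booleans.
YAML11_BOOL = re.compile(
    r"^(?:yes|Yes|YES|no|No|NO|true|True|TRUE|false|False|FALSE"
    r"|on|On|ON|off|Off|OFF)$"
)
TRUE_WORDS = {"yes", "true", "on"}


def resolve_scalar(text):
    """Return a bool for YAML 1.1 boolean words, else the string unchanged."""
    if YAML11_BOOL.match(text):
        return text.lower() in TRUE_WORDS
    return text


def parse_flat_mapping(doc):
    """Parse simple 'key: value' lines (hypothetical helper, not a YAML parser)."""
    result = {}
    for line in doc.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition(":")
        result[key.strip()] = resolve_scalar(value.strip())
    return result


# 'attn_mode' is a made-up key used only for this illustration.
config_off = parse_flat_mapping("attn_mode: off")
config_disabled = parse_flat_mapping("attn_mode: disabled")

print(config_off["attn_mode"])       # False (a bool, not the string "off")
print(config_disabled["attn_mode"])  # disabled (stays a string)
```

Code that compares the loaded value against the string `"off"` would never match, because the value arrives as a boolean. A word outside the boolean set, such as `disabled`, avoids the ambiguity without requiring users to quote the value.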