hiyouga
|
47e17dd689
|
imporve log
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
c38aa29336
|
support rank0 logger
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
21db8ed2f4
|
use pre-commit
|
2024-10-29 09:07:46 +00:00 |
|
hoshi-hiyouga
|
fe7057a8a3
|
Update attention.py
|
2024-09-29 10:47:41 +08:00 |
|
Amirreza A
|
94ee105526
|
made a small change to a warning about fa2 for gemma2 models.
|
2024-09-28 19:03:36 +03:30 |
|
hiyouga
|
b7ca6c8dc1
|
fix #5048
|
2024-08-05 23:48:19 +08:00 |
|
hiyouga
|
d3c01552e0
|
tiny fix
|
2024-07-14 10:56:45 +08:00 |
|
hiyouga
|
2f6af73da2
|
fix gemma2 attention
|
2024-07-13 23:33:45 +08:00 |
|
hiyouga
|
8c41a0aa6d
|
tiny fix
|
2024-07-01 03:55:20 +08:00 |
|
hiyouga
|
2f4b89ace1
|
loose gemma2 attention
|
2024-06-29 01:42:14 +08:00 |
|
hiyouga
|
4d35e218b1
|
bf16 by default, gemma2 attns
Gemma2 finetuning cannot work until merging https://github.com/huggingface/transformers/pull/31674
|
2024-06-28 06:00:26 +08:00 |
|
stceum
|
3ed063f281
|
Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
|
2024-06-24 20:39:31 +08:00 |
|
hiyouga
|
d87108daa6
|
add license
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
2ed8270112
|
clean code
|
2024-06-13 01:58:16 +08:00 |
|
hiyouga
|
74f96efef9
|
rename files
|
2024-06-07 00:09:06 +08:00 |
|