Add SageAttention, as it speeds up inference by 2 to 3 times without quality loss