Skip to content

Commit

Permalink
[GPU] Improve sdpa_opt kernel performance with flashattn2 softmax tri…
Browse files Browse the repository at this point in the history
…cks. (openvinotoolkit#28013)

### Details:
- *Switch to FlashAttn2 softmax update tricks which reduces the number
of non-matmul FLOPS.*

### Tickets:
 - *158462*

---------

Co-authored-by: Chen Peter <[email protected]>
  • Loading branch information
ceciliapeng2011 and peterchen-intel authored Jan 7, 2025
1 parent fc8a2ef commit 26e5fe9
Showing 1 changed file with 92 additions and 133 deletions.
Loading

0 comments on commit 26e5fe9

Please sign in to comment.