Tues Jan 16, 2024 5:12pm PST
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness