hey whisper-cpp friends, there’s a new feature called, “Flash Attention,” that makes it faster?: github.com/ggerganov…
use “-fa” to enable it
hey whisper-cpp friends, there’s a new feature called, “Flash Attention,” that makes it faster?: github.com/ggerganov…
use “-fa” to enable it