Attention kernels for LLM inference
Jan 7, 2025 · 9 min read

FlashInfer is a kernel library for LLMs that provides high-performance implementations of PagedAttention, FlashAttention, and a few other attention algorithms. Relative to the original implementations of these algorithms, FlashInfer promises “state-of-the-art performance...