Same model, same GPU, 4× the context: a weekend of inference-stack dogfooding
Apr 29 · 26 min read

I have an RTX 3090 sitting in a Xeon Silver 4314 box at home. I wanted to:

- Stand up a local inference stack (vLLM nightly with all the bells: speculative decoding, FlashInfer, prefix caching).
- Use t