@orhunkupeli

Orhun Küpeli

@orhunkupeliMunich, GermanyJoined May 2026

About

Nothing here yet.

Available for

Nothing here yet.

Orhun Küpeli's blogs

Orhun Kupeliorhunkupeli.hashnode.dev3 posts

Articles Comments

Recently published

OKOrhun Küpeliorhunkupeli.hashnode.devJul 21 · 6 min read

One GPU, Two LLMs

What I was aiming simple in theory: deploy two open-weight LLMs behind a custom gateway, on Kubernetes, infrastructure as code. The contraint that made it interesting was a hard cost cap. One spot GPU

OKOrhun Küpeliorhunkupeli.hashnode.devJun 21 · 5 min read

The Numbers: Benchmarking My LLM Gateway on a H100

A couple of weeks ago I wrote about rewriting my LLM gateway to bring it from MVP to production. The architectural claims were; multi-tenancy, hybrid inference , sub-5ms overhead. So I benchmarked it

OKOrhun Küpeliorhunkupeli.hashnode.devMay 14 · 5 min read

MVP to Mission-Critical: The Idea Behind My LLM Gateway Rewrite

Several months ago I decided to play around with my first LLM gateway prototype which I simply used LiteLLM with some benefits on top. Then I did the math to find out how far it was from the productio

Orhun Küpeli

About

Available for

Orhun Küpeli's blogs

Recently published

One GPU, Two LLMs

The Numbers: Benchmarking My LLM Gateway on a H100

MVP to Mission-Critical: The Idea Behind My LLM Gateway Rewrite

Search Hashnode

Orhun Küpeli

About

Available for

Orhun Küpeli's blogs

Recently published

One GPU, Two LLMs

The Numbers: Benchmarking My LLM Gateway on a H100

MVP to Mission-Critical: The Idea Behind My LLM Gateway Rewrite