We put a cost SLO on our LLM features. It is the number that finally made eng care about token spend.
TL;DR: Our LLM bill was a single number going up, and "spend rose 18%" is not something an engineer can act on. We gave each feature a cost SLO, a target cost per successful outcome with a monthly err
jas-blogs.hashnode.dev3 min read