Blog

#prompt-caching

Posts on AI engineering, LLM systems, and software development.

Sort:

AI RunningMarch 28, 2026#6

LLM API bills, and why a token costs what it costs

How input and output tokens get priced, why output runs 5-6x more, and how prompt caching cuts the input bill by 10x. Plus the hidden costs that ambush people.

AI AI Running API LLM Pricing

Read →