10.5 Cost Optimization
Controlling LLM API costs combines several techniques: counting tokens to estimate the cost of each request before sending it, caching responses to avoid paying twice for identical calls, compressing prompts to reduce input tokens, batching requests where the provider supports it, and routing queries between a small, cheap model and a large, expensive one based on difficulty.
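Token counting and cost estimation can be sketched as follows. This is a minimal illustration: the per-million-token prices and the model names "small" and "large" are placeholders, not any provider's actual rates, and the 4-characters-per-token heuristic stands in for a real tokenizer.

```python
# Rough cost estimator for LLM API calls.
# NOTE: prices and model names below are illustrative placeholders.
PRICES = {  # (input, output) in USD per 1M tokens -- hypothetical values
    "small": (0.15, 0.60),
    "large": (2.50, 10.00),
}

def estimate_tokens(text: str) -> int:
    """Crude heuristic: roughly 4 characters per token for English text.
    A production system would use the provider's own tokenizer instead."""
    return max(1, len(text) // 4)

def estimate_cost(model: str, prompt: str, expected_output_tokens: int) -> float:
    """Estimated USD cost of one request, input plus expected output."""
    in_price, out_price = PRICES[model]
    in_tokens = estimate_tokens(prompt)
    return (in_tokens * in_price + expected_output_tokens * out_price) / 1_000_000

# Example: a 400-character prompt with ~100 output tokens on each model.
print(estimate_cost("small", "x" * 400, 100))
print(estimate_cost("large", "x" * 400, 100))
```

Estimating before sending makes it possible to enforce per-request or per-user budgets, and to log projected versus actual spend.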
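Caching and model routing can be combined in a thin wrapper around the API call. The sketch below is a simplification under stated assumptions: the in-process dict stands in for a shared cache (e.g. Redis with a TTL), and the length/keyword router stands in for a learned difficulty classifier; the marker words are invented for illustration.

```python
import hashlib

# Exact-match response cache keyed on (model, prompt).
# An in-process dict for illustration; production would use a shared store.
_cache: dict = {}

def cached_call(model: str, prompt: str, call_fn):
    """Return a cached response for an identical (model, prompt) pair;
    otherwise invoke call_fn (the actual API call) and store the result."""
    key = hashlib.sha256(f"{model}\x00{prompt}".encode()).hexdigest()
    if key not in _cache:
        _cache[key] = call_fn(model, prompt)
    return _cache[key]

def route(prompt: str) -> str:
    """Toy router: short, simple prompts go to the small model; long
    prompts or ones containing difficulty markers go to the large model."""
    hard_markers = ("prove", "analyze", "step by step")  # hypothetical
    if len(prompt) > 2000 or any(m in prompt.lower() for m in hard_markers):
        return "large"
    return "small"

def answer(prompt: str, call_fn):
    """Route the prompt, then serve it through the cache."""
    return cached_call(route(prompt), prompt, call_fn)
```

Exact-match caching only pays off when prompts repeat verbatim; for paraphrased repeats, embedding-based semantic caching is the usual next step.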