One doc tagged with "model-routing"

10.5 Cost Optimization

Covers caching, prompt compression, and model routing (choosing between a small and a large model) to control LLM costs.
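
As a rough illustration of how two of these techniques might combine, the sketch below routes short prompts to a cheaper model and caches responses so that a repeated prompt triggers no second call. The model names, the word-count threshold, and the `fake_backend` function are all hypothetical placeholders, not part of any real API; a production router would likely use a better difficulty signal than prompt length.

```python
import hashlib
from typing import Callable, Dict

# Hypothetical model names and routing threshold; both are illustrative assumptions.
SMALL_MODEL = "small-model"
LARGE_MODEL = "large-model"
MAX_SMALL_WORDS = 50


def choose_model(prompt: str) -> str:
    """Route short prompts to the small model and everything else to the large one."""
    return SMALL_MODEL if len(prompt.split()) <= MAX_SMALL_WORDS else LARGE_MODEL


class CachedRouter:
    """Caches responses by prompt hash so repeated prompts incur no model call."""

    def __init__(self, call_model: Callable[[str, str], str]) -> None:
        self._call_model = call_model  # assumed backend: (model, prompt) -> text
        self._cache: Dict[str, str] = {}
        self.calls = 0  # number of real backend calls made

    def complete(self, prompt: str) -> str:
        key = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
        if key not in self._cache:
            self.calls += 1
            self._cache[key] = self._call_model(choose_model(prompt), prompt)
        return self._cache[key]


def fake_backend(model: str, prompt: str) -> str:
    """Stand-in for a real LLM call; returns a string tagged with the chosen model."""
    return f"[{model}] reply"
```

To try it, construct `CachedRouter(fake_backend)` and call `complete` twice with the same prompt; the `calls` counter should stay at one because the second request is served from the cache.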