🧪

Knowledge Challenge

A friend thinks you can answer this question about AI Context Window Strategy

Your customer-support copilot has a 12K-token system prompt (instructions + examples + product knowledge), and handles 50K conversations/month. Each conversation averages 4 LLM turns. You're not using prompt caching. What's the highest-leverage optimization?