How to Measure and Eliminate Token Waste in Production AI Applications