@Teknium
@hexyn7 Every tool call an agent makes, it has to send back the whole chat history EVERY time. so if you are at 50k tokens, each tool call from there sends back 50k tokens more. These tokens are majority input tokens, and cached - input tokens usually cost 5x less, and cached input tokens cost 90% less than input tokens in most cases.