Stop Calling Tools, Start Writing Code (Mode)
By Jeremiah Lowin
MCP servers do not scale efficiently. As the number of tools increases, all tool schemas are injected into the LLM’s context window before processing the user request. The traditional execution model worsens this inefficiency: each tool call requires a round-trip, with intermediate results repeatedly flowing through the context window, increasing token usage and reasoning overhead. The code mode pattern addresses both problems.