When an agent is connected to many MCP servers, each with dozens of tools, the full set of tool definitions can consume a large portion of the context window before the user even sends a message. Deferred Tool Loading solves this by giving the agent only the names and descriptions of MCP servers upfront, and letting it discover and load individual tool definitions on demand as needed. This keeps the context window lean at the start of every conversation, while still giving the agent access to the full breadth of tools when it needs them.Documentation Index
Fetch the complete documentation index at: https://www.truefoundry.com/llms.txt
Use this file to discover all available pages before exploring further.
The problem
- Each tool definition — name, description, input schema, output schema — consumes tokens from the available context window.
- With many MCP servers and tools, the context window can fill up before any user interaction happens.
- Most interactions only need a small subset of the available tools, not all of them.

Example
In this example we havetruefoundry-mcp with 100+ tools added with “deferred” mode and a web_search mcp server with 2 tools added without “deferred” mode
User
What vllm version is “nomic-embed-multimodal-7b-1” app in “tfy-test-cluster:test-ws” workspace using? Can you please get me the github release notes link of that version?
Assistant
The agent first lists all tools available on
truefoundry-mcp mcp server.list_tools
list_tools
Tool CallTool Response
Assistant
The agent then gets the tool information for
list_applications tools.get_tool_info
get_tool_info
Tool CallTool Response
Assistant
The agent then calls the
list_applications tool using the meta tool call_tool to find the application by its name.call_tool
call_tool
Tool CallTool Response
Assistant
I can see the vLLM version from the image URI: v0.13.0. Now let me get the GitHub release notes link for that version.
search
search
Tool CallTool Response
Assistant
GitHub Release Notes: 🔗 https://github.com/vllm-project/vllm/releases/tag/v0.13.0
…