@dair_ai
Reasoning-Aware Retrieval for Deep Research Agents Deep research agents generate explicit reasoning before every search call. These reasoning traces encode rich signals about search intent and problem-solving context. Yet no existing retriever learns to exploit them effectively. This paper introduces AgentIR, a reasoning-aware retrieval system that jointly embeds the agent's reasoning trace alongside its query instead of just the query alone. Why does it matter? The agent's reasoning acts as a retrieval instruction, a memory of key history, and an implicit filter for outdated information. All of this context is available for free since the agent already generates it. AgentIR-4B achieves 68% accuracy on BrowseComp-Plus with the open-weight Tongyi-DeepResearch agent, compared to 52% with conventional embedding models twice its size and 37% with BM25. It also outperforms LLM-based reranking by 10% absolute, with no additional inference overhead. Paper: https://t.co/rok5nZDfYw Learn to build effective AI agents in our academy: https://t.co/LRnpZN7L4c