@omarsar0
Selective use of long histories The Answer Agent retrieves up to 60 candidates, performs memory distillation to keep only what matters, then generates the answer. RL fine-tuning improves answer quality beyond static retrieval. https://t.co/JzernO0vII