@dair_ai
New research on agent memory.

Agent memory is typically evaluated on chatbot-style dialogues. But real agents don't chat: they interact with databases, code executors, and web interfaces, generating machine-readable trajectories, not conversational text. Existing memory benchmarks don't actually measure what matters for agentic applications.

This work introduces AMA-Bench, the first benchmark built for evaluating long-horizon memory in real agentic tasks. It spans six domains, including web, text-to-SQL, software engineering, gaming, and embodied AI, with both real-world trajectories and synthetic ones that scale to arbitrary lengths.

The findings are interesting: many agent memory systems that outperform baselines on dialogue benchmarks actually underperform simple long-context LLMs on agentic tasks. Even GPT 5.2 only achieves 72.26% accuracy.

To address this, the authors propose AMA-Agent, which pairs a causality graph with tool-augmented retrieval, achieving 57.22% average accuracy and surpassing the strongest baselines by 11.16%.

Why it matters: agent memory needs to preserve causal dependencies and objective information, not just rely on similarity-based retrieval. This benchmark exposes where current memory systems actually break.

Paper: https://t.co/GX0GaHsijN

Learn to build effective AI agents in our academy: https://t.co/LRnpZN7L4c
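To make the causal-dependency idea concrete, here is a toy sketch (not the paper's implementation — all class and variable names are hypothetical): trajectory steps are stored as nodes in a dependency graph, and retrieval returns a step together with its causal ancestors instead of relying on similarity alone.

```python
# Toy illustration of causality-aware agent memory (hypothetical code,
# not AMA-Agent's actual design): each trajectory step records which
# earlier steps it causally depends on, and retrieval walks those
# dependencies so no prerequisite context is dropped.
from collections import defaultdict


class CausalMemory:
    def __init__(self):
        self.steps = {}                   # step_id -> stored trajectory text
        self.parents = defaultdict(list)  # step_id -> causally prior step_ids

    def add(self, step_id, text, depends_on=()):
        self.steps[step_id] = text
        self.parents[step_id].extend(depends_on)

    def retrieve(self, step_id):
        """Return the step plus all causal ancestors, oldest first."""
        seen, order = set(), []

        def visit(sid):
            if sid in seen:
                return
            seen.add(sid)
            for parent in self.parents[sid]:
                visit(parent)
            order.append(sid)

        visit(step_id)
        return [self.steps[sid] for sid in order]


mem = CausalMemory()
mem.add("s1", "ran: SELECT * FROM users; discovered schema")
mem.add("s2", "noted: users table has column 'email'", depends_on=["s1"])
mem.add("s3", "sent reset link to the user's email", depends_on=["s2"])
print(mem.retrieve("s3"))  # s1's and s2's context comes along with s3
```

A pure embedding-similarity lookup for "reset link" might surface only s3; traversing the graph also recovers s1 and s2, the steps that made s3 valid — which is the kind of information the post argues dialogue-style benchmarks never test.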