@JacksonAtkinsX
Meta just made training AI agents 25x faster.

This is a breakthrough for robotics and complex planning.

Meta's FAIR open-sourced a new method called Scalable Option Learning (SOL). It trains a specialized agent at a scale previously seen only with LLMs.

Here's how it works:

The reason this type of AI (agents trained with hierarchical reinforcement learning, or HRL) has been slow to train is a parallelization bottleneck. Imagine an AI team with a planner and many specialist workers (the sub-tasks). Older methods struggled because they had to process each planner decision one by one before training the workers.

SOL solves this with a new system design:

- A single, unified brain: Instead of separate models, it uses a single actor-critic network to house the planner (controller policy) and all the workers (option policies).
- A digital "switch": A one-hot vector tells this unified brain which role to play at any given moment, a flag that says, "for this input, act as the 'navigation' worker." This allows thousands of decisions across different policies to be batched and sent to the GPU at once.
- A smart "filter" for learning: After the actions are taken, a technique called tensorized masking acts as a filter that routes the right performance feedback (the rewards and advantages) to the correct worker policy. This is what breaks the one-at-a-time update problem.

This architecture allows the entire hierarchical system to learn in parallel batches and removes the bottleneck that held the field back.

Why this matters:

This new training method changes the viability of building agents that can reason and execute long-horizon tasks.

- Business Leaders: This architecture is a key to developing sophisticated autonomous systems. A 25x faster training cycle accelerates R&D in robotics, logistics, and multi-stage process automation, making complex, strategic AI commercially achievable.
- Practitioners: The authors plan to open-source SOL. You can implement agents that learn long-horizon skills without the performance penalty of older HRL methods, creating a path to more structured and potentially more robust models.
- Researchers: This paper presents a validated solution to the HRL scaling problem (Section 3.2). The system for enabling high-throughput, asynchronous updates for a hierarchical agent is a major contribution that opens the door for large-scale experiments in temporal abstraction and credit assignment.
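The one-hot "switch" and tensorized masking ideas above can be sketched in a few lines of numpy. This is a minimal illustration, not Meta's implementation: the linear "network", the dimensions, and all variable names are placeholders chosen for the example, and the real SOL system does this on GPU tensors inside an actor-critic training loop.

```python
import numpy as np

rng = np.random.default_rng(0)

NUM_OPTIONS, OBS_DIM, BATCH = 3, 4, 8  # 3 "worker" policies, toy sizes

# A batch of observations; each step is tagged with the option (worker) in control.
obs = rng.normal(size=(BATCH, OBS_DIM))
option_ids = rng.integers(0, NUM_OPTIONS, size=BATCH)

# The one-hot "switch": appended to the input so one shared network can play
# any role, letting decisions for all options share a single batched GPU pass.
one_hot = np.eye(NUM_OPTIONS)[option_ids]            # (BATCH, NUM_OPTIONS)
net_input = np.concatenate([obs, one_hot], axis=1)   # (BATCH, OBS_DIM + NUM_OPTIONS)

# Stand-in for the unified actor-critic: one weight matrix serves every role.
W = rng.normal(size=(OBS_DIM + NUM_OPTIONS, 2))      # 2 toy actions
logits = net_input @ W                               # all options scored in one pass

# Tensorized masking: route each step's advantage to its own option's update
# with a single matrix product, no per-option Python loop.
advantages = rng.normal(size=BATCH)
per_option_sum = one_hot.T @ advantages              # (NUM_OPTIONS,)
per_option_count = one_hot.sum(axis=0)
per_option_loss = -per_option_sum / np.maximum(per_option_count, 1)
```

The key property is that `per_option_sum[k]` receives contributions only from the steps where option `k` was acting, which is exactly the routing the masking step has to guarantee.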