🐦 Twitter Post Details

Viewing enriched Twitter post

@ArtificialAnlys

xAI’s new Grok Voice Agent is the new leading Speech to Speech model, surpassing Gemini 2.5 Flash Native Audio and GPT Realtime in our Big Bench Audio benchmark The new model achieves a score of 92.3% on Big Bench Audio, just ahead of the previous leader, Google’s Gemini 2.5 Flash Native Audio Thinking. This model is @xAI’s first public Speech to Speech API, bringing increased competition to the space. The model has tool calling support and xAI has said it’s ready to be used across voice assistants, phone agents, and interactive voice applications. Benchmark context: Big Bench Audio is the first dedicated dataset for evaluating reasoning performance of speech models. Big Bench Audio comprises 1,000 audio questions adapted from the Big Bench Hard text test set, chosen for its rigorous testing of advanced reasoning, translated into the audio domain. Performance: ➤ Reasoning: Achieves 92.3% on Big Bench Audio, setting a new state-of-the-art for native Speech to Speech reasoning. Congratulations @xai and @elonmusk on this impressive release! ➤ Latency: At an average time to first token of 0.78 seconds, it is the third fastest model on our leaderboard behind Google’s Gemini 2.5 Flash Native Audio Dialog and Gemini 2.5 Flash Live ➤ Price: Simple pricing of 5 cents per minute connected, or $3 per hour of audio Key features: ➤ Tool calling: Use built-in tools such as web search, RAG-powered search, or define your own tools with JSON schema ➤ Telephony: Connect to Session Initiation Protocol (SIP) providers like Twilio and Vonage ➤ Multilingual: Converse in over 100 languages with 5 voices to choose from

Media 1

📊 Media Metadata

{
  "media": [
    {
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2001388724987527353/media_0.jpg?",
      "media_url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2001388724987527353/media_0.jpg?",
      "type": "photo",
      "filename": "media_0.jpg"
    }
  ],
  "processed_at": "2025-12-18T06:16:06.987376",
  "pipeline_version": "2.0"
}

🔧 Raw API Response

{
  "type": "tweet",
  "id": "2001388724987527353",
  "url": "https://x.com/ArtificialAnlys/status/2001388724987527353",
  "twitterUrl": "https://twitter.com/ArtificialAnlys/status/2001388724987527353",
  "text": "xAI’s new Grok Voice Agent is the new leading Speech to Speech model, surpassing Gemini 2.5 Flash Native Audio and GPT Realtime in our Big Bench Audio benchmark\n\nThe new model achieves a score of 92.3% on Big Bench Audio, just ahead of the previous leader, Google’s Gemini 2.5 Flash Native Audio Thinking. This model is @xAI’s first public Speech to Speech API, bringing increased competition to the space. The model has tool calling support and xAI has said it’s ready to be used across voice assistants, phone agents, and interactive voice applications.\n\nBenchmark context: Big Bench Audio is the first dedicated dataset for evaluating reasoning performance of speech models. Big Bench Audio comprises 1,000 audio questions adapted from the Big Bench Hard text test set, chosen for its rigorous testing of advanced reasoning, translated into the audio domain.\n\nPerformance:\n➤ Reasoning: Achieves 92.3% on Big Bench Audio, setting a new state-of-the-art for native Speech to Speech reasoning. Congratulations @xai and @elonmusk on this impressive release! \n\n➤ Latency: At an average time to first token of 0.78 seconds, it is the third fastest model on our leaderboard behind Google’s Gemini 2.5 Flash Native Audio Dialog and Gemini 2.5 Flash Live\n\n➤ Price: Simple pricing of 5 cents per minute connected, or $3 per hour of audio\n\nKey features:\n➤ Tool calling: Use built-in tools such as web search, RAG-powered search, or define your own tools with JSON schema\n\n➤ Telephony: Connect to Session Initiation Protocol (SIP) providers like Twilio and Vonage\n\n➤ Multilingual: Converse in over 100 languages with 5 voices to choose from",
  "source": "Twitter for iPhone",
  "retweetCount": 89,
  "replyCount": 51,
  "likeCount": 568,
  "quoteCount": 9,
  "viewCount": 588603,
  "createdAt": "Wed Dec 17 20:27:10 +0000 2025",
  "lang": "en",
  "bookmarkCount": 88,
  "isReply": false,
  "inReplyToId": null,
  "conversationId": "2001388724987527353",
  "displayTextRange": [
    0,
    277
  ],
  "inReplyToUserId": null,
  "inReplyToUsername": null,
  "author": {
    "type": "user",
    "userName": "ArtificialAnlys",
    "url": "https://x.com/ArtificialAnlys",
    "twitterUrl": "https://twitter.com/ArtificialAnlys",
    "id": "1743487864934162432",
    "name": "Artificial Analysis",
    "isVerified": false,
    "isBlueVerified": true,
    "verifiedType": null,
    "profilePicture": "https://pbs.twimg.com/profile_images/1810946341511766016/3mg9KIaQ_normal.jpg",
    "coverPicture": "https://pbs.twimg.com/profile_banners/1743487864934162432/1704519394",
    "description": "",
    "location": "San Francisco",
    "followers": 70412,
    "following": 598,
    "status": "",
    "canDm": true,
    "canMediaTag": true,
    "createdAt": "Sat Jan 06 04:21:21 +0000 2024",
    "entities": {
      "description": {
        "urls": []
      },
      "url": {}
    },
    "fastFollowersCount": 0,
    "favouritesCount": 1933,
    "hasCustomTimelines": true,
    "isTranslator": false,
    "mediaCount": 1098,
    "statusesCount": 1747,
    "withheldInCountries": [],
    "affiliatesHighlightedLabel": {},
    "possiblySensitive": false,
    "pinnedTweetIds": [
      "1809670091778207901"
    ],
    "profile_bio": {
      "description": "Independent analysis of AI models and hosting providers - choose the best model and API provider for your use-case",
      "entities": {
        "description": {},
        "url": {
          "urls": [
            {
              "display_url": "artificialanalysis.ai",
              "expanded_url": "http://artificialanalysis.ai/",
              "indices": [
                0,
                23
              ],
              "url": "https://t.co/hEm5Kv0ktE"
            }
          ]
        }
      }
    },
    "isAutomated": false,
    "automatedBy": null
  },
  "extendedEntities": {
    "media": [
      {
        "allow_download_status": {
          "allow_download": true
        },
        "display_url": "pic.twitter.com/OH6oXxwhCu",
        "expanded_url": "https://twitter.com/ArtificialAnlys/status/2001388724987527353/photo/1",
        "ext_media_availability": {
          "status": "Available"
        },
        "features": {
          "large": {},
          "orig": {}
        },
        "id_str": "2001386349879013376",
        "indices": [
          278,
          301
        ],
        "media_key": "3_2001386349879013376",
        "media_results": {
          "id": "QXBpTWVkaWFSZXN1bHRzOgwAAQoAARvGWkgK2rAACgACG8ZccQpagLkAAA==",
          "result": {
            "__typename": "ApiMedia",
            "id": "QXBpTWVkaWE6DAABCgABG8ZaSArasAAKAAIbxlxxClqAuQAA",
            "media_key": "3_2001386349879013376"
          }
        },
        "media_url_https": "https://pbs.twimg.com/media/G8ZaSArasAAzQxp.jpg",
        "original_info": {
          "focus_rects": [
            {
              "h": 1434,
              "w": 2561,
              "x": 0,
              "y": 0
            },
            {
              "h": 1434,
              "w": 1434,
              "x": 2,
              "y": 0
            },
            {
              "h": 1434,
              "w": 1258,
              "x": 90,
              "y": 0
            },
            {
              "h": 1434,
              "w": 717,
              "x": 361,
              "y": 0
            },
            {
              "h": 1434,
              "w": 2616,
              "x": 0,
              "y": 0
            }
          ],
          "height": 1434,
          "width": 2616
        },
        "sizes": {
          "large": {
            "h": 1123,
            "w": 2048
          }
        },
        "type": "photo",
        "url": "https://t.co/OH6oXxwhCu"
      }
    ]
  },
  "card": null,
  "place": {},
  "entities": {
    "user_mentions": [
      {
        "id_str": "1661523610111193088",
        "indices": [
          320,
          324
        ],
        "name": "xAI",
        "screen_name": "xAI"
      },
      {
        "id_str": "1661523610111193088",
        "indices": [
          1010,
          1014
        ],
        "name": "xAI",
        "screen_name": "xai"
      },
      {
        "id_str": "44196397",
        "indices": [
          1019,
          1028
        ],
        "name": "Elon Musk",
        "screen_name": "elonmusk"
      }
    ]
  },
  "quoted_tweet": null,
  "retweeted_tweet": null,
  "isLimitedReply": false,
  "article": null
}