🐦 Twitter Post Details

Viewing enriched Twitter post

@ArtificialAnlys

Inworld TTS 1 Max is the new leader on the Artificial Analysis Speech Arena Leaderboard, surpassing MiniMax’s Speech-02 series and OpenAI’s TTS-1 series The Artificial Analysis Speech Arena ranks leading Text to Speech models based on human preferences. In the arena, users compare two pieces of generated speech side by side and select their preferred output without knowing which models created them. The speech arena includes prompts across four real-world categories of prompts: Customer Service, Knowledge Sharing, Digital Assistants, and Entertainment. Inworld TTS 1 Max and Inworld TTS 1 both support 12 languages including English, Spanish, French, Korean, and Chinese, and voice cloning from 2-15 seconds of audio. Inworld TTS 1 processes ~153 characters per second of generation time on average, with the larger model, Inworld TTS 1 Max processing ~69 characters on average. Both models also support voice tags, allowing users to add emotion, delivery style, and non-verbal sounds, such as “whispering”, “cough”, and “surprised”. Both TTS-1 and TTS-1-Max are transformer-based, autoregressive models employing LLaMA-3.2-1B and LLaMA-3.1-8B respectively as their SpeechLM backbones. See the leading models in the Speech Arena, and listen to sample clips below 🎧

Media 1

📊 Media Metadata

{
  "media": [
    {
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/1986464484492447801/media_0.jpg?",
      "media_url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/1986464484492447801/media_0.jpg?",
      "type": "photo",
      "filename": "media_0.jpg"
    }
  ],
  "processed_at": "2025-11-27T20:20:14.887380",
  "pipeline_version": "2.0"
}

🔧 Raw API Response

{
  "type": "tweet",
  "id": "1986464484492447801",
  "url": "https://x.com/ArtificialAnlys/status/1986464484492447801",
  "twitterUrl": "https://twitter.com/ArtificialAnlys/status/1986464484492447801",
  "text": "Inworld TTS 1 Max is the new leader on the Artificial Analysis Speech Arena Leaderboard, surpassing MiniMax’s Speech-02 series and OpenAI’s TTS-1 series\n\nThe Artificial Analysis Speech Arena ranks leading Text to Speech models based on human preferences. In the arena, users compare two pieces of generated speech side by side and select their preferred output without knowing which models created them. The speech arena includes prompts across four real-world categories of prompts: Customer Service, Knowledge Sharing, Digital Assistants, and Entertainment.\n\nInworld TTS 1 Max and Inworld TTS 1 both support 12 languages including English, Spanish, French, Korean, and Chinese, and voice cloning from 2-15 seconds of audio. Inworld TTS 1 processes ~153 characters per second of generation time on average, with the larger model, Inworld TTS 1 Max processing ~69 characters on average. Both models also support voice tags, allowing users to add emotion, delivery style, and non-verbal sounds, such as “whispering”, “cough”, and “surprised”.\n\nBoth TTS-1 and TTS-1-Max are transformer-based, autoregressive models employing LLaMA-3.2-1B and LLaMA-3.1-8B respectively as their SpeechLM backbones.\n\nSee the leading models in the Speech Arena, and listen to sample clips below 🎧",
  "source": "Twitter for iPhone",
  "retweetCount": 19,
  "replyCount": 6,
  "likeCount": 190,
  "quoteCount": 12,
  "viewCount": 59666,
  "createdAt": "Thu Nov 06 16:03:34 +0000 2025",
  "lang": "en",
  "bookmarkCount": 89,
  "isReply": false,
  "inReplyToId": null,
  "conversationId": "1986464484492447801",
  "displayTextRange": [
    0,
    275
  ],
  "inReplyToUserId": null,
  "inReplyToUsername": null,
  "author": {
    "type": "user",
    "userName": "ArtificialAnlys",
    "url": "https://x.com/ArtificialAnlys",
    "twitterUrl": "https://twitter.com/ArtificialAnlys",
    "id": "1743487864934162432",
    "name": "Artificial Analysis",
    "isVerified": false,
    "isBlueVerified": true,
    "verifiedType": null,
    "profilePicture": "https://pbs.twimg.com/profile_images/1810946341511766016/3mg9KIaQ_normal.jpg",
    "coverPicture": "https://pbs.twimg.com/profile_banners/1743487864934162432/1704519394",
    "description": "",
    "location": "San Francisco",
    "followers": 68013,
    "following": 595,
    "status": "",
    "canDm": true,
    "canMediaTag": true,
    "createdAt": "Sat Jan 06 04:21:21 +0000 2024",
    "entities": {
      "description": {
        "urls": []
      },
      "url": {}
    },
    "fastFollowersCount": 0,
    "favouritesCount": 1874,
    "hasCustomTimelines": true,
    "isTranslator": false,
    "mediaCount": 1004,
    "statusesCount": 1618,
    "withheldInCountries": [],
    "affiliatesHighlightedLabel": {},
    "possiblySensitive": false,
    "pinnedTweetIds": [
      "1809670091778207901"
    ],
    "profile_bio": {
      "description": "Independent analysis of AI models and hosting providers - choose the best model and API provider for your use-case",
      "entities": {
        "description": {},
        "url": {
          "urls": [
            {
              "display_url": "artificialanalysis.ai",
              "expanded_url": "http://artificialanalysis.ai/",
              "indices": [
                0,
                23
              ],
              "url": "https://t.co/hEm5Kv0ktE"
            }
          ]
        }
      }
    },
    "isAutomated": false,
    "automatedBy": null
  },
  "extendedEntities": {
    "media": [
      {
        "allow_download_status": {
          "allow_download": true
        },
        "display_url": "pic.twitter.com/Qx3Pe7nWHM",
        "expanded_url": "https://twitter.com/ArtificialAnlys/status/1986464484492447801/photo/1",
        "ext_media_availability": {
          "status": "Available"
        },
        "features": {
          "large": {},
          "orig": {}
        },
        "id_str": "1986463582952062976",
        "indices": [
          276,
          299
        ],
        "media_key": "3_1986463582952062976",
        "media_results": {
          "id": "QXBpTWVkaWFSZXN1bHRzOgwAAQoAARuRVhqdG4AACgACG5FW7IUaEDkAAA==",
          "result": {
            "__typename": "ApiMedia",
            "id": "QXBpTWVkaWE6DAABCgABG5FWGp0bgAAKAAIbkVbshRoQOQAA",
            "media_key": "3_1986463582952062976"
          }
        },
        "media_url_https": "https://pbs.twimg.com/media/G5FWGp0bgAARweG.jpg",
        "original_info": {
          "focus_rects": [
            {
              "h": 1131,
              "w": 2019,
              "x": 0,
              "y": 0
            },
            {
              "h": 1428,
              "w": 1428,
              "x": 446,
              "y": 0
            },
            {
              "h": 1428,
              "w": 1253,
              "x": 534,
              "y": 0
            },
            {
              "h": 1428,
              "w": 714,
              "x": 803,
              "y": 0
            },
            {
              "h": 1428,
              "w": 2019,
              "x": 0,
              "y": 0
            }
          ],
          "height": 1428,
          "width": 2019
        },
        "sizes": {
          "large": {
            "h": 1428,
            "w": 2019
          }
        },
        "type": "photo",
        "url": "https://t.co/Qx3Pe7nWHM"
      }
    ]
  },
  "card": null,
  "place": {},
  "entities": {},
  "quoted_tweet": null,
  "retweeted_tweet": null,
  "isLimitedReply": false,
  "article": null
}