🐦 Twitter Post Details

Viewing enriched Twitter post

@emollick

Multimodal vision continues to be the most difficult AI ability to get a strong intuition for. The models can do incredible things like recognize places from subtle clues or read emotion & attitudes, but also miss stuff, like the fact that this image is upsettingly distorted. https://t.co/VhpUoRd0Uw

View on Twitter

📊 Media Metadata

{
  "media": [
    {
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/1857958965893599705/media_0.jpg",
      "type": "photo",
      "original_url": "https://pbs.twimg.com/media/GcjL0fiWEAE6kib.jpg",
      "recovered_from_supabase": true
    },
    {
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/1857958965893599705/media_1.jpg",
      "type": "photo",
      "original_url": "https://pbs.twimg.com/media/GcjL0zUWUAAHVmb.jpg",
      "recovered_from_supabase": true
    },
    {
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/1857958965893599705/media_2.jpg",
      "type": "photo",
      "original_url": "https://pbs.twimg.com/media/GcjL1DbX0AAGMgH.jpg",
      "recovered_from_supabase": true
    },
    {
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/1857958965893599705/media_3.jpg",
      "type": "photo",
      "original_url": "https://pbs.twimg.com/media/GcjL1R_XYAA1NI4.jpg",
      "recovered_from_supabase": true
    }
  ],
  "conversion_date": "2025-08-13T00:28:36.992612",
  "format_converted": true,
  "original_structure": "had_media_only"
}

🔧 Raw API Response

{
  "user": {
    "created_at": "2009-05-10T22:33:52.000Z",
    "default_profile_image": false,
    "description": "Professor @Wharton studying AI, innovation & startups. Democratizing education using tech\nBook: https://t.co/CSmipbJ2jV\nSubstack: https://t.co/UIBhxu4bgq",
    "fast_followers_count": 0,
    "favourites_count": 6584,
    "followers_count": 242521,
    "friends_count": 557,
    "has_custom_timelines": false,
    "is_translator": false,
    "listed_count": 5555,
    "location": "Philadelphia, PA",
    "media_count": 12485,
    "name": "Ethan Mollick",
    "normal_followers_count": 242521,
    "possibly_sensitive": false,
    "profile_banner_url": "https://pbs.twimg.com/profile_banners/39125788/1713840725",
    "profile_image_url_https": "https://pbs.twimg.com/profile_images/1601382188712398850/3AAOlqrX_normal.jpg",
    "screen_name": "emollick",
    "statuses_count": 29379,
    "translator_type": "none",
    "url": "https://t.co/uItckI9ujc",
    "verified": true,
    "withheld_in_countries": [],
    "id_str": "39125788"
  },
  "id": "1857958965893599705",
  "conversation_id": "1857958965893599705",
  "full_text": "Multimodal vision continues to be the most difficult AI ability to get a strong intuition for. The models can do incredible things like recognize places from subtle clues or read emotion &amp; attitudes, but also miss stuff, like the fact that this image is upsettingly distorted. https://t.co/VhpUoRd0Uw",
  "reply_count": 14,
  "retweet_count": 12,
  "favorite_count": 203,
  "hashtags": [],
  "symbols": [],
  "user_mentions": [],
  "urls": [],
  "media": [
    {
      "media_url": "https://pbs.twimg.com/media/GcjL0fiWEAE6kib.jpg",
      "type": "photo"
    },
    {
      "media_url": "https://pbs.twimg.com/media/GcjL0zUWUAAHVmb.jpg",
      "type": "photo"
    },
    {
      "media_url": "https://pbs.twimg.com/media/GcjL1DbX0AAGMgH.jpg",
      "type": "photo"
    },
    {
      "media_url": "https://pbs.twimg.com/media/GcjL1R_XYAA1NI4.jpg",
      "type": "photo"
    }
  ],
  "url": "https://twitter.com/emollick/status/1857958965893599705",
  "created_at": "2024-11-17T01:28:31.000Z",
  "#sort_index": "1857958965893599705",
  "view_count": 23890,
  "quote_count": 2,
  "is_quote_tweet": false,
  "is_retweet": false,
  "is_pinned": false,
  "is_truncated": false,
  "startUrl": "https://x.com/emollick/status/1857958965893599705"
}