🐦 Twitter Post Details

Viewing enriched Twitter post

@s_batzoglou

I tried GPT o1 in three simple Hilbert-style logical derivations. It failed in two of them, with one failure after repeated attempts. Here is an example of a failed attempt to show (not phi => phi) => phi. Step 7 is wrong, as can be easily observed by doing the substitutinos that o1 claims. And a failed attempt to show (not not phi => phi), where Step 5 is obviously wrong. Here is the prompt I tried. If someone has o1 pro, let me know if it succeeds. @DeryaTR_ @OpenAI

View on Twitter

📊 Media Metadata

{
  "media": [
    {
      "id": "",
      "type": "photo",
      "url": null,
      "media_url": "https://pbs.twimg.com/media/GeSAoSPXwAAvOoa.png",
      "media_url_https": null,
      "display_url": null,
      "expanded_url": null
    },
    {
      "id": "",
      "type": "photo",
      "url": null,
      "media_url": "https://pbs.twimg.com/media/GeSBSEkWwAA9U8V.png",
      "media_url_https": null,
      "display_url": null,
      "expanded_url": null
    },
    {
      "id": "",
      "type": "photo",
      "url": null,
      "media_url": "https://pbs.twimg.com/media/GeSBxB_WsAEzShc.png",
      "media_url_https": null,
      "display_url": null,
      "expanded_url": null
    }
  ],
  "nlp": {
    "sentiment": "negative",
    "processed_at": "2025-08-06T12:57:16.000300"
  },
  "original_structure": "had_media_only"
}

🔧 Raw API Response

{
  "user": {
    "created_at": "2022-04-25T23:37:36.000Z",
    "default_profile_image": false,
    "description": "Genomics-computation-ML-biotech-foundations of math-philosophy of mind; CDO @seer_bio; former prof @StanfordAILab; cofounder @dnanexus; opinions entirely my own",
    "fast_followers_count": 0,
    "favourites_count": 27107,
    "followers_count": 2443,
    "friends_count": 737,
    "has_custom_timelines": true,
    "is_translator": false,
    "listed_count": 28,
    "location": "San Francisco and Miami",
    "media_count": 260,
    "name": "Serafim Batzoglou",
    "normal_followers_count": 2443,
    "possibly_sensitive": false,
    "profile_banner_url": "https://pbs.twimg.com/profile_banners/1518735949458378752/1731329061",
    "profile_image_url_https": "https://pbs.twimg.com/profile_images/1518736918527152128/hV7H_k58_normal.jpg",
    "screen_name": "s_batzoglou",
    "statuses_count": 5122,
    "translator_type": "none",
    "verified": true,
    "withheld_in_countries": [],
    "id_str": "1518735949458378752"
  },
  "id": "1865758903666954597",
  "conversation_id": "1865758903666954597",
  "full_text": "I tried GPT o1 in three simple Hilbert-style logical derivations. It failed in two of them, with one failure after repeated attempts. \n\nHere is an example of a failed attempt to show (not phi => phi) => phi. Step 7 is wrong, as can be easily observed by doing the substitutinos that o1 claims. And a failed attempt to show (not not phi => phi), where Step 5 is obviously wrong.\n\nHere is the prompt I tried. If someone has o1 pro, let me know if it succeeds. @DeryaTR_ @OpenAI",
  "reply_count": 3,
  "retweet_count": 0,
  "favorite_count": 13,
  "hashtags": [],
  "symbols": [],
  "user_mentions": [],
  "urls": [],
  "media": [
    {
      "media_url": "https://pbs.twimg.com/media/GeSAoSPXwAAvOoa.png",
      "type": "photo"
    },
    {
      "media_url": "https://pbs.twimg.com/media/GeSBSEkWwAA9U8V.png",
      "type": "photo"
    },
    {
      "media_url": "https://pbs.twimg.com/media/GeSBxB_WsAEzShc.png",
      "type": "photo"
    }
  ],
  "url": "https://twitter.com/s_batzoglou/status/1865758903666954597",
  "created_at": "2024-12-08T14:02:41.000Z",
  "#sort_index": "1865758903666954597",
  "view_count": 1546,
  "quote_count": 0,
  "is_quote_tweet": false,
  "is_retweet": false,
  "is_pinned": false,
  "is_truncated": true,
  "startUrl": "https://x.com/s_batzoglou/status/1865758903666954597"
}