🐦 Twitter Post Details

Viewing enriched Twitter post

@DescriptApp

We taught our AI to watch videos. Which sounds either completely obvious or completely impossible, depending on how you think about it. Our engineering team spent months figuring out how to get AI beyond just transcribing what people say to actually understanding what's happening visually in footage. The result? You can now ask our video agent to find "the clip where someone writes physics equations on a chalkboard," and it'll actually find it. In seconds. A few things this unlocks: - No more naming files "video_5300.mp4" and hoping for the best - Auto B-roll insertion based on what you actually have, not just stock footage - Search across your entire media library for "that time someone clapped" But here's what's really interesting... We went from prototype to production in about 6 weeks. Not because we're geniuses, but because the foundational AI models are getting so good that the hard part isn't the tech anymore—it's figuring out what people actually want to do with it. Turns out, that's a lot weirder than we expected. Recent user requests include "insert a GIF every time someone sneezes" and "chop up this kid's baseball game into clips." The internet remains beautifully strange. What we're building toward isn't just faster video editing—it's the shift from needing to know software to needing to know what story you want to tell. Our engineering team explains the whole journey, including the part where they definitely did not get their cost estimates wrong. (They definitely did.)

Media 1

📊 Media Metadata

{
  "media": [
    {
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/1960778151845364028/media_0.jpg?",
      "media_url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/1960778151845364028/media_0.jpg?",
      "type": "photo",
      "filename": "media_0.jpg"
    }
  ],
  "processed_at": "2025-08-27T21:34:16.207012",
  "pipeline_version": "2.0"
}

🔧 Raw API Response

{
  "type": "tweet",
  "id": "1960778151845364028",
  "url": "https://x.com/DescriptApp/status/1960778151845364028",
  "twitterUrl": "https://twitter.com/DescriptApp/status/1960778151845364028",
  "text": "We taught our AI to watch videos.\n\nWhich sounds either completely obvious or completely impossible, depending on how you think about it.\n\nOur engineering team spent months figuring out how to get AI beyond just transcribing what people say to actually understanding what's happening visually in footage.\n\nThe result? You can now ask our video agent to find \"the clip where someone writes physics equations on a chalkboard,\" and it'll actually find it. In seconds.\n\nA few things this unlocks:\n\n- No more naming files \"video_5300.mp4\" and hoping for the best\n- Auto B-roll insertion based on what you actually have, not just stock footage\n- Search across your entire media library for \"that time someone clapped\"\n\nBut here's what's really interesting...\n\nWe went from prototype to production in about 6 weeks. Not because we're geniuses, but because the foundational AI models are getting so good that the hard part isn't the tech anymore—it's figuring out what people actually want to do with it.\n\nTurns out, that's a lot weirder than we expected. Recent user requests include \"insert a GIF every time someone sneezes\" and \"chop up this kid's baseball game into clips.\"\n\nThe internet remains beautifully strange.\n\nWhat we're building toward isn't just faster video editing—it's the shift from needing to know software to needing to know what story you want to tell.\n\nOur engineering team explains the whole journey, including the part where they definitely did not get their cost estimates wrong. (They definitely did.)",
  "source": "Twitter for iPhone",
  "retweetCount": 3,
  "replyCount": 4,
  "likeCount": 19,
  "quoteCount": 1,
  "viewCount": 4128,
  "createdAt": "Wed Aug 27 18:55:15 +0000 2025",
  "lang": "en",
  "bookmarkCount": 8,
  "isReply": false,
  "inReplyToId": null,
  "conversationId": "1960778151845364028",
  "displayTextRange": [
    0,
    273
  ],
  "inReplyToUserId": null,
  "inReplyToUsername": null,
  "author": {
    "type": "user",
    "userName": "DescriptApp",
    "url": "https://x.com/DescriptApp",
    "twitterUrl": "https://twitter.com/DescriptApp",
    "id": "892859500125757440",
    "name": "Descript",
    "isVerified": false,
    "isBlueVerified": false,
    "verifiedType": "Business",
    "profilePicture": "https://pbs.twimg.com/profile_images/1019643227672961024/VzsNrztT_normal.jpg",
    "coverPicture": "https://pbs.twimg.com/profile_banners/892859500125757440/1719942998",
    "description": "",
    "location": "",
    "followers": 30519,
    "following": 248,
    "status": "",
    "canDm": true,
    "canMediaTag": true,
    "createdAt": "Wed Aug 02 21:27:48 +0000 2017",
    "entities": {
      "description": {
        "urls": []
      },
      "url": {}
    },
    "fastFollowersCount": 0,
    "favouritesCount": 5879,
    "hasCustomTimelines": true,
    "isTranslator": false,
    "mediaCount": 699,
    "statusesCount": 4375,
    "withheldInCountries": [],
    "affiliatesHighlightedLabel": {},
    "possiblySensitive": false,
    "pinnedTweetIds": [
      "1950585228633968957"
    ],
    "profile_bio": {
      "description": "If you can edit text, you can make videos, podcasts, and clips. And Descript's AI makes it fast and easy. Try Underlord: https://t.co/e3stAQ7kY0",
      "entities": {
        "description": {
          "urls": [
            {
              "display_url": "descri.pt/underlord",
              "expanded_url": "http://descri.pt/underlord",
              "indices": [
                121,
                144
              ],
              "url": "https://t.co/e3stAQ7kY0"
            }
          ]
        },
        "url": {
          "urls": [
            {
              "display_url": "descri.pt/tw",
              "expanded_url": "https://descri.pt/tw",
              "indices": [
                0,
                23
              ],
              "url": "https://t.co/EgQTOMl8AP"
            }
          ]
        }
      }
    },
    "isAutomated": false,
    "automatedBy": null
  },
  "extendedEntities": {
    "media": [
      {
        "display_url": "pic.twitter.com/3QgPse7W3A",
        "expanded_url": "https://twitter.com/DescriptApp/status/1960778151845364028/photo/1",
        "ext_media_availability": {
          "status": "Available"
        },
        "features": {
          "large": {
            "faces": [
              {
                "h": 663,
                "w": 663,
                "x": 699,
                "y": 266
              }
            ]
          },
          "orig": {
            "faces": [
              {
                "h": 663,
                "w": 663,
                "x": 699,
                "y": 266
              }
            ]
          }
        },
        "id_str": "1960778048313196544",
        "indices": [
          274,
          297
        ],
        "media_key": "3_1960778048313196544",
        "media_results": {
          "id": "QXBpTWVkaWFSZXN1bHRzOgwAAQoAARs2FT8NF4AACgACGzYVVygXATwAAA==",
          "result": {
            "__typename": "ApiMedia",
            "id": "QXBpTWVkaWE6DAABCgABGzYVPw0XgAAKAAIbNhVXKBcBPAAA",
            "media_key": "3_1960778048313196544"
          }
        },
        "media_url_https": "https://pbs.twimg.com/media/GzYVPw0XgAAV-P8.jpg",
        "original_info": {
          "focus_rects": [
            {
              "h": 1075,
              "w": 1920,
              "x": 0,
              "y": 5
            },
            {
              "h": 1080,
              "w": 1080,
              "x": 468,
              "y": 0
            },
            {
              "h": 1080,
              "w": 947,
              "x": 535,
              "y": 0
            },
            {
              "h": 1080,
              "w": 540,
              "x": 738,
              "y": 0
            },
            {
              "h": 1080,
              "w": 1920,
              "x": 0,
              "y": 0
            }
          ],
          "height": 1080,
          "width": 1920
        },
        "sizes": {
          "large": {
            "h": 1080,
            "w": 1920
          }
        },
        "type": "photo",
        "url": "https://t.co/3QgPse7W3A"
      }
    ]
  },
  "card": null,
  "place": {},
  "entities": {},
  "quoted_tweet": null,
  "retweeted_tweet": null,
  "isLimitedReply": false,
  "article": null
}