🐦 Twitter Post Details

Viewing enriched Twitter post

@AndrewYNg

New course: Efficient Inference with SGLang: Text and Image Generation, built in partnership with LMSys @lmsysorg and RadixArk @radixark, and taught by Richard Chen @richardczl, a Member of Technical Staff at RadixArk. Running LLMs in production is expensive, and much of that cost comes from redundant computation. This short course teaches you to eliminate that waste using SGLang, an open-source inference framework that caches computation already done and reuses it across future requests. When ten users share the same system prompt, SGLang processes it once, not ten times. The speedups compound quickly, especially when there's a lot of shared context across requests. Skills you'll gain: - Implement a KV cache from scratch to eliminate redundant computation within a single request - Scale caching across users and requests with RadixAttention, so shared context is only processed once - Accelerate image generation with diffusion models using SGLang's caching and multi-GPU parallelism Join and learn to make LLM inference faster and more cost-efficient at scale! https://t.co/vUiu6goWCO

View on Twitter

📊 Media Metadata

{
  "media": [
    {
      "url": "https://video.twimg.com/amplify_video/2042286068570157056/vid/avc1/1920x1080/3Xv-N_YHDs5DJk_T.mp4?tag=21",
      "type": "video"
    }
  ],
  "processed_at": "2026-04-09T17:21:29.110394",
  "pipeline_version": "2.0"
}

🔧 Raw API Response

{
  "type": "tweet",
  "id": "2042289428702642588",
  "url": "https://x.com/AndrewYNg/status/2042289428702642588",
  "twitterUrl": "https://twitter.com/AndrewYNg/status/2042289428702642588",
  "text": "New course: Efficient Inference with SGLang: Text and Image Generation, built in partnership with LMSys @lmsysorg and RadixArk @radixark, and taught by Richard Chen @richardczl, a Member of Technical Staff at RadixArk.\n\nRunning LLMs in production is expensive, and much of that cost comes from redundant computation. This short course teaches you to eliminate that waste using SGLang, an open-source inference framework that caches computation already done and reuses it across future requests.\n\nWhen ten users share the same system prompt, SGLang processes it once, not ten times. The speedups compound quickly, especially when there's a lot of shared context across requests.\n\nSkills you'll gain:\n- Implement a KV cache from scratch to eliminate redundant computation within a single request\n- Scale caching across users and requests with RadixAttention, so shared context is only processed once\n- Accelerate image generation with diffusion models using SGLang's caching and multi-GPU parallelism\n\nJoin and learn to make LLM inference faster and more cost-efficient at scale!\nhttps://t.co/vUiu6goWCO",
  "source": "Twitter for iPhone",
  "retweetCount": 4,
  "replyCount": 7,
  "likeCount": 30,
  "quoteCount": 0,
  "viewCount": 2071,
  "createdAt": "Thu Apr 09 17:11:58 +0000 2026",
  "lang": "en",
  "bookmarkCount": 8,
  "isReply": false,
  "inReplyToId": null,
  "conversationId": "2042289428702642588",
  "displayTextRange": [
    0,
    277
  ],
  "inReplyToUserId": null,
  "inReplyToUsername": null,
  "author": {
    "type": "user",
    "userName": "AndrewYNg",
    "url": "https://x.com/AndrewYNg",
    "twitterUrl": "https://twitter.com/AndrewYNg",
    "id": "216939636",
    "name": "Andrew Ng",
    "isVerified": false,
    "isBlueVerified": true,
    "verifiedType": null,
    "profilePicture": "https://pbs.twimg.com/profile_images/733174243714682880/oyG30NEH_normal.jpg",
    "coverPicture": "https://pbs.twimg.com/profile_banners/216939636/1483126470",
    "description": "",
    "location": "Palo Alto, CA",
    "followers": 1446312,
    "following": 1065,
    "status": "",
    "canDm": false,
    "canMediaTag": true,
    "createdAt": "Thu Nov 18 03:39:11 +0000 2010",
    "entities": {
      "description": {
        "urls": []
      },
      "url": {}
    },
    "fastFollowersCount": 0,
    "favouritesCount": 1733,
    "hasCustomTimelines": true,
    "isTranslator": false,
    "mediaCount": 457,
    "statusesCount": 1990,
    "withheldInCountries": [],
    "affiliatesHighlightedLabel": {},
    "possiblySensitive": false,
    "pinnedTweetIds": [
      "2008956639894786402"
    ],
    "profile_bio": {
      "description": "Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs",
      "entities": {
        "description": {
          "hashtags": [
            {
              "indices": [
                97,
                100
              ],
              "text": "ai"
            },
            {
              "indices": [
                101,
                117
              ],
              "text": "machinelearning"
            },
            {
              "indices": [
                119,
                132
              ],
              "text": "deeplearning"
            },
            {
              "indices": [
                133,
                139
              ],
              "text": "MOOCs"
            }
          ],
          "symbols": [],
          "urls": [],
          "user_mentions": []
        },
        "url": {
          "urls": [
            {
              "display_url": "andrewng.org",
              "expanded_url": "http://www.andrewng.org",
              "indices": [
                0,
                23
              ],
              "url": "https://t.co/XidcMETENd"
            }
          ]
        }
      }
    },
    "isAutomated": false,
    "automatedBy": null
  },
  "extendedEntities": {
    "media": [
      {
        "additional_media_info": {
          "monetizable": true
        },
        "allow_download_status": {
          "allow_download": true
        },
        "display_url": "pic.twitter.com/baiT6LKDYY",
        "expanded_url": "https://twitter.com/AndrewYNg/status/2042289428702642588/video/1",
        "ext_media_availability": {
          "status": "Available"
        },
        "id_str": "2042286068570157056",
        "indices": [
          278,
          301
        ],
        "media_key": "13_2042286068570157056",
        "media_results": {
          "id": "QXBpTWVkaWFSZXN1bHRzOgwABAoAARxXqFve2nAAAAA=",
          "result": {
            "__typename": "ApiMedia",
            "id": "QXBpTWVkaWE6DAAECgABHFeoW97acAAAAA==",
            "media_key": "13_2042286068570157056"
          }
        },
        "media_url_https": "https://pbs.twimg.com/amplify_video_thumb/2042286068570157056/img/MBqNQhXj9-YOyRn8.jpg",
        "original_info": {
          "focus_rects": [],
          "height": 1080,
          "width": 1920
        },
        "sizes": {
          "large": {
            "h": 1080,
            "w": 1920
          }
        },
        "type": "video",
        "url": "https://t.co/baiT6LKDYY",
        "video_info": {
          "aspect_ratio": [
            16,
            9
          ],
          "duration_millis": 111377,
          "variants": [
            {
              "content_type": "application/x-mpegURL",
              "url": "https://video.twimg.com/amplify_video/2042286068570157056/pl/CaVlECI2XSv15yoU.m3u8?tag=21&v=16d"
            },
            {
              "bitrate": 256000,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/amplify_video/2042286068570157056/vid/avc1/480x270/8oGwp_f0XzMIMvi3.mp4?tag=21"
            },
            {
              "bitrate": 832000,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/amplify_video/2042286068570157056/vid/avc1/640x360/A6K0upbd71lDk_W8.mp4?tag=21"
            },
            {
              "bitrate": 2176000,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/amplify_video/2042286068570157056/vid/avc1/1280x720/fu13lVdVtoO_7d_K.mp4?tag=21"
            },
            {
              "bitrate": 10368000,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/amplify_video/2042286068570157056/vid/avc1/1920x1080/3Xv-N_YHDs5DJk_T.mp4?tag=21"
            }
          ]
        }
      }
    ]
  },
  "card": null,
  "place": {},
  "entities": {
    "hashtags": [],
    "symbols": [],
    "timestamps": [],
    "urls": [
      {
        "display_url": "deeplearning.ai/short-courses/…",
        "expanded_url": "https://www.deeplearning.ai/short-courses/efficient-inference-with-sglang-text-and-image-generation",
        "indices": [
          1078,
          1101
        ],
        "url": "https://t.co/vUiu6goWCO"
      }
    ],
    "user_mentions": [
      {
        "id_str": "1822588444046249984",
        "indices": [
          104,
          113
        ],
        "name": "LMSYS Org",
        "screen_name": "lmsysorg"
      },
      {
        "id_str": "1994288895320838149",
        "indices": [
          127,
          136
        ],
        "name": "RadixArk",
        "screen_name": "radixark"
      },
      {
        "id_str": "1524622862040018944",
        "indices": [
          165,
          176
        ],
        "name": "Richard Chen",
        "screen_name": "richardczl"
      }
    ]
  },
  "quoted_tweet": null,
  "retweeted_tweet": null,
  "isLimitedReply": false,
  "communityInfo": null,
  "article": null
}