🐦 Twitter Post Details

Viewing enriched Twitter post

@hasanunlu9

After 8+ years on the Tesla Autopilot team and 3 years at Intel, I started @apexcompute to design a new architecture for efficient AI inference. For the past 9 months, we’ve been building our custom inference accelerator. Today we’re releasing Unified Engine v1. Last June we raised our seed round with @maxitechinc , DeepFin Research, @Soma_Capital and an incredible group of angel investors. In less than 9 months, we completed our RTL architecture and brought our first pre-silicon prototype to life on FPGA. Our architecture combines systolic array and vector processing in a single compute engine with multiple architectural optimizations, achieving very high FLOPs utilization. A single engine is super lean and it uses less than 90K LUTs and 1 MB Block RAM. It may also be one of the smallest logic-footprint compute engines developed so far. Our Unified Engine v1 supports: -matrix-matrix multiplication (~95% FLOPs utilization) -softmax (~90% FLOPs utilization) -broadcast and element-wise operations -RMSNorm / LayerNorm -block quantization/dequantization (fp4, int4) -multi-engine synchronization and many other operations. We even implemented memory-efficient attention similar to FlashAttention, reaching ~90% FLOP utilization. Full benchmarks and the software stack are available on our GitHub: https://t.co/KqTKbB2Inl We have basic compiler written in Python and it supports PyTorch tensors directly to easily test and transfer tensors between the accelerator and host using bf16, fp4 and int4 formats. Our FPGA prototype can already run LLM inference and outperform NVIDIA Jetson Orin Nano, even on a mid-tier FPGA setup (6.4x lower memory bandwidth, 18% slower clock speed at 4.5 Watts). Check the side-by-side comparison video below. Our GitHub includes low-level operator implementations, examples for tiled matrix multiplication, operation chaining, tensor parallelism, attention kernel and a full Gemma 3 1B model implementation. Many more models(Vision Transformers and VLA) are coming soon. Our accelerator IP is AXI-ready for deployment on any AMD(Xilinx) FPGA platform today. Even better, our two-engine prototype runs on an entry-level AMD(Xilinx) FPGA as a PCIe accelerator card. You can purchase it here https://t.co/8B9NOcueVu for $50 to experiment our pre-silicon prototype on your desktop PC or Raspberry Pi 5. We will be releasing hardware bitstream updates as the architecture gets new features. More to come soon! We are expanding our team and looking for compiler engineers and floating-point hardware design engineers. If you're interested, please send me a DM.

View on Twitter

📊 Media Metadata

{
  "media": [
    {
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2032080166504317103/media_0.mp4",
      "media_url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2032080166504317103/media_0.mp4",
      "type": "video",
      "filename": "media_0.mp4"
    },
    {
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2032080166504317103/media_1.jpg",
      "media_url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2032080166504317103/media_1.jpg",
      "type": "photo",
      "filename": "media_1.jpg"
    }
  ],
  "processed_at": "2026-03-12T15:07:17.004726",
  "pipeline_version": "2.0"
}

🔧 Raw API Response

{
  "type": "tweet",
  "id": "2032080166504317103",
  "url": "https://x.com/hasanunlu9/status/2032080166504317103",
  "twitterUrl": "https://twitter.com/hasanunlu9/status/2032080166504317103",
  "text": "After 8+ years on the Tesla Autopilot team and 3 years at Intel, I started @apexcompute to  design a new architecture for efficient AI inference. For the past 9  months, we’ve been building our custom inference accelerator. Today  we’re releasing Unified Engine v1. Last June we raised our seed round  with @maxitechinc , DeepFin Research, @Soma_Capital and  an incredible group of angel investors. In less than 9 months, we  completed our RTL architecture and brought our first pre-silicon  prototype to life on FPGA.\n\nOur  architecture combines systolic array and vector processing in a single  compute engine with multiple architectural optimizations, achieving very  high FLOPs utilization. A single engine is super lean and it uses less  than 90K LUTs and 1 MB Block RAM. It may also be one of the smallest  logic-footprint compute engines developed so far.\nOur Unified Engine v1 supports:\n-matrix-matrix multiplication (~95% FLOPs utilization)\n-softmax (~90% FLOPs utilization)\n-broadcast and element-wise operations\n-RMSNorm / LayerNorm\n-block quantization/dequantization (fp4, int4)\n-multi-engine synchronization and many other operations.\nWe  even implemented memory-efficient attention similar to FlashAttention,  reaching ~90% FLOP utilization. Full benchmarks and the software stack  are available on our GitHub: https://t.co/KqTKbB2Inl We have basic compiler written in Python and it supports PyTorch tensors  directly to easily test and transfer tensors between the accelerator and  host using bf16, fp4 and int4 formats.\n\nOur  FPGA prototype can already run LLM inference and outperform NVIDIA  Jetson Orin Nano, even on a mid-tier FPGA setup (6.4x lower memory  bandwidth, 18% slower clock speed at 4.5 Watts). Check the side-by-side  comparison video below.\n\nOur GitHub  includes low-level operator implementations, examples for tiled matrix  multiplication, operation chaining, tensor parallelism, attention kernel  and a full Gemma 3 1B model implementation. Many more models(Vision  Transformers and VLA) are coming soon.\n\nOur accelerator IP is AXI-ready for deployment on any AMD(Xilinx) FPGA platform today.\n\nEven  better, our two-engine prototype runs on an entry-level AMD(Xilinx)  FPGA as a PCIe accelerator card. You can purchase it here https://t.co/8B9NOcueVu for  $50 to experiment our pre-silicon prototype on your desktop PC or  Raspberry Pi 5. We will be releasing hardware bitstream updates as the  architecture gets new features.\n\nMore to come soon!\n\nWe are expanding our team and looking for compiler engineers and floating-point hardware design engineers. If you're interested, please send me a DM.",
  "source": "Twitter for iPhone",
  "retweetCount": 7,
  "replyCount": 4,
  "likeCount": 47,
  "quoteCount": 3,
  "viewCount": 2254,
  "createdAt": "Thu Mar 12 13:04:00 +0000 2026",
  "lang": "en",
  "bookmarkCount": 12,
  "isReply": false,
  "inReplyToId": null,
  "conversationId": "2032080166504317103",
  "displayTextRange": [
    0,
    278
  ],
  "inReplyToUserId": null,
  "inReplyToUsername": null,
  "author": {
    "type": "user",
    "userName": "hasanunlu9",
    "url": "https://x.com/hasanunlu9",
    "twitterUrl": "https://twitter.com/hasanunlu9",
    "id": "621120184",
    "name": "Hasan",
    "isVerified": false,
    "isBlueVerified": true,
    "verifiedType": null,
    "profilePicture": "https://pbs.twimg.com/profile_images/1776873288217862144/LB88uL6m_normal.jpg",
    "coverPicture": "https://pbs.twimg.com/profile_banners/621120184/1719722847",
    "description": "",
    "location": "Los Altos, California",
    "followers": 1989,
    "following": 911,
    "status": "",
    "canDm": true,
    "canMediaTag": true,
    "createdAt": "Thu Jun 28 16:21:31 +0000 2012",
    "entities": {
      "description": {
        "urls": []
      },
      "url": {}
    },
    "fastFollowersCount": 0,
    "favouritesCount": 11312,
    "hasCustomTimelines": true,
    "isTranslator": false,
    "mediaCount": 10,
    "statusesCount": 55,
    "withheldInCountries": [],
    "affiliatesHighlightedLabel": {},
    "possiblySensitive": false,
    "pinnedTweetIds": [],
    "profile_bio": {
      "description": "Working on efficient TPU, founder @apexcompute, prev Autopilot @Tesla_AI",
      "entities": {
        "description": {
          "hashtags": [],
          "symbols": [],
          "urls": [],
          "user_mentions": [
            {
              "id_str": "0",
              "indices": [
                34,
                46
              ],
              "name": "",
              "screen_name": "apexcompute"
            },
            {
              "id_str": "0",
              "indices": [
                63,
                72
              ],
              "name": "",
              "screen_name": "Tesla_AI"
            }
          ]
        },
        "url": {
          "urls": [
            {
              "display_url": "hasanunlu.com",
              "expanded_url": "http://hasanunlu.com",
              "indices": [
                0,
                23
              ],
              "url": "https://t.co/lLc9HfUnzD"
            }
          ]
        }
      }
    },
    "isAutomated": false,
    "automatedBy": null
  },
  "extendedEntities": {
    "media": [
      {
        "additional_media_info": {
          "monetizable": false
        },
        "display_url": "pic.twitter.com/ES5LK0xcM6",
        "expanded_url": "https://twitter.com/hasanunlu9/status/2032080166504317103/video/1",
        "ext_media_availability": {
          "status": "Available"
        },
        "id_str": "2031982222446903306",
        "indices": [
          279,
          302
        ],
        "media_key": "13_2031982222446903306",
        "media_results": {
          "id": "QXBpTWVkaWFSZXN1bHRzOgwABAoAARwzDRC+20AKAAA=",
          "result": {
            "__typename": "ApiMedia",
            "id": "QXBpTWVkaWE6DAAECgABHDMNEL7bQAoAAA==",
            "media_key": "13_2031982222446903306"
          }
        },
        "media_url_https": "https://pbs.twimg.com/amplify_video_thumb/2031982222446903306/img/jQpEw3PSvFR-Q4Er.jpg",
        "original_info": {
          "focus_rects": [],
          "height": 1080,
          "width": 1920
        },
        "sizes": {
          "large": {
            "h": 1080,
            "w": 1920
          }
        },
        "type": "video",
        "url": "https://t.co/ES5LK0xcM6",
        "video_info": {
          "aspect_ratio": [
            16,
            9
          ],
          "duration_millis": 77376,
          "variants": [
            {
              "content_type": "application/x-mpegURL",
              "url": "https://video.twimg.com/amplify_video/2031982222446903306/pl/NuUDMaDxkFW3Rrhi.m3u8?tag=21&v=cfc"
            },
            {
              "bitrate": 256000,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/amplify_video/2031982222446903306/vid/avc1/480x270/aCejin5-XIXfaMMX.mp4?tag=21"
            },
            {
              "bitrate": 832000,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/amplify_video/2031982222446903306/vid/avc1/640x360/eYdfa2CEiXLZLBhL.mp4?tag=21"
            },
            {
              "bitrate": 2176000,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/amplify_video/2031982222446903306/vid/avc1/1280x720/OKd8yxdv3ivtOW_1.mp4?tag=21"
            },
            {
              "bitrate": 10368000,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/amplify_video/2031982222446903306/vid/avc1/1920x1080/6RyPYoUG2sNazS3K.mp4?tag=21"
            }
          ]
        }
      }
    ]
  },
  "card": null,
  "place": {},
  "entities": {
    "hashtags": [],
    "symbols": [],
    "timestamps": [],
    "urls": [
      {
        "display_url": "github.com/apex-compute/u…",
        "expanded_url": "https://github.com/apex-compute/unified-engine",
        "indices": [
          1325,
          1348
        ],
        "url": "https://t.co/KqTKbB2Inl"
      },
      {
        "display_url": "buy.stripe.com/6oUaEQf6365bgA…",
        "expanded_url": "https://buy.stripe.com/6oUaEQf6365bgAt0QHds401",
        "indices": [
          2264,
          2287
        ],
        "url": "https://t.co/8B9NOcueVu"
      }
    ],
    "user_mentions": [
      {
        "id_str": "1884704243590062081",
        "indices": [
          75,
          87
        ],
        "name": "Apex Compute",
        "screen_name": "apexcompute"
      },
      {
        "id_str": "722467812770504704",
        "indices": [
          307,
          319
        ],
        "name": "Maxitech",
        "screen_name": "maxitechinc"
      },
      {
        "id_str": "1243264845308403712",
        "indices": [
          340,
          353
        ],
        "name": "Soma Capital",
        "screen_name": "Soma_Capital"
      }
    ]
  },
  "quoted_tweet": null,
  "retweeted_tweet": null,
  "isLimitedReply": false,
  "article": null
}