🐦 Twitter Post Details

Viewing enriched Twitter post

@jerryjliu0

Text splitting is a crucial component of setting up an ETL pipeline for your LLM/RAG app. But you can do way more than split in a flat list! ✨Our brand-new @llama_index parser allows you to *hierarchically* parse a data graph of text and tables, letting you model/query both unstructured and tabular data in the same document 🔥 ✅ Structured Table Summarization: We use LLMs to extract a structured summary + schema from each unformatted table. ✅ Hierarchical Node References: Have each summary link to the table. Plug into recursive retrieval. Built on top of @UnstructuredIO 🙌. Previously, we had very custom, involved notebook tutorials showing how you can parse out tables from SEC filings. Now you can parse out a table/text data graph in 5 lines of code 🔥 https://t.co/Q7JMzELKU4

View on Twitter

🔧 Raw API Response

{
  "user": {
    "created_at": "2011-09-07T22:54:31.000Z",
    "default_profile_image": false,
    "description": "co-founder/CEO @llama_index\n\nEx-ML @robusthq,  AI research @Uber_ATG, ML Eng @Quora, @princeton",
    "fast_followers_count": 0,
    "favourites_count": 4062,
    "followers_count": 24950,
    "friends_count": 1174,
    "has_custom_timelines": true,
    "is_translator": false,
    "listed_count": 633,
    "location": "",
    "media_count": 608,
    "name": "Jerry Liu",
    "normal_followers_count": 24950,
    "possibly_sensitive": false,
    "profile_image_url_https": "https://pbs.twimg.com/profile_images/1283610285031460864/1Q4zYhtb_normal.jpg",
    "screen_name": "jerryjliu0",
    "statuses_count": 2796,
    "translator_type": "none",
    "url": "https://t.co/S7FkTSefQ0",
    "verified": false,
    "withheld_in_countries": [],
    "id_str": "369777416"
  },
  "id": "1711768455429722613",
  "conversation_id": "1711768455429722613",
  "full_text": "Text splitting is a crucial component of setting up an ETL pipeline for your LLM/RAG app. But you can do way more than split in a flat list!\n\n✨Our brand-new @llama_index parser allows you to *hierarchically* parse a data graph of text and tables, letting you model/query both unstructured and tabular data in the same document 🔥\n\n✅ Structured Table Summarization: We use LLMs to extract a structured summary + schema from each unformatted table.\n✅ Hierarchical Node References: Have each summary link to the table. Plug into recursive retrieval.\n\nBuilt on top of @UnstructuredIO 🙌. Previously, we had very custom, involved notebook tutorials showing how you can parse out tables from SEC filings. Now you can parse out a table/text data graph in 5 lines of code 🔥\n\nhttps://t.co/Q7JMzELKU4",
  "reply_count": 5,
  "retweet_count": 50,
  "favorite_count": 269,
  "hashtags": [],
  "symbols": [],
  "user_mentions": [
    {
      "id_str": "1604278358296055808",
      "name": "LlamaIndex 🦙",
      "screen_name": "llama_index",
      "profile": "https://twitter.com/llama_index"
    }
  ],
  "urls": [],
  "media": [
    {
      "media_url": "https://pbs.twimg.com/media/F8FsE1Xa0AARqR9.png",
      "type": "photo"
    }
  ],
  "url": "https://twitter.com/jerryjliu0/status/1711768455429722613",
  "created_at": "2023-10-10T15:39:37.000Z",
  "#sort_index": "1711768455429722613",
  "view_count": 47846,
  "quote_count": 2,
  "is_quote_tweet": false,
  "is_retweet": false,
  "is_pinned": false,
  "is_truncated": true,
  "startUrl": "https://twitter.com/jerryjliu0/status/1711768455429722613"
}