@jerryjliu0
Text splitting is a crucial component of setting up an ETL pipeline for your LLM/RAG app. But you can do way more than split in a flat list! ✨Our brand-new @llama_index parser allows you to *hierarchically* parse a data graph of text and tables, letting you model/query both unstructured and tabular data in the same document 🔥 ✅ Structured Table Summarization: We use LLMs to extract a structured summary + schema from each unformatted table. ✅ Hierarchical Node References: Have each summary link to the table. Plug into recursive retrieval. Built on top of @UnstructuredIO 🙌. Previously, we had very custom, involved notebook tutorials showing how you can parse out tables from SEC filings. Now you can parse out a table/text data graph in 5 lines of code 🔥 https://t.co/Q7JMzELKU4