@llama_index
Multimodal RAG with Contextual Retrieval πΌοΈπ€ RAG over slide decks is hard. We first show you how to build a multimodal RAG pipeline over a slide deck to pre-extract and index the visual content on each slide, as both text and image chunks. π You can do this thanks to LlamaParse premium, which is now 4.5c per page! (Down from 7.5c per page π) We also add in contextual summaries to each slide using @AnthropicAI prompt caching + metadata generation. This helps ground each slide in the section itβs in! Check out our full cookbook combining both techniques: https://t.co/Mo0JUyxze3