@_inesmontani
The first version of my spaCy + Docling integration is here:
- process PDFs, Word documents & more
- structured text-based output via @spacy_io's Doc
- layout spans for sections, headings etc.
- apply NLP pipelines to PDFs
- chunk your data for RAG
https://t.co/PNbGNNrom1
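
To make the "layout spans" and "chunk your data for RAG" points concrete, here is a minimal sketch of heading-based chunking. It does not use the integration's actual API: plain `(label, text)` tuples stand in for layout spans, and the `section_header` label, the `Chunk` class and the `chunk_by_heading` function are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Chunk:
    """One retrieval chunk: a heading plus the text blocks that follow it."""
    heading: str
    body: list[str] = field(default_factory=list)

    @property
    def text(self) -> str:
        # Join heading and body with newlines; strip handles an empty heading.
        return "\n".join([self.heading, *self.body]).strip()


def chunk_by_heading(spans, heading_label="section_header"):
    """Group (label, text) pairs into chunks, starting a new chunk at each heading.

    `spans` is a stand-in for layout spans (assumption: each span has a label
    and some text). Text before the first heading goes into a chunk with an
    empty heading.
    """
    chunks = []
    current = Chunk(heading="")
    for label, text in spans:
        if label == heading_label:
            # Close the previous chunk if it holds anything, then start a new one.
            if current.heading or current.body:
                chunks.append(current)
            current = Chunk(heading=text)
        else:
            current.body.append(text)
    if current.heading or current.body:
        chunks.append(current)
    return chunks


# Hypothetical layout spans, as they might come out of a parsed document.
spans = [
    ("section_header", "Introduction"),
    ("text", "spaCy processes text."),
    ("text", "Docling parses documents."),
    ("section_header", "Usage"),
    ("text", "Run the pipeline on a PDF."),
]
chunks = chunk_by_heading(spans)
for chunk in chunks:
    print(repr(chunk.text))
```

Splitting at headings keeps each chunk aligned with one section of the document, which tends to suit retrieval better than fixed-size windows that cut across sections.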