@HKydlicek
The final step of the FinePDFs saga is here! The FinePDFs 📃 BOOK We put everything we know about PDFs inside: - How to make the SoTA PDFs dataset? - How much old internet is dead now? - Why we chose RolmOCR for OCR - What is https://t.co/i3PivBI9hh And many more🤗 https://t.co/m8mC0Xjksc