@NielsRogge
New blog post: converting 30k @arxiv papers to Markdown using SOTA OCR models to enable chat with paper functionality Includes: > leveraging an open OCR model (Chandra 2 by @datalabto) > running on GPU infra - @huggingface Jobs > using Codex with a SKILL.md https://t.co/jrpin9oq5u