Process documents at scale: PDFs, DOCX, HTML, images. Extract structured data via Unstructured.io.