PDFs look clean to humans. Extracting them is a mess. Research papers and reports are packed with tables, figures, captions, and complex layouts. Basic PDF extractors usually weren’t built to deal ...