Working with PDFs: The best tools for extracting text, tables and images

Lan Chu
Level Up Coding
Published in
9 min readApr 30, 2024

--

Lots of information comes from text data, for example in PDF documents. Handling PDFs can be particularly challenging, especially with tables and images.

Photo by Jonathan Simcoe on Unsplash

If you work with single modality language model, then you probably already know that it doesn’t have the ability to directly interpret or “read” documents. It is only capable of handling…

--

--