Department of Information Systems

PDF Analysis Tools

Authors:Burget Radek
Licence:required - no fee
Keywords:document analysis, PDF
A set of utilities for advanced PDF document analysis. Unlike the existing PDF to HTML convertors that focus on obtaining a DOM or HTML representation of the document that is visually as close as possible to the original document, the goal of the PDF Analysis Tools is to produce an output document that has the same logical stucture. For this purpose, the tools implement different algorithms for detecting common graphical patterns in the source PDF document that can be represented by some standard HTML elements and CSS constructions. The resulting document may not display exactly as the source PDF but it is more suitable for further analysis and/or editing.
Research groups:
Licence terms:
Free software under the terms of the GNU GPL license.

Your IPv4 address:
Switch to IPv6 connection

DNSSEC [dnssec]