p

pdf2dom

Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTML file or further processed. The inline CSS definitions contained in the resulting document are used for making the HTML page as similar as possible to the PDF input. A command-line utility for converting the PDF documents to HTML is included in the distribution package. Pdf2Dom may be also used as an independent Java library with a standard DOM interface for your DOM-based applications or as an alternative parser for the CSSBox rendering engine in order to add the PDF processing capability to CSSBox.
http://cssbox.sourceforge.net/pdf2dom
GNU Lesser General Public License 3.0
Radek Burget
Aggregated version Version Update time
2.0 2.0.3 Oct 17, 2022
2.0.2 Oct 12, 2022
2.0.1 Oct 19, 2021
2.0.0 Feb 01, 2021
1.9 1.9 Jan 03, 2020
1.8 1.8 May 27, 2019
1.7 1.7 Jan 30, 2018
1.6 1.6 Jul 24, 2016
1.5 1.5 Mar 23, 2016
1.4 1.4 Nov 06, 2015
1.3 1.3 Feb 19, 2015
1.2 1.2 Feb 17, 2014
12 Records