pdf2dom

Pdf2Dom is a PDF parser that converts documents to an HTML DOM representation. The resulting DOM tree can then be serialized to an HTML file or processed further. Inline CSS definitions in the resulting document make the HTML page look as similar as possible to the PDF input. The distribution package includes a command-line utility for converting PDF documents to HTML. Pdf2Dom can also be used as an independent Java library with a standard DOM interface for DOM-based applications, or as an alternative parser for the CSSBox rendering engine, adding PDF processing capability to CSSBox.
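Because the parser output is described as a standard DOM tree, the serialization step can be sketched with nothing but the JDK's own XML APIs. The example below is only an illustration under stated assumptions: the class `DomToHtml`, its methods, and the hand-built sample document are hypothetical stand-ins, and no Pdf2Dom classes are called. In real use the `org.w3c.dom.Document` would come from the library rather than from `sampleDocument()`.

```java
import java.io.StringWriter;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

public class DomToHtml {

    /** Serializes any W3C DOM document to an HTML string using the JDK transformer. */
    public static String serialize(Document dom) throws Exception {
        Transformer transformer = TransformerFactory.newInstance().newTransformer();
        transformer.setOutputProperty(OutputKeys.METHOD, "html");
        transformer.setOutputProperty(OutputKeys.ENCODING, "UTF-8");
        StringWriter out = new StringWriter();
        transformer.transform(new DOMSource(dom), new StreamResult(out));
        return out.toString();
    }

    /**
     * Hypothetical stand-in for a parsed PDF page: one absolutely positioned
     * div styled with inline CSS. This is NOT produced by Pdf2Dom.
     */
    public static Document sampleDocument() throws Exception {
        Document dom = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .newDocument();
        Element html = dom.createElement("html");
        Element body = dom.createElement("body");
        Element div = dom.createElement("div");
        div.setAttribute("style", "position:absolute;left:72pt;top:90pt");
        div.setTextContent("Hello from page 1");
        body.appendChild(div);
        html.appendChild(body);
        dom.appendChild(html);
        return dom;
    }

    public static void main(String[] args) throws Exception {
        Document dom = sampleDocument();
        // "Further processing" through the standard DOM interface:
        System.out.println("div elements: " + dom.getElementsByTagName("div").getLength());
        // Serialization to HTML text, which could equally be written to a file:
        System.out.println(serialize(dom));
    }
}
```

The same `serialize` method should work on any `Document`, so swapping the sample tree for one produced by a PDF parser would not require changes to the serialization code.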
http://cssbox.sourceforge.net/pdf2dom
GNU Lesser General Public License 3.0
Radek Burget
Files
pdf2dom-2.0.3.jar
pdf2dom-2.0.3.pom
pdf2dom-2.0.3-sources.jar
Apache Maven
<dependency>
  <groupId>net.sf.cssbox</groupId>
  <artifactId>pdf2dom</artifactId>
  <version>2.0.3</version>
</dependency>
Gradle Groovy
implementation 'net.sf.cssbox:pdf2dom:2.0.3'
Gradle Kotlin
implementation("net.sf.cssbox:pdf2dom:2.0.3")
Scala SBT
libraryDependencies += "net.sf.cssbox" % "pdf2dom" % "2.0.3"
Groovy Grape
@Grapes(
  @Grab(group='net.sf.cssbox', module='pdf2dom', version='2.0.3')
)
Apache Ivy
<dependency org="net.sf.cssbox" name="pdf2dom" rev="2.0.3" />
Leiningen
[net.sf.cssbox/pdf2dom "2.0.3"]
Apache Buildr
'net.sf.cssbox:pdf2dom:jar:2.0.3'