pdf2dom

Pdf2Dom is a PDF parser that converts documents to an HTML DOM representation. The resulting DOM tree can then be serialized to an HTML file or processed further. Inline CSS definitions in the resulting document make the HTML page as similar as possible to the PDF input. A command-line utility for converting PDF documents to HTML is included in the distribution package. Pdf2Dom can also be used as an independent Java library with a standard DOM interface for your DOM-based applications, or as an alternative parser for the CSSBox rendering engine to add PDF processing capability to CSSBox.
http://cssbox.sourceforge.net/pdf2dom
GNU Lesser General Public License 3.0
Radek Burget
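The description above says the DOM tree can be serialized to an HTML file. The sketch below shows one way to do that step using only the standard JDK XML APIs. It does not call Pdf2Dom: the `sampleDom()` helper is a hypothetical stand-in that builds a tiny tree with an inline `style` attribute, similar in spirit to what a PDF-to-DOM conversion might produce. The class and method names are illustrative assumptions, not part of the library.

```java
import java.io.StringWriter;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

public class DomToHtml {

    // Hypothetical stand-in for parser output (assumption): a minimal
    // HTML-like DOM whose layout is carried by inline CSS.
    static Document sampleDom() throws ParserConfigurationException {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .newDocument();
        Element html = doc.createElement("html");
        Element body = doc.createElement("body");
        Element div = doc.createElement("div");
        div.setAttribute("style", "position:absolute;left:72pt;top:90pt;font-size:12pt");
        div.setTextContent("Hello from page 1");
        body.appendChild(div);
        html.appendChild(body);
        doc.appendChild(html);
        return doc;
    }

    // Serializes any org.w3c.dom.Document to an HTML string using the
    // JDK's identity transformer with the "html" output method.
    static String serialize(Document doc) throws TransformerException {
        Transformer transformer = TransformerFactory.newInstance().newTransformer();
        transformer.setOutputProperty(OutputKeys.METHOD, "html");
        transformer.setOutputProperty(OutputKeys.ENCODING, "UTF-8");
        StringWriter out = new StringWriter();
        transformer.transform(new DOMSource(doc), new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        // Print the serialized markup; redirect to a file to save it.
        System.out.println(serialize(sampleDom()));
    }
}
```

In real use, the `Document` passed to `serialize` would come from the Pdf2Dom parser instead of `sampleDom()`. Because the library exposes a standard DOM interface, the serialization step should carry over unchanged, but check the project documentation for the exact parser API.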
Files download
pdf2dom-1.6.jar
pdf2dom-1.6.pom
pdf2dom-1.6-sources.jar
Apache Maven
<dependency>
  <groupId>net.sf.cssbox</groupId>
  <artifactId>pdf2dom</artifactId>
  <version>1.6</version>
</dependency>
Gradle Groovy
implementation 'net.sf.cssbox:pdf2dom:1.6'
Gradle Kotlin
implementation("net.sf.cssbox:pdf2dom:1.6")
Scala SBT
libraryDependencies += "net.sf.cssbox" % "pdf2dom" % "1.6"
Groovy Grape
@Grapes(
  @Grab(group='net.sf.cssbox', module='pdf2dom', version='1.6')
)
Apache Ivy
<dependency org="net.sf.cssbox" name="pdf2dom" rev="1.6" />
Leiningen
[net.sf.cssbox/pdf2dom "1.6"]
Apache Buildr
'net.sf.cssbox:pdf2dom:jar:1.6'