p

pdf2dom

Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTML file or further processed. The inline CSS definitions contained in the resulting document are used for making the HTML page as similar as possible to the PDF input. A command-line utility for converting the PDF documents to HTML is included in the distribution package. Pdf2Dom may be also used as an independent Java library with a standard DOM interface for your DOM-based applications or as an alternative parser for the CSSBox rendering engine in order to add the PDF processing capability to CSSBox.
http://cssbox.sourceforge.net/pdf2dom
GNU Lesser General Public License 3.0
Radek Burget
Files download
File Operation
pdf2dom-1.5.jar download
pdf2dom-1.5.pom download
pdf2dom-1.5-sources.jar download
Apache Maven
<dependency>
  <groupId>net.sf.cssbox</groupId>
  <artifactId>pdf2dom</artifactId>
  <version>1.5</version>
</dependency>
Gradle Groovy
implementation 'net.sf.cssbox:pdf2dom:1.5'
Gradle Kotlin
implementation("net.sf.cssbox:pdf2dom:1.5")
Scala SBT
libraryDependencies += "net.sf.cssbox" % "pdf2dom" % "1.5"
Groovy Grape
@Grapes(
  @Grab(group='net.sf.cssbox', module='pdf2dom', version='1.5')
)
Apache Ivy
<dependency org="net.sf.cssbox" name="pdf2dom" rev="1.5" />
Leiningen
[net.sf.cssbox/pdf2dom "1.5"]
Apache Buildr
'net.sf.cssbox:pdf2dom:jar:1.5'