t

tika-normaliser

This service is an integration of Apache Tika project. It enables to extract metadata and text content of many kinds of files format. The WebLab document in input is enriched with RDF properties for the metadata and Text unit(s) for the content. The service can be configured through the Spring bean of CXF to handle various kind of features (identifying language or not, provide a normalised XHTML output of the document...).
Files download
File Operation
tika-normaliser-1.8.2.pom download
tika-normaliser-1.8.2-sources.jar download
Apache Maven
<dependency>
  <groupId>org.ow2.weblab.webservices</groupId>
  <artifactId>tika-normaliser</artifactId>
  <version>1.8.2</version>
  <type>war</type>
</dependency>
Gradle Groovy
implementation 'org.ow2.weblab.webservices:tika-normaliser:1.8.2'
Gradle Kotlin
implementation("org.ow2.weblab.webservices:tika-normaliser:1.8.2")
Scala SBT
libraryDependencies += "org.ow2.weblab.webservices" % "tika-normaliser" % "1.8.2"
Groovy Grape
@Grapes(
  @Grab(group='org.ow2.weblab.webservices', module='tika-normaliser', version='1.8.2')
)
Apache Ivy
<dependency org="org.ow2.weblab.webservices" name="tika-normaliser" rev="1.8.2" />
Leiningen
[org.ow2.weblab.webservices/tika-normaliser "1.8.2"]
Apache Buildr
'org.ow2.weblab.webservices:tika-normaliser:jar:1.8.2'