tagsoup

TagSoup is a SAX-compliant parser written in Java that, instead of parsing well-formed or valid XML, parses HTML as it is found in the wild: poor, nasty and brutish, though quite often far from short. TagSoup is designed for people who have to process this stuff using some semblance of a rational application design. By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML. TagSoup also includes a command-line processor that reads HTML files and can generate either clean HTML or well-formed XML that is a close approximation to XHTML.
http://home.ccil.org/~cowan/XML/tagsoup/
Apache License 2.0
John Cowan
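The description says that TagSoup exposes a SAX interface, so ordinary SAX handlers can be pointed at messy HTML. The sketch below illustrates that idea with a small handler that collects link targets. The names LinkCollector and collect are hypothetical and used only for illustration. To stay self-contained, the demo in main uses the JDK's built-in parser, which only accepts well-formed input. With TagSoup on the classpath, one would presumably pass a TagSoup reader instead (upstream TagSoup names it org.ccil.cowan.tagsoup.Parser; whether this repackaged artifact keeps that package name is an assumption to verify against the jar).

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical handler: records the href of every <a> element it sees.
class LinkCollector extends DefaultHandler {
    private final List<String> links = new ArrayList<>();

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        // Namespace-aware readers fill localName; others only fill qName.
        String name = localName.isEmpty() ? qName : localName;
        if ("a".equalsIgnoreCase(name)) {
            String href = atts.getValue("href");
            if (href != null) {
                links.add(href);
            }
        }
    }

    List<String> getLinks() {
        return links;
    }

    // Works with any SAX XMLReader. Assumption: a TagSoup parser instance
    // could be passed here in place of the JDK reader used in main.
    static List<String> collect(XMLReader reader, String markup) throws Exception {
        LinkCollector handler = new LinkCollector();
        reader.setContentHandler(handler);
        reader.parse(new InputSource(new StringReader(markup)));
        return handler.getLinks();
    }

    public static void main(String[] args) throws Exception {
        // JDK default parser, used only so the sketch runs without extra jars.
        XMLReader reader = SAXParserFactory.newInstance().newSAXParser().getXMLReader();
        String markup = "<html><body><a href=\"a.html\">A</a></body></html>";
        System.out.println(collect(reader, markup));
    }
}
```

Because collect depends only on the XMLReader interface, the handler code would not need to change when switching from the JDK parser to a TagSoup one; only the reader passed in differs.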
Files
tagsoup-1.0.5.jar
tagsoup-1.0.5.pom
tagsoup-1.0.5-sources.jar
Apache Maven
<dependency>
  <groupId>com.github.fansu.tagsoup</groupId>
  <artifactId>tagsoup</artifactId>
  <version>1.0.5</version>
</dependency>
Gradle Groovy
implementation 'com.github.fansu.tagsoup:tagsoup:1.0.5'
Gradle Kotlin
implementation("com.github.fansu.tagsoup:tagsoup:1.0.5")
Scala SBT
libraryDependencies += "com.github.fansu.tagsoup" % "tagsoup" % "1.0.5"
Groovy Grape
@Grapes(
  @Grab(group='com.github.fansu.tagsoup', module='tagsoup', version='1.0.5')
)
Apache Ivy
<dependency org="com.github.fansu.tagsoup" name="tagsoup" rev="1.0.5" />
Leiningen
[com.github.fansu.tagsoup/tagsoup "1.0.5"]
Apache Buildr
'com.github.fansu.tagsoup:tagsoup:jar:1.0.5'
Dependencies
The project has no third-party dependencies.