norconex-importer

Norconex Importer is a Java library and command-line application that parses a computer file, whatever its format (HTML, PDF, Word, etc.), and extracts its content as plain text. It also lets you manipulate the extracted text before importing or using it in your own service or application.
https://opensource.norconex.com/importer
Aggregated version   Version      Update time
3.0                  3.0.1        Jul 09, 2023
3.0                  3.0.0        Jan 03, 2022
3.0                  3.0.0-RC1    Oct 09, 2021
3.0                  3.0.0-M2     Jul 29, 2021
3.0                  3.0.0-M1     Mar 01, 2021
2.11                 2.11.0       Oct 18, 2021
2.10                 2.10.0       Dec 22, 2019
2.9                  2.9.0        Jun 18, 2018
2.8                  2.8.0        Nov 27, 2017
2.7                  2.7.2        May 27, 2017
2.7                  2.7.1        May 25, 2017
2.7                  2.7.0        Apr 26, 2017
2.6                  2.6.1        Dec 15, 2016
2.6                  2.6.0        Aug 26, 2016
2.5                  2.5.2        May 31, 2016
2.5                  2.5.1        Mar 22, 2016
2.5                  2.5.0        Feb 29, 2016
2.4                  2.4.0        Nov 03, 2015
2.3                  2.3.1        Aug 08, 2015
2.3                  2.3.0        Jul 22, 2015
2.2                  2.2.0        Jun 16, 2015
2.1                  2.1.1        Apr 09, 2015
2.1                  2.1.0        Apr 01, 2015
2.0                  2.0.0        Nov 26, 2014