S

SiteCrawler

This project provides a simple WebCrawler with retry-capabilities, functionality to distinguish between http/https sites. It biggest feature is that it allows for plugins (or CrawlerActions), which allows you to hook your scripts into the crawling process. It also allow for setting "blocked" URLs. Those URLs or patterns will not be crawled.
https://github.com/forcedotcom/SiteCrawler
The BSD 2-Clause License
Salesforce.com
Jasper Roel
Aggregated version Version Update time
1.0 1.0.0 Jul 31, 2018
1 Records