Package : boilerpipe > RPM : boilerpipe-1.2.0-11.mga7.src.rpm

Basic items

Name boilerpipe
Version 1.2.0
Release 11.mga7
URL https://github.com/kohlschutter/boilerpipe
Group Development/Java
Summary Boilerplate Removal and Fulltext Extraction from HTML pages
Size 139KB
Arch noarch
License ASL 2.0

Description

The boilerpipe library provides algorithms to detect and
remove the surplus "clutter" (boilerplate, templates)
around the main textual content of a web page.

The library already provides specific strategies
for common tasks (for example: news article extraction) and
may also be easily extended for individual problem settings.

Extracting content is very fast (milliseconds), just needs the
input document (no global or site-level information required) and
is usually quite accurate.

Media information

Distribution release Mageia 7
Media name core-release
Media arch i586

Advanced items

Source RPM NOT IN DATABASE ?!
Build time 2018-09-18 18:16:48
Changelog View in Sophie
Files View in Sophie
Dependencies View in Sophie