Package : python-guess-language

Package details

Summary: Guess the natural language of a text

Description:
Attempts to determine the natural language of a selection of Unicode (utf-8)
text.

Based on guesslanguage.cpp by Jacob R Rideout for KDE which itself is based on
Language::Guess by Maciej Ceglowski.

Detects over 60 languages - all languages listed in the trigrams directory plus
Japanese, Chinese, Korean and Greek.

guess_language uses heuristics based on the character set and trigrams in a
sample text to detect the language. It works better with longer samples and
will be confused if the sample text includes markup such as HTML tags.


URL: https://pypi.python.org/pypi/guess-language
License: LGPL

Maintainer: nobody

List of RPMs