libtextcat: A library for efficient, lightweight text classification 1)

Package available in: [trunk] [8.0] [7.0] [6.0]

Libtextcat is a library with functions that implement the classification technique described in Cavnar & Trenkle, "N-Gram-Based Text Categorization". It was primarily developed for language guessing, a task on which it is known to perform with near-perfect accuracy. Considerable effort went into making this implementation fast and efficient. The language guesser processes over 100 documents/second on a simple PC, which makes it practical for many uses.

... part of T2, get it here

URL: http://software.wise-guys.nl/libtextcat/

Author: Frank Scheelen <frank [at] wise-guys [dot] nl>
Maintainer: The T2 Project <t2 [at] t2-project [dot] org>

License: BSD
Status: Stable
Version: 2.2

Remark: Does cross compile (as set up and patched in T2).

Download: http://software.wise-guys.nl/download/libtextcat-2.2.tar.gz

T2 source: libtextcat.cache
T2 source: libtextcat.conf
T2 source: libtextcat.desc

Build time (on reference hardware): 5% (relative to binutils) 2)

Installed size (on reference hardware): 0.06 MB, 12 files

Dependencies (build time detected): 00-dirtree bash binutils bzip2 coreutils diffutils findutils gcc glibc grep linux-header make mktemp net-tools sed sysfiles tar

1) This page was automatically generated from the T2 package source. Corrections, such as dead links, URL changes, or typos, need to be made directly in that source.

2) Compatible with Linux From Scratch's "Standard Build Unit" (SBU).