ucto 0.5.3-3.1ubuntu1 (amd64 binary) in ubuntu vivid
Ucto can tokenize UTF-8 encoded text files (i.e. separate words from
punctuation, split sentences, generate n-grams), and offers several other
basic preprocessing steps (change case, count words/characters and reverse
lines) that make your text suited for further processing such as indexing,
part-of-speech tagging, or machine translation.
.
Ucto is a product of the ILK Research Group, Tilburg University (The
Netherlands).
.
If you are interested in machine parsing of UTF-8 encoded text files, e.g. to
do scientific research in natural language processing, ucto will likely be of
use to you.
Details
- Package version:
- 0.5.3-3.1ubuntu1
- Status:
- Obsolete
- Component:
- universe
- Priority:
- Extra
Downloadable files
amd64 build of ucto 0.5.3-3.1ubuntu1 in ubuntu trusty PROPOSED produced
these files:
- ucto_0.5.3-3.1ubuntu1_amd64.deb (17.1 KiB)