NAME
tricount - count trigrams
SYNOPSIS
tricount [-v] [-i] [-b|-e] [-p number] -d trigrams [-f text]
[-l level]
DESCRIPTION
tricount counts the occurences of trigrams found in the file
trigrams. This file must contain only ascii text one word
per line in your favorite language. You will need to pipe
the output thru sort -r to get a nice listing.
tricount uses the same tree structure as wordtree.
OPTIONS
-d trigrams
The argument to option -d is the file containing
strings where to extract or count trigrams from. If the
-v option is given (count mode on), trigrams are pre-
fixed by the number of matches. If this number is
greater than zero the corresponding trigram has been
matched (n+1) times.
-f text
The argument to option -f is the file containing
strings to be classified if they contain the right tri-
grams or not.
-l level
This option is only valid with -f. The argument is the
number of occurences a trigram has to have in trigrams
to be classified as a "proper" trigram.
-i invert mode. This flag has effect only if -f is given.
It causes to print all strings from the file text that
have trigrams listed in file trigrams.
-b begin. tricount will only count trigrams if it is on
the beginning of a string. Furthermore if option -f is
given tricount will only match the beginning of a
string from text.
-p number
Changes the semantics of tricount to match other
lengths of pre-, in-, or postfixes. For Example pen-
tagrams are matched if number equals 5.
-e end. tricount will only count trigrams if it is on the
end of a string. Furthermore if option -f is given
tricount will only match the trailer of a string from
text.
-v be verbose.
ENVIRONMENT
No environment variables are used
SEE ALSO
wordtree(1), sort(1)
BUGS
no bugs discovered
AUTHOR
Thomas Pantzer, pantec@informatik.uni-leipzig.de, 1999