identification; language identification; on-line documents; word shapes; coded characters; character classes; visual characteristics; word bigrams; word trigrams; linear score value combination; expert system; knowledge acquisition; on-line texts; rules; accuracy; stability; varied parameter settings