PRODUCTS OF COMPUTATIONAL LINGUISTICS 67
•Second, it puts into action a large dictionary of thesaurus type, which gives, for each word in its standard form, its correspond-
68 COMPUTATIONAL LINGUISTICS AND LINGUISTIC MODELS
Another system, TextAnalyst, which determines the main topics of a document and the relationships between the words in it, was developed by MicroSystems in Russia (see Figure III.4). This system is not dictionary-based, though it does have a small dictionary of stop-words (prepositions, articles, etc., which should not be processed as meaningful words).
This system reveals the relationships between words. Two words are considered related if they co-occur closely enough in the text, e.g., in the same sentence. From these co-occurrences the program builds a network of relationships between words. Figure III.4 shows the most important words found by TextAnalyst in an early draft of this book, together with the network of their relationships.
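The general idea of such a co-occurrence network can be illustrated with a minimal sketch in Python. This is not the algorithm actually used in TextAnalyst, whose internals are not described here. The stop-word list, the function names, the crude sentence splitter, and the ranking of words by the total weight of their links are all assumptions made only for illustration.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical stop-word list; the list actually used by TextAnalyst is not given.
STOP_WORDS = {"a", "an", "the", "of", "in", "on", "to", "and", "is", "are"}

def split_sentences(text):
    """Split text into sentences at '.', '!' or '?' (a crude heuristic)."""
    return [s for s in re.split(r"[.!?]+", text) if s.strip()]

def content_words(sentence):
    """Return the distinct lowercase words of a sentence that are not stop-words."""
    words = re.findall(r"[a-z]+", sentence.lower())
    return sorted(set(words) - STOP_WORDS)

def build_network(text):
    """Count, for each unordered pair of words, the sentences that contain both."""
    edges = Counter()
    for sentence in split_sentences(text):
        for pair in combinations(content_words(sentence), 2):
            edges[pair] += 1
    return edges

def important_words(edges, n=5):
    """Rank words by the total weight of their links (an assumed importance measure)."""
    weight = Counter()
    for (w1, w2), count in edges.items():
        weight[w1] += count
        weight[w2] += count
    return [word for word, _ in weight.most_common(n)]

if __name__ == "__main__":
    sample = ("The parser analyzes the sentence. "
              "The dictionary helps the parser. "
              "The parser and the dictionary are in the system.")
    network = build_network(sample)
    for pair, count in sorted(network.items()):
        print(pair, count)
    print(important_words(network, 2))
```

In this sketch the words of each pair are kept in alphabetical order, so a link between two words is counted under a single key regardless of the order in which they appear in the sentence. A real system would probably also reduce words to their standard forms and might use a window narrower than the whole sentence.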