From visone manual
Jump to navigation Jump to search

Word-Network Analysis (WNA) extracts a network from a text by interpreting the text as a chain of words and connecting relevant words within a certain interval.

Consider the following text example and take a closer look at the WNA procedure.

The two planes crashed into the towers of the World Trade Center in New York

First of all the WNA identifies the relevant words of the text. This preprocessing step includes:

  • Stemming
  • Filtering words with a length less than a preset word length
  • Filtering stop words
  • Filtering part of speech

The preprocessed text example could look like this:

plane crash tower world trade center new york

In the second and final step the network is constructed. Every word of the preprocessed text constitutes a node of the network, where equal words are represented by solely one node. The connections are created by using one of the following methods:

  • Co-occurrence: Every word within in the same sentence is connected
  • k-Windowing: Every word is connected to the k-1 subsequent words

The following figures show the final network depending on the utilized method.