CRA

From visone manual
Revision as of 22:58, 25 November 2010 by Fratz (talk | contribs)
Jump to navigation Jump to search

Centering Resonance Analysis (CRA) extracts a network from a text by analysing its centers, for which the Centering Theory states that they contain the main contents of the text. According to Centering Theory, these centers are the Noun Phrases (NPs) of a text, that is the nouns together with any modifiers belonging to them. Thus, words within these centers define the words within the CRA text network, and the way they occur in the text can cause links between them.

Let us consider the following example sentence taken from the short story We Can Remember It for You Wholesale by Philip K. Dick:

Half an ancient silver fifty cent piece, several quotations from John Donne's sermons written incorrectly, each on a separate piece of transparent tissue-thin paper, ...

In the first step, this sentence is parsed to extract the NPs. For each of the words appearing in an NP, a node is created in the graph.

Half an ancient silver fifty cent piece, several quotations from John Donne's sermons written incorrectly, each on a separate piece of transparent tissue-thin paper, ...

http://tqzamf.ath.cx/pkd/cra0.png

Next, words that occur within the same NP are connected, regardless of their distance within the NPs. For example, the NP transparent tissue-thin paper causes the edges between transparent and paper, paper and tissue-thin as well as tissue-thin and transparent. Had there been five words in the NP, each of them would have been connected with all of the four others.

Half an ancient silver fifty cent piece, several quotations from John Donne's sermons written incorrectly, each on a separate piece of transparent tissue-thin paper, ...

http://tqzamf.ath.cx/pkd/cra2.png

Words that are not part of the same NP are still connected if one of them is at the end of an NP and the other is at the beginning of the following NP. That is, because piece is the last word of then NP separate piece and transparent if the first word of the NP following it, the two words are connected in the network.

Half an ancient silver fifty cent piece, several quotations from John Donne's sermons written incorrectly, each on a separate piece of transparent tissue-thin paper, ...

http://tqzamf.ath.cx/pkd/cra3.png

Finally, duplicate nodes for the same word are merged. For example, because piece appears twice, two nodes were created for it; these are now merged into just one. http://tqzamf.ath.cx/pkd/cra4.png http://tqzamf.ath.cx/pkd/cra5.png