Think! Evidence

Coherence in natural language : data structures and applications

Show simple item record

dc.contributor Edward Gibson.
dc.contributor Massachusetts Institute of Technology. Dept. of Brain and Cognitive Sciences.
dc.contributor Massachusetts Institute of Technology. Dept. of Brain and Cognitive Sciences.
dc.creator Wolf, Florian, 1975-
dc.date 2005-09-27T18:42:33Z
dc.date 2005-09-27T18:42:33Z
dc.date 2004
dc.date 2005
dc.identifier http://hdl.handle.net/1721.1/28854
dc.identifier 60405129
dc.description Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Brain and Cognitive Sciences, February 2005.
dc.description Includes bibliographical references (leaves [143]-148).
dc.description (cont.) baseline, and that some coherence-based approaches best predict the human data. However, coherence-based algorithms that operate on trees did not perform as well as coherence-based algorithms that operate on more general graphs. It is suggested that that might in part be due to the fact that more general graphs are more descriptively adequate than trees for representing discourse coherence.
dc.description The general topic of this thesis is coherence in natural language, where coherence refers to informational relations that hold between segments of a discourse. More specifically, this thesis aims to (1) develop criteria for a descriptively adequate data structure for representing discourse coherence; (2) test the influence of coherence on psycholinguistic processes, in particular, pronoun processing; (3) test the influence of coherence on the relative saliency of discourse segments in a text. In order to address the first aim, a method was developed for hand-annotating a database of naturally occurring texts for coherence structures. The thus obtained database of coherence structures was used to test assumptions about descriptively adequate data structures for representing discourse coherence. In particular, the assumption that discourse coherence can be represented in trees was tested, and results suggest that more powerful data structures than trees are needed (labeled chain graphs, where the labels represent types of coherence relations, and an ordered array of nodes represents the temporal order of discourse segments in a text). The second aim was addressed in an on-line comprehension and an off-line production experiment. Results from both experiments suggest that only a coherence-based account predicted the full range of observed data. In that account, the observed preferences in pronoun processing are not a result of pronoun-specific mechanisms, but a byproduct of more general cognitive mechanisms that operate when establishing coherence. In order to address the third aim, layout-, word-, and coherence-based approaches to discourse segment ranking were compared to human rankings. Results suggest that word-based accounts provide a strong
dc.description by Florian Wolf.
dc.description Ph.D.
dc.format 150 leaves
dc.format 8749340 bytes
dc.format 8769604 bytes
dc.format application/pdf
dc.format application/pdf
dc.format application/pdf
dc.language en_US
dc.publisher Massachusetts Institute of Technology
dc.rights M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.
dc.rights http://dspace.mit.edu/handle/1721.1/7582
dc.subject Brain and Cognitive Sciences.
dc.title Coherence in natural language : data structures and applications
dc.type Thesis


Files in this item

Files Size Format View
60405129-MIT.pdf 8.769Mb application/pdf View/Open

Files in this item

Files Size Format View
60405129-MIT.pdf 8.769Mb application/pdf View/Open

Files in this item

Files Size Format View
60405129-MIT.pdf 8.769Mb application/pdf View/Open

This item appears in the following Collection(s)

Show simple item record

Search Think! Evidence


Browse

My Account