KNAW

Onderzoek

Voorbij Tree-Bank annotaties: desambiguering middels distributionele...

Pagina-navigatie:


Wijzig Onderzoekgegevens


Titel Voorbij Tree-Bank annotaties: desambiguering middels distributionele similarity-gebaseerde performance modellen
Looptijd 01 / 2002 - 01 / 2004
Status Afgesloten
Onderzoeknummer OND1283754

Samenvatting (EN)

The proposed project addresses the development of an accurate probabilistic model of syntactic ambiguity resolution in the processing of natural languages. The new model deviates from current practice in performance models in that it starts out from corpora annotated according to a weak linguistic theory, and aims at inducing further syntactic constraints, by machine learning techniques, that are not anticipated by that linguistic theory. Modern, advanced linguistic theories constitute a source of inspiration for the new model. More specifically, the focus of the project is on the question of how to model, by automatic means, some of the implicitly available lexical constraints over phrase-structure annotations, within a memory-based performance model of human syntactic processing. The proposed model uses highly context- and lexically-sensitive répresentations (called Graph-Symbols) and distributional similarity-based algorithms to approximate the influence of these implicit constraints. The present research is located at the interdisciplinary meeting point between linguistic theory, probabilistic modeling methods, and machine learning techniques. lt has an important empirical component and it touches upon technological issues.

Betrokken organisaties

Betrokken personen

Onderzoeker Prof.dr.ir. R.J.H. Scha
Projectleider Dr. K. Sima'an

Classificatie

A90000 Zuiver-wetenschappelijk onderzoek
D16000 Informatica
D36000 Taal- en literatuurwetenschappen

Omhoog
Ga terug naar de inhoud
Ga terug naar de site navigatie