KNAW


Large Scale Syntactic Annotation of written Dutch (LASSY)



Title: Large Scale Syntactic Annotation of written Dutch (LASSY)
Period: 01/2007 - unknown
Status: Current
Research number: OND1328247
Data supplier: Website CLCG

Abstract

LASSY (Large Scale Syntactic Annotation of written Dutch) is a STEVIN project. STEVIN is a Flemish-Dutch Language and Speech Processing Technology Programme launched by the Nederlandse Taalunie. The STEVIN programme office is run jointly by the NWO Humanities Division and SenterNovem. A large corpus of written Dutch texts (1,000,000 words), based on D-COI and its successor, is syntactically annotated with manual correction. In addition, the full corpus to be developed in the successor of D-COI (500,000,000 words) is syntactically annotated automatically. The project aims to extend the available syntactically annotated corpora for Dutch both in size and with respect to the range of text genres and topical domains. Various browse and search tools for syntactically annotated corpora will also be further developed and made available. Their potential for applications in corpus linguistics and information extraction will be illustrated and evaluated.

Related organisations

Other involved organisations

Katholieke Universiteit Leuven

Related people

Project leader: Prof. dr. G.J.M. van Noord


