Publikationsansicht

User Needs for Textual Corpora in Natural Language Processing (1993)

Abstract
We discuss the needs of natural language processing (NLP) researchers in relation to corpora. Reasons for the growing interest in corpora by NLP researchers are given. Their needs are quite different to those of theoretical linguists, as end-users of NLP systems require robust systems for ‘real language’. Monolithic general language descriptions are contrasted with sublanguage descriptions and found to be wanting. Ideal needs of NLP are contrasted with realistic needs. Ideal needs cannot be satisfied without first having solved problems whose solution requires accurately tagged and analysed corpora. Currently, partial skeletal analysis of corpora can yield useful patterns and structures. Various computational linguistic and probability or statistically based tools are required to allow further exploration of especially sublanguage corpora.

Details der Publikation
Download http://llc.oxfordjournals.org/cgi/content/short/8/4/227
http://dx.doi.org/10.1093/llc/8.4.227
Herausgeber Oxford University Press
Archiv HighWire Press OAI Repository (United States)
Keywords Articles
Typ TEXT
Sprache Englisch