The study of language through computerized corpora, or enormous samples of machine-readable text drawn from authentic language situations.- Category ID : 427449
The Linguistic Data Consortium (LDC) creates, collects and distributes speech and text databases, annotated corpora, treebanks, lexicons and other linguistic resources for research, education and development.
A subgroup of the Association for Computational Linguistics (ACL), this group is concerned with all aspects of linguistic annotation of language resources (linguistic corpora), especially the advancement of interoperability. Sponsors the annual Linguistic Annotation Workshop (LAW).
A subgroup of the Association for Computational Linguistics (ACL) which promotes interest in the use of the Internet as a source of linguistic data, and as an object of study in its own right. Organizes the WAC workshops.