5

Building Computer-Mediated Communication Corpora for sociolinguistic Analysis

Communication between humans via networked devices has become an everyday part of people's lives across generations, cultures, geographical areas, and social classes. Shaped by the specific social and technical context in which it is produced, …

Proceedings of the 5th Conference on CMC and Social Media Corpora for the Humanities

This volume presents the proceedings of the 5th edition of the annual conference series on CMC and Social Media Corpora for the Humanities (cmc-corpora2017). This conference series is dedicated to the collection, annotation, processing, and …

Proceedings of the 10th Web as Corpus Workshop (WAC-X) and the EmpiriST Shared Task

The World Wide Web has become increasingly popular as a source of linguistic data, not only within the NLP communities, but also with theoretical linguists facing problems of data sparseness or data diversity. Accordingly, web corpora continue to …

Proceedings of the 8th Web as Corpus Workshop (WAC-8)

Web corpora and other Web-derived data have become a gold mine for corpus linguistics and natural language processing. The Web is an easy source of unprecedented amounts of linguistic data from a broad range of registers and text types. However, a …