NLP4CMC 2016: 3rd Workshop on Natural Language Processing for Computer-Mediated Communication / Social Media - Workshop at KONVENS 2016, Bochum/Germany, September 22, 2016.
TOPIC AND SCOPE: Over the past decade, there has been growing interest in collecting, processing and analyzing data from genres of social media and computer-mediated communication (CMC). In large corpora that have been automatically crawled from the web, CMC data are often regarded as an unloved “bycatch” that is difficult to handle with NLP tools optimized for processing edited text. At the same time, the presence of CMC data in web corpora is relevant for all research and application contexts that require data sets representing the full diversity of genres and linguistic variation on the web.