CLARIN K(nowledge)-centre for Computer-Mediated Communication and Social Media Corpora

The CLARIN Knowledge Centre for Computer-Mediated Communication and Social Media Corpora (CKCMC) offers expertise on language resources and technologies for Computer-Mediated Communication and Social Media. Its basic activities are to

  1. Give researchers, students, and other interested parties information about the available resources, technologies, and community activities,
  2. Support interested parties in producing, modifying or publishing relevant resources and technologies and
  3. Organize training activities.

Computer-Mediated Communication (CMC)

User-generated CMC and social media content offers a wide range of research opportunities for a growing multidisciplinary research community to examine themes that often relate to—but are not limited to—the interaction between language, CMC, and society like, for example, language variation, pragmatics, media and communication studies. The data is also very important for the development of robust NLP tools that can deal with non-standard spelling, vocabulary and grammar. Compilation and dissemination of such corpora are hindered by the unclear legal status of CMC data when distributed as resource to the scientific community, which is further exacerbated by the rapidly changing terms of service by content providers.

Partners

Eurac Research, Bolzano/Bozen, IT
Seeks answers in the interaction between many different disciplines with the Institute for Applied Linguistics aiming to answer current linguistic issues involving both educational and economic subjects as well as social questions.
Eurac Research, Bolzano/Bozen, IT
Jožef Stefan Institute (IJS), Ljubljana, SI
The leading Slovenian research institute for natural sciences with the Department of Knowledge Technologies performing research in advanced information technologies.
Jožef Stefan Institute (IJS), Ljubljana, SI
Laboratoire de Linguistique Formelle (LLF), FR
A French research unit co-sponsored by the CNRS and the Université of Paris.
Laboratoire de Linguistique Formelle (LLF), FR
Leibniz-Institut für Deutsche Sprache (IDS), Mannheim, DE
The central non-university institution for the study and documentation of the contemporary usage and recent history of the German language.
Leibniz-Institut für Deutsche Sprache (IDS), Mannheim, DE

Documentation

Helpdesk

Our helpdesk can be contacted via email to contact-ckcmc @ CLARIN DOT EU. The helpdesk offers additional clarifications regarding the documentation and support in using, modifying, producing, or publishing CMC resources and technologies.