Welcome

CLARIN K(nowledge)-Centre for Computer-Mediated Communication and Social Media Corpora

The CLARIN Knowledge Centre for Computer-Mediated Communication and Social Media Corpora (CKCMC) offers expertise on language resources and technologies for Computer-Mediated Communication and Social Media. Its basic activities are to

  1. Give researchers, students, and other interested parties information about the available resources, technologies, and community activities,
  2. Support interested parties in producing, modifying or publishing relevant resources and technologies and
  3. Organize training activities.

Computer-Mediated Communication (CMC)

User-generated CMC and social media content offers a wide range of research opportunities for a growing multidisciplinary research community to examine themes that often relate to—but are not limited to—the interaction between language, CMC, and society like, for example, language variation, pragmatics, media and communication studies. The data is also very important for the development of robust NLP tools that can deal with non-standard spelling, vocabulary and grammar. Compilation and dissemination of such corpora are hindered by the unclear legal status of CMC data when distributed as resource to the scientific community, which is further exacerbated by the rapidly changing terms of service by content providers.

Partners

Eurac Research, Bolzano/Bozen, IT

Eurac Research addresses the challenges of the future and seeks answers in the interaction between many different disciplines on three major themes: regions fit for living in, diversity as a life-enhancing feature, a healthy society.

Jožef Stefan Institute (IJS), Ljubljana, SI
IJS is the leading Slovenian research institute for natural sciences with is KT Dept.performing research in advanced information technologies.
Laboratoire de Linguistique Formelle (LLF), FR
The Laboratoire de Linguistique Formelle is a French research unit co-sponsored by the CNRS and the Université of Paris.
Leibniz-Institut für Deutsche Sprache (IDS), Mannheim, DE
IDS is the central non-university institution for the study and documentation of the contemporary usage and recent history of the German language.

Contact

Our helpdesk can be contacted via email to helpdesk @ THIS DOMAIN. The helpdesk offers additional clarifications regarding the documentation and support in using, modifying, producing, or publishing CMC resources and technologies.

Documentation

For more detailed information, see the dedicated documentation section.