TY - JOUR
T1 - The Database of Cross-Linguistic Colexifications, reproducible analysis of cross-linguistic polysemies
AU - Rzymski, Christoph
AU - Tresoldi, Tiago
AU - Greenhill, Simon
AU - Wu, Mei-Shin
AU - Schweikhard, Nathanael E.
AU - Koptjevskaja-Tamm, Maria
AU - Gast, Volker
AU - Bodt, Timotheus A
AU - Hantgan, Abbie
AU - Kaiping, Gereon
AU - Chang, Sophie
PY - 2020
Y1 - 2020
N2 - Advances in computer-assisted linguistic research have been greatly influential in reshaping linguistic research. With the increasing availability of interconnected datasets created and curated by researchers, more and more interwoven questions can now be investigated. Such advances, however, are bringing high requirements in terms of rigorousness for preparing and curating datasets. Here we present CLICS, a Database of Cross-Linguistic Colexifications (CLICS). CLICS tackles interconnected interdisciplinary research questions about the colexification of words across semantic categories in the world's languages, and show-cases best practices for preparing data for cross-linguistic research. This is done by addressing shortcomings of an earlier version of the database, CLICS2, and by supplying an updated version with CLICS3, which massively increases the size and scope of the project. We provide tools and guidelines for this purpose and discuss insights resulting from organizing student tasks for database updates.
AB - Advances in computer-assisted linguistic research have been greatly influential in reshaping linguistic research. With the increasing availability of interconnected datasets created and curated by researchers, more and more interwoven questions can now be investigated. Such advances, however, are bringing high requirements in terms of rigorousness for preparing and curating datasets. Here we present CLICS, a Database of Cross-Linguistic Colexifications (CLICS). CLICS tackles interconnected interdisciplinary research questions about the colexification of words across semantic categories in the world's languages, and show-cases best practices for preparing data for cross-linguistic research. This is done by addressing shortcomings of an earlier version of the database, CLICS2, and by supplying an updated version with CLICS3, which massively increases the size and scope of the project. We provide tools and guidelines for this purpose and discuss insights resulting from organizing student tasks for database updates.
U2 - 10.1038/s41597-019-0341-x
DO - 10.1038/s41597-019-0341-x
M3 - Article
SN - 2052-4463
VL - 7
SP - -
JO - Scientific data
JF - Scientific data
IS - 13
ER -