4.5 Article

Linking norms, ratings, and relations of words and concepts across multiple language varieties

Journal

BEHAVIOR RESEARCH METHODS
Volume 54, Issue 2, Pages 864-884

Publisher

SPRINGER
DOI: 10.3758/s13428-021-01650-1

Keywords

Word and concept properties; Interdisciplinary database; Cross-linguistic comparison; Test-driven data curation; Psycholinguistic norms; Ratings; Linguistic data

Funding

  1. ERC [715618]
  2. European Research Council (ERC) [715618] Funding Source: European Research Council (ERC)

Ask authors/readers for more resources

Researchers have found that psychologists and linguists have collected a vast amount of data on word and concept properties, and have attempted to combine information from both fields to establish the NoRaRe database. This database integrates data from 98 data sets in psychology and linguistics, covering 65 unique properties for 40 languages, and is managed through different workflows.
Psychologists and linguists collect various data on word and concept properties. In psychology, scholars have accumulated norms and ratings for a large number of words in languages with many speakers. In linguistics, scholars have accumulated cross-linguistic information about the relations between words and concepts. Until now, however, there have been no efforts to combine information from the two fields, which would allow comparison of psychological and linguistic properties across different languages. The Database of Cross-Linguistic Norms, Ratings, and Relations for Words and Concepts (NoRaRe) is the first attempt to close this gap. Building on a reference catalog that offers standardization of concepts used in historical and typological language comparison, it integrates data from psychology and linguistics, collected from 98 data sets, covering 65 unique properties for 40 languages. The database is curated with the help of manual, automated, semi-automated workflows and uses a software API to control and access the data. The database is accessible via a web application, the software API, or using scripting languages. In this study, we present how the database is structured, how it can be extended, and how we control the quality of the data curation process. To illustrate its application, we present three case studies that test the validity of our approach, the accuracy of our workflows, and the integrative potential of the database. Due to regular version updates, the NoRaRe database has the potential to advance research in psychology and linguistics by offering researchers an integrated perspective on both fields.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available