The dataset is a shortened version of the data sets of Study 1 from Kjell, et al., 2016.

centrality_data_harmony

Format

A data frame with 2,146 and 4 variables:

words

unique words

n

overall word frequency

central_semantic_similarity

cosine semantic similarity to the aggregated word embedding

n_percent

frequency in percent