Testing a Distributional Semantics Account of Grammatical Gender Effects on Semantic Gender Perception

George Flint, Anna Ivanova


One well-known prediction of linguistic relativity theories is the effect of a noun's grammatical gender on its semantics; for instance, "key" is feminine in Spanish but masculine in German and thus might be associated with feminine traits for Spanish speakers but with masculine traits for German speakers. Experimental and corpus evidence for these effects has been mixed. In this work, we considered a distributional semantics account of putative grammatical gender effects on semantics and tested its predictions in Spanish, German, and English (control). In Part 1, we hypothesized that grammatical gender of concrete nouns affects the similarity of noun embeddings to embeddings of adjectives semantically associated with men or with women. We found support for this hypothesis in fastText embeddings, showing that nouns with the same meaning but with opposite genders in Spanish and German show opposite attraction effects both for words "man" and "woman" and for adjectives associated with men and women, although the effect size was weaker for German than for Spanish. BERT embeddings also showed consistent effects for Spanish but mixed results for German, suggesting possible variation across languages. In Part 2, we asked whether people systematically choose adjectives associated with women/men for grammatically feminine/masculine nouns, respectively. In a noun-adjective matching experiment (432 participants total), we found predicted grammatical gender effects for Spanish but not for German. Cosine similarity between the noun and the adjectives in fastText embeddings significantly predicted trial-level responses in all 3 languages; however, Spanish showed an additional effect of grammatical gender, indicating that participant noun-adjective associations are not fully explained by distributional semantics.


word embeddings