| Title: |
A curated global dataset of social contact between diverse language communities |
| Authors: |
Kashima, Eri; Di Garbo, Francesca; Raatikainen, Oona; Forkel, Robert; Avelino, Rosnátaly; Beck, Sacha; Berge, Anna; Blanco Pena, Ana; Bowden, Ross; Brid, Nicolás; Brincat, Joseph M.; Carpio, María Belén; Cobbinah, Alexander; Cúneo, Paola; Deginet Wotango Doyiso, Wotango Doyiso; Fehn, Anne Maria; Gholami, Saloumeh; Ghosh, Arun; Gibson, Hannah; Hall, Elizabeth; Hannß, Katja; Haynie, Hannah; Jacka, Jerry J.; Jenny, Mathias; Kowalik, Richard; Kulkarni-Joshi, Sonal; Mous, Maarten; Mendoza, Marcela; Messineo, Cristina; Moro, Francesca Romana; Nater, Hank; Ocasio, Michelle; Olsson, Bruno; Ospina Bozzi, Ana María; Paredes, Agustina; Phiri, Admire; Quint, Nicolas; Sandman, Erika; Schokkin, Dineke; Singer, Ruth; Smith-Dennis, Ellen; Souag, Lameen; Sulistyono, Yunus; Treis, Yvonne; Urban, Matthias; Vaughan, Jill; Ziegelmeyer, Georg; Zikmundová, Veronika; Napoleão de Souza, Ricardo; Sinnemäki, Kaius |
| Contributors: |
Subject English, Helsinki |
| Publisher Information: |
Nature Research |
| Publication Year: |
2026 |
| Collection: |
Helsingfors Universitet: HELDA – Helsingin yliopiston digitaalinen arkisto |
| Subject Terms: |
612,1 Languages; KOTA2025?; PREM0000; 1 - Publication available open access by the publisher; 1 - Open access publication channel; 1 - Self archived; https://hdl.handle.net/10138/625616; 1- Minst en av författarna har en utländsk affiliation; 1- Publicerad utomlands; 0- Ingen affiliation med ett företag |
| Description: |
The GramAdapt Social Contact Dataset is a curated dataset of 34 language pairs with qualitative and quantifiable data on social interaction and aspects of societal multilingualism. The language pairs were sampled globally to represent the world’s linguistic diversity. The dataset can be used to interrogate the social dimensions of language contact independently or in conjunction with appropriate linguistic data. The data were collected by distributing a questionnaire to experts who have experience with either one or both of the language communities of a pair. The data represent subjective expert assessments based on choices from predetermined answers which can be quantified. Authors 1, 2 and 3 manually checked the response to identify possible misjudgments or misunderstandings. This results in a dataset containing 13,493 data points. This dataset is a first of its kind in the field of linguistics, built upon wide findings from sociolinguistics, historical linguistics, psycholinguistics, and linguistic anthropology. ; Peer reviewed |
| Document Type: |
article in journal/newspaper |
| File Description: |
application/pdf |
| Language: |
English |
| Relation: |
https://hdl.handle.net/10138/625616; 105025232912 |
| Availability: |
https://hdl.handle.net/10138/625616 |
| Rights: |
cc_by ; info:eu-repo/semantics/openAccess ; openAccess |
| Accession Number: |
edsbas.55A0524 |
| Database: |
BASE |