| Description: |
This repository hosts the Expanded Natural History of Song Discography. It contains 1007 audio recordings of vocal music gathered from many human societies, each annotated with a world region, language, and behavioural context. Each song file contains a 10-second excerpt of the source audio, selected at random from only portions of the recording that contain an audible singer. Given the short form of each excerpt, and the intended use of these files only for research purposes, they have been made available under Fair Use. NHS2-songs.zip contains the audio files, volume-matched and with 1s fade in/out added, in MP3 format. These can be analysed as-is or used in experiments. NHS2-metadata.csv contains annotations, where each row corresponds to a song. The four columns include song, which includes a unique identifier for each song in the format `NHS2-XXXX.mp3`; region, which indicates an approximate geographical location where the song was recorded, using Human Relations Area Files categories (see https://ehrafworldcultures.yale.edu); glottocode, which indicates the language in which the song is produced (see https://glottolog.org); and type, which indicates the behavioural context in which the song was produced, from a set of 10 categories (dance, healing, love, lullaby, play, procession, mourning, work, story, and praise). For assistance with the corpus, contact Martynas Snarskis (martysnarskis@gmail.com), Mila Bertolo (mila.bertolo@mail.mcgill.ca), and Samuel Mehr (mehr@hey.com). Further information about the construction of this corpus will be made available in a forthcoming paper; we will update this Zenodo archive when the paper is publicly available. ; Version 5 updates the audio selection for one of the songs (NHS2-E2SX), which previously did not include vocals |