The “Narratives” fMRI dataset for evaluating models of naturalistic language comprehension

Abstract

The “Narratives” collection aggregates a variety of functional MRI datasets collected while human subjects listened to naturalistic spoken stories. The current release includes 345 subjects, 891 functional scans, and 27 diverse stories of varying duration totaling ~4.6 hours of unique stimuli (~43,000 words). This data collection is well-suited for naturalistic neuroimaging analysis, and is intended to serve as a benchmark for models of language and narrative comprehension. We provide standardized MRI data accompanied by rich metadata, preprocessed versions of the data ready for immediate use, and the spoken story stimuli with time-stamped phoneme- and word-level transcripts. All code and data are publicly available with full provenance in keeping with current best practices in transparent and reproducible neuroimaging.

Document Details

Document Type: Pub Defense Publication
Publication Date: Sep 28, 2021
Source ID: 10.1038/s41597-021-01033-3

Entities

People

Ariel Goldstein
Asieh Zadbood
Christopher Baldassano
Christopher J. Honey
Claire H C Chang
Emily Micciche
Erez Simony
Gina Choe
Hanna Hillman
Janice Chen
Kenneth A. Norman
Liat Hasenfratz
Mai Nguyen
Michael A. Chow
Mor Regev
Neggin Keshavarzian
Olga Lositsky
Paula P. Brooks
Samuel A Nastase
Tamara Vanderwal
Uri Hasson
Yaara Yeshurun
Yaroslav O. Halchenko
Yuan Chang Leong
Yun-fei Liu

Organizations

Intel Corporation
National Institute of Mental Health
United States Department of Defense
United States Department of Health and Human Services

The “Narratives” fMRI dataset for evaluating models of naturalistic language comprehension

Abstract

Document Details

Entities

People

Organizations

Tags

Readers