The “Narratives” fMRI dataset for evaluating models of naturalistic language comprehension
Abstract
The “Narratives” collection aggregates a variety of functional MRI datasets collected while human subjects listened to naturalistic spoken stories. The current release includes 345 subjects, 891 functional scans, and 27 diverse stories of varying duration totaling ~4.6 hours of unique stimuli (~43,000 words). This data collection is well-suited for naturalistic neuroimaging analysis, and is intended to serve as a benchmark for models of language and narrative comprehension. We provide standardized MRI data accompanied by rich metadata, preprocessed versions of the data ready for immediate use, and the spoken story stimuli with time-stamped phoneme- and word-level transcripts. All code and data are publicly available with full provenance in keeping with current best practices in transparent and reproducible neuroimaging.
Document Details
- Document Type
- Pub Defense Publication
- Publication Date
- Sep 28, 2021
- Source ID
- 10.1038/s41597-021-01033-3
Entities
People
- Ariel Goldstein
- Asieh Zadbood
- Christopher Baldassano
- Christopher J. Honey
- Claire H C Chang
- Emily Micciche
- Erez Simony
- Gina Choe
- Hanna Hillman
- Janice Chen
- Kenneth A. Norman
- Liat Hasenfratz
- Mai Nguyen
- Michael A. Chow
- Mor Regev
- Neggin Keshavarzian
- Olga Lositsky
- Paula P. Brooks
- Samuel A Nastase
- Tamara Vanderwal
- Uri Hasson
- Yaara Yeshurun
- Yaroslav O. Halchenko
- Yuan Chang Leong
- Yun-fei Liu
Organizations
- Intel Corporation
- National Institute of Mental Health
- United States Department of Defense
- United States Department of Health and Human Services