Fast and flexible estimation of effective migration surfaces

Abstract

Spatial population genetic data often exhibits ‘isolation-by-distance,’ where genetic similarity tends to decrease as individuals become more geographically distant. The rate at which genetic similarity decays with distance is often spatially heterogeneous due to variable population processes like genetic drift, gene flow, and natural selection. Petkova et al., 2016 developed a statistical method called Estimating Effective Migration Surfaces (EEMS) for visualizing spatially heterogeneous isolation-by-distance on a geographic map. While EEMS is a powerful tool for depicting spatial population structure, it can suffer from slow runtimes. Here, we develop a related method called Fast Estimation of Effective Migration Surfaces (FEEMS). FEEMS uses a Gaussian Markov Random Field model in a penalized likelihood framework that allows for efficient optimization and output of effective migration surfaces. Further, the efficient optimization facilitates the inference of migration parameters per edge in the graph, rather than per node (as in EEMS). With simulations, we show conditions under which FEEMS can accurately recover effective migration surfaces with complex gene-flow histories, including those with anisotropy. We apply FEEMS to population genetic data from North American gray wolves and show it performs favorably in comparison to EEMS, with solutions obtained orders of magnitude faster. Overall, FEEMS expands the ability of users to quickly visualize and interpret spatial structure in their data.

Document Details

Document Type
Pub Defense Publication
Publication Date
Jul 30, 2021
Source ID
10.7554/elife.61927

Entities

People

  • John Novembre
  • Joseph H Marcus
  • Rina Foygel Barber
  • Wooseok Ha

Organizations

  • National Institute of General Medical Sciences
  • National Science Foundation
  • Office of Naval Research
  • Statistics New Zealand
  • University of California, Berkeley
  • University of Chicago

Tags

Readers

  • Cellular and Molecular Pathways of Apoptosis.
  • Computer Vision.
  • Distributed Systems and Data Platform Development

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Machine Learning Algorithms
  • Biotechnology