A Machine Learning Approach to Zeolite Synthesis Enabled by Automatic Literature Data Extraction

Abstract

Zeolites are porous, aluminosilicate materials with many industrial and green applications. Despite their industrial relevance, many aspects of zeolite synthesis remain poorly understood requiring costly trial and error synthesis. In this paper, we create natural language processing techniques and text markup parsing tools to automatically extract synthesis information and trends from zeolite journal articles. We further engineer a data set of germanium-containing zeolites to test the accuracy of the extracted data and to discover potential opportunities for zeolites containing germanium. We also create a regression model for a zeolites framework density from the synthesis conditions. This model has a cross-validated root mean squared error of 0.98 T/1000 Angstrom^3, and many of the model decision boundaries correspond to known synthesis heuristics in germanium-containing zeolites. We propose that this automatic data extraction can be applied to many different problems in zeolite synthesis and enable novel zeolite morphologies.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 19, 2019
Accession Number
AD1103797

Entities

People

  • Avel·lí Corma Canós
  • Edward Kim
  • Elsa A Olivetti
  • Manuel Moliner
  • Soonhyoung Kwon
  • Terry Z. Gani
  • Yuriy Román-Leshkov
  • Zach Jensen

Organizations

  • Massachusetts Institute of Technology

Tags

Communities of Interest

  • Autonomy
  • Energy and Power Technologies
  • Human Systems

DTIC Thesaurus Topics

  • Chemical Compounds
  • Chemical Engineering
  • Chemical Synthesis
  • Chemistry
  • Data Mining
  • Data Science
  • Engineering
  • Information Science
  • Inorganic Chemistry
  • Machine Learning
  • Materials
  • Materials Science
  • Natural Language Processing
  • Natural Languages
  • Neural Networks
  • Silicates
  • Supervised Machine Learning

Readers

  • Computational Linguistics
  • Computational Modeling and Simulation
  • Organic Chemistry

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation
  • AI & ML - Neural Networks