Probabilistic Deterministic Finite Automata and Recurrent Networks, Revisited

Abstract

Reservoir computers (RCs) and recurrent neural networks (RNNs) can mimic any finite-state automaton in theory, and some workers demonstrated that this can hold in practice. We test the capability of generalized linear models, RCs, and Long Short-Term Memory (LSTM) RNN architectures to predict the stochastic processes generated by a large suite of probabilistic deterministic finite-state automata (PDFA) in the small-data limit according to two metrics: predictive accuracy and distance to a predictive rate-distortion curve. The latter provides a sense of whether or not the RNN is a lossy predictive feature extractor in the information-theoretic sense. PDFAs provide an excellent performance benchmark in that they can be systematically enumerated, the randomness and correlation structure of their generated processes are exactly known, and their optimal memory-limited predictors are easily computed. With less data than is needed to make a good prediction, LSTMs surprisingly lose at predictive accuracy, but win at lossy predictive feature extraction. These results highlight the utility of causal states in understanding the capabilities of RNNs to predict.

Document Details

Document Type: Pub Defense Publication
Publication Date: Jan 06, 2022
Source ID: 10.3390/e24010090

Entities

People

James P. Crutchfield
Sarah E. Marzen

Organizations

Air Force Office of Scientific Research
United States Army Research Laboratory

Probabilistic Deterministic Finite Automata and Recurrent Networks, Revisited

Abstract

Document Details

Entities

People

Organizations

Tags

Fields of Study

Readers

Technology Areas