Numerical Algorithms for the Analysis of Expert Opinions Elicited in Text Format

Abstract

Latent Dirichlet Allocation (LDA) is a scheme which may be used to estimate topics and their probabilities within a corpus of text data. The fundamental assumptions in this scheme are that text is a realisation of a stochastic generative model and that this model is well described by the combination of multinomial probability distributions and Dirichlet probability distributions. Various means can be used to solve the Bayesian estimation task arising in LDA. Our formulations of LDA are applied to subject matter expert text data elicited through carefully constructed decision support workshops. In the main these workshops address substantial problems in Australian Defence Capability. The application of LDA here is motivated by a need to provide insights into the collected text, which is often voluminous and complex in form. Additional investigations described in this report concern questions of identifying and quantifying di erences between stake-holder group text written to a common subject matter. Sentiment scores and key-phase estimators are used to indicate stake-holder di erences. Some examples are provided using unclassi ed data.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 01, 2013
Accession Number
ADA581076

Entities

People

  • W. P. Malcolm
  • Wray Buntine

Organizations

  • Defence Science and Technology Group

Tags

Communities of Interest

  • Autonomy
  • Biomedical
  • C4I
  • Ground and Sea Platforms
  • Human Systems

DTIC Thesaurus Topics

  • Artificial Intelligence
  • Artificial Intelligence Software
  • Cognitive Science
  • Computational Science
  • Computer Languages
  • Data Mining
  • Databases
  • Information Processing
  • Information Retrieval
  • Information Science
  • Information Systems
  • Machine Learning
  • Monte Carlo Method
  • Natural Language Processing
  • Network Science
  • Probability Distributions
  • Random Variables

Readers

  • Computational Linguistics
  • Organizational Process Management (OPM).
  • Statistical inference.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Information Retrieval
  • AI & ML - Machine Learning Algorithms