Integrated Data Driven Solutions (I2DS) Project in the Active Social Engineering Defense (ASED) Program

Abstract

The Purdue Team's proposal is only for TA1, which focuses on using machine learning models to detect social engineering messages. The Purdue team joined teams led by Berkeley and CMU to form the LASER team. The Purdue team developed techniques to train classification models for social engineering emails, and participated in the dry-run and the evaluations. Three models were developed. Two models analyze the subject and the text in the body. A TF-IDF (term frequency-inverse document frequency) model uses standard term frequency information. A second model extracts motive features from the text to identify the message authors intent (e.g., get information, access social network). A third model is a knowledge and graph model that extracts relation features from the sender and receiver information. An ensemble model aggregates output from the three models to make a prediction, and is comprised of Logistic Regression model and Neural Network model. The team has extensively explored different models, training techniques, and their impacts on accuracy.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Aug 15, 2020
Accession Number
AD1137122

Entities

People

  • Dan Goldwasser
  • Jennifer Neville
  • Ninghui Li

Organizations

  • Purdue University

Tags

Communities of Interest

  • Cyber
  • Energy and Power Technologies
  • Human Systems

DTIC Thesaurus Topics

  • Air Force Research Laboratories
  • Computational Linguistics
  • Databases
  • Engineering
  • False Alarms
  • Government Procurement
  • Information Science
  • Jet Propulsion
  • Language
  • Linguistics
  • Machine Learning
  • Motor Skills
  • Natural Language Processing
  • Natural Languages
  • Neural Networks
  • Social Engineering
  • Social Networks

Fields of Study

  • Computer science

Readers

  • Computational Modeling and Simulation
  • Neural Network Machine Learning.
  • Research Science/Academic Research

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval
  • AI & ML - Neural Networks
  • Directed Energy