Repeatable Reverse Engineering with PANDA

Abstract

We present PANDA, an open-source tool that has been purpose-built to support whole system reverse engineering. It is built upon the QEMU whole system emulator, and so analyses have access to all code executing in the guest and all data. PANDA adds the ability to record and replay executions, enabling iterative, deep, whole system analyses. Further, the replay log files are compact and shareable, allowing for repeatable experiments. A nine billion instruction boot of FreeBSD, e.g., is represented by only a few hundred MB. PANDA leverages QEMU's support of thirteen different CPU architectures to make analyses of those diverse instruction sets possible within the LLVM IR. In this way, PANDA can have a single dynamic taint analysis, for example, that precisely supports many CPUs. PANDA analyses are written in a simple plugin architecture which includes a mechanism to share functionality between plugins, increasing analysis code re-use and simplifying complex analysis development. We demonstrate PANDA's effectiveness via a number of use cases, including enabling an old but legitimately purchased game to run despite a lost CD key, in-depth diagnosis of an Internet Explorer crash, and uncovering the censorship activities and mechanisms of an IM client.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Dec 08, 2015
Accession Number
AD1034415

Entities

People

  • Joshua E Hodosh
  • Timothy R. Leek

Organizations

  • MIT Lincoln Laboratory

Tags

Communities of Interest

  • Cyber

DTIC Thesaurus Topics

  • Algorithms
  • Computations
  • Computer Programming
  • Computer Programs
  • Computers
  • Debugging
  • Detectors
  • Engineering
  • Mathematical Analysis
  • Operating Systems
  • Reliability
  • Reverse Engineering
  • Software Development
  • Transient Response Analysis
  • Video Games
  • Virtual Machines
  • Web Browsers

Fields of Study

  • Computer science

Readers

  • Database Systems and Applications
  • Distributed Systems and Data Platform Development
  • Parallel and Distributed Computing.