Developing a Modular Framework for Implementing a Semantic Search Engine

Abstract

Current methods of information retrieval (IR) are adequate for everyday search needs, but they are not appropriate for many military and industrial tasks. Typical search methods rely on keyword matching, which performs poorly on the highly technical requirements documents found in the field of acquisitions. Instead of matching keywords, an IR method that understands the meaning of the words in a query is needed to provide the necessary performance over these types of documents; this is known as semantic search. This work applies sound software engineering practices to specify, design, and develop a modular framework that aids in the design, testing, and development of new semantic search methods and of IR techniques in general. The development of the Modular Search Engine framework is documented in its entirety, from user-needs analysis to the production of a full application programming interface. By exploiting polymorphism and object-oriented programming in the Java programming language, users are able to design new IR techniques that function seamlessly within the framework. Finally, a reference implementation is provided as a proof of concept to demonstrate the capabilities and usefulness of the framework design.
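The abstract does not reproduce the framework's API, so the following is only a minimal sketch of how polymorphism could let a new IR technique plug into an engine without changes to the engine itself. Every name here (RetrievalMethod, KeywordMatcher, SynonymMatcher, SearchEngine) is hypothetical and is not taken from the report. The synonym lookup is a toy stand-in for semantic search, not the method the report develops.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical plug-in point; NOT the actual Modular Search Engine API.
interface RetrievalMethod {
    /** Returns a relevance score for a document given a query; higher is better. */
    double score(String query, String document);
}

// Baseline technique: counts query terms that occur verbatim in the document.
class KeywordMatcher implements RetrievalMethod {
    @Override
    public double score(String query, String document) {
        List<String> docTerms =
                Arrays.asList(document.toLowerCase(Locale.ROOT).split("\\W+"));
        double hits = 0;
        for (String term : query.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (!term.isEmpty() && docTerms.contains(term)) {
                hits++;
            }
        }
        return hits;
    }
}

// Toy stand-in for a semantic technique (an assumption for illustration):
// expands the query with hand-supplied synonyms, then reuses keyword matching.
class SynonymMatcher implements RetrievalMethod {
    private final Map<String, Set<String>> synonyms;
    private final RetrievalMethod base = new KeywordMatcher();

    SynonymMatcher(Map<String, Set<String>> synonyms) {
        this.synonyms = synonyms;
    }

    @Override
    public double score(String query, String document) {
        StringBuilder expanded = new StringBuilder(query);
        for (String term : query.toLowerCase(Locale.ROOT).split("\\W+")) {
            for (String syn : synonyms.getOrDefault(term, Collections.<String>emptySet())) {
                expanded.append(' ').append(syn);
            }
        }
        return base.score(expanded.toString(), document);
    }
}

// The engine depends only on the interface, so techniques can be swapped freely.
class SearchEngine {
    private final RetrievalMethod method;

    SearchEngine(RetrievalMethod method) {
        this.method = method;
    }

    /** Returns the documents with a positive score, best match first. */
    List<String> search(String query, List<String> corpus) {
        List<String> results = new ArrayList<>();
        for (String doc : corpus) {
            if (method.score(query, doc) > 0) {
                results.add(doc);
            }
        }
        results.sort((a, b) ->
                Double.compare(method.score(query, b), method.score(query, a)));
        return results;
    }
}
```

In this sketch, constructing SearchEngine with a KeywordMatcher or a SynonymMatcher changes the ranking behavior while the engine code stays untouched, which is the kind of substitution the abstract attributes to polymorphism.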

Document Details

Document Type: Technical Report
Publication Date: Feb 12, 2010
Accession Number: ADA555393

Entities

People

  • Brian M. Hawkins

Organizations

  • Naval Postgraduate School

Tags

Communities of Interest

  • C4I
  • Energy and Power Technologies
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Acquisition
  • Algorithms
  • Business Administration
  • Computer Programming
  • Computer Programs
  • Computers
  • Engineering
  • Graphical User Interface
  • Information Retrieval
  • Java Programming Language
  • Language
  • Logistics
  • Management Personnel
  • Object Oriented Programming
  • Organizational Structure
  • Programming Languages
  • Software Development

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Software Engineering

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval