Enhancing a Web Crawler with Arabic Search Capability

Abstract

Many advantages of the Internet (ease of access, limited regulation, vast potential audience, and fast flow of information) have turned it into the most popular way to communicate and exchange ideas. Criminal and terrorist groups also exploit these advantages, turning the Internet into a new playing field and battlefield for their illegal and terrorist activities. There are millions of Web sites in different languages on the Internet, but the lack of foreign language search engines makes it impossible to analyze foreign language Web sites efficiently. This thesis will enhance an open source Web crawler with Arabic search capability, thus improving an existing social networking tool to perform page correlation and analysis of Arabic Web sites. A social networking tool with Arabic search capabilities could become a valuable tool for the intelligence community. Its page correlation and analysis results could be used to collect open source intelligence and build a network of Web sites that are related to terrorist or criminal activities.

Document Details

Document Type
Technical Report
Publication Date
Sep 01, 2010
Accession Number
ADA558705

Entities

People

  • Qui V. Nguyen

Organizations

  • Naval Postgraduate School

Tags

Communities of Interest

  • Weapons Technologies

DTIC Thesaurus Topics

  • Algorithms
  • Arabic Language
  • California
  • Commerce
  • Computers
  • Information Retrieval
  • Information Science
  • Internet
  • Language
  • Network Science
  • Networks
  • Operating Systems
  • Probability
  • Terrorists
  • United States
  • Websites
  • World Wide Web

Fields of Study

  • Computer science

Readers

  • Computer Networking
  • Geospatial Intelligence and Artificial Intelligence Analytics
  • Political Violence and Terrorism Studies