Structure Based Identification of XML Documents

Abstract

This report results from a contract tasking Pazmany-Eotvos Foundation of Natural Sciences and Informatics as follows: The contractor shall have the following topics investigated under the supervision of Professor Andras Lorincz. Application of the eXtensible Markup Language (XML) to building large-scale information resources out of millions of interrelated documents will be investigated. Potential advantages of XML for improvement of document processing by separating presentation from content and by revealing semantic structure of the documents will be exploited.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 21, 2001
Accession Number
ADA398186

Entities

People

  • Andras Lorincz

Tags

Communities of Interest

  • C4I

DTIC Thesaurus Topics

  • Abstracts
  • Data Mining
  • Department Of Defense
  • Histograms
  • Identification
  • Information Operations
  • Instructions
  • Probability
  • Probability Distributions
  • Standards
  • Three Dimensional
  • Two Dimensional

Readers

  • Database Systems and Applications
  • Neural Network Machine Learning.
  • Technical Research and Report Writing.