Markup language validating system

In particular, it explains XML--the simple new language being developed by a working group (WG) of the W3 Consortium.

markup language validating system-72

Originally published as an HTML file, this paper was part of the CDROM e-publication ECHET96 ("Electronic Conference on Heterocyclic Chemistry"), run by Henry Rzepa, Chris Leach, and others at Imperial College, London, U. The CDROM was sponsored by the Royal Society of Chemistry, who (along with Cambridge, Leeds, and IC) are participants in the CLIC project. As part of this project I have been developing CML, one of the first applications of XML.

CML, and its associated software JUMBO, probably represented one of the first complete XML applications (authoring tools, documents, and browser) in any discipline.

For that reason, complex systems such as natural language processing (NLP) are required to extract implicit information from the documents, and they rely on having appropriate text to analyze.

Automatic extraction of numerical and other nontextual information will be much more difficult.

The paper was originally written as a simple structured document, using HTML, although it could have been written in CML.

I shall slant it towards those who wish to carry ; if you are comfortable with object-oriented languages you may like to substitute "object" for "document." In practice, XML documents can be directly and automatically transformed into objects, although the reverse may not always be quite so easy.

Markup languages are often seen as key tools in making them "future-proof" and interchangeable between applications (interoperability).

The important point about the XML approach is that it has been designed to separate different parts of the problem and to solve them independently.

Although the CML component was essentially a proof-of-concept, it was robust enough to be distributed as a standalone Java-based XML application.

Tags: , ,