Notes about XML and XML Tools and Related Stuff: SGML, XSL, XSLT, UML, …
|
|
HMG Consulting 20044 Glen Brae Drive Saratoga, CA 95070 |
© 2002-6 H.M. Gladney 20-Aug-06 |
Standards Tracks and XML Standards Reference.
SOAP (Simple Object Access Protocol) and Java SAX (Simple API for XML)
Information Society Technologies (IST) Electronic Publishing
Standard Naming (XML at Builder.com, 30th Oct 02)
Digital Object Models (DOM, CSS, HTML, Web, XML, SVG, …)
W3C Specification: Namespaces in XML
Resource Directory Description Language (RDDL)
XML Name Access Control (NAC) Repository
W3C Releases XHTML 1.0 Spec (OpenEnterprise Trends 16th Aug 2002)
SRU protocol (query in the URL, XML as response):
Full Text Search Functionality in XQuery--a Status Report
XQEngine (formerly XML Query Engine)
James (Java SAX for MARC) Beta (9th May)
XOBIS: the XML Organic Bibliographic Information Schema
Introducing XOBIS to the FRBR Working Group (2003)
MarcXchange standard (April 2005)
Using Greenstone (Gordon Paynter to XML4Lib, 17th April 2002)
Wrapping Websites into XML for Archival Storage
TEI and XML: A Marriage made in Heaven
Create XML Document from Undefined Legacy Data
Sleepycat Berkeley DB XML (29-Jul-03)
IBM DB2 9 “Viper” Swallows XML Whole
Tools (some also shown by W3C main page)
Free XML tools and software (XML Tools by Category)
Web Developer’s Virtual Library
Tips for Unlocking XML’s Secret Powers
Logictran RTF Converter ($70 and $25 annual maintenance)
Metadata Resources from Nelinet
Extensible Document Level Metadata
Metadata for Science, Research, Education and Technology (Lopatenko)
Generation of Java from XML Schema
OOL {Java SAX filters for working with out-of-line markup}
Making a Document Searchable with Swish
xTagger: Authoring Document-centric XML
Summary Comment on XML Editors
Embedding XML into PDFs (Charles Myers, Adobe, 30th May)
READY YOUR ENTERPRISE NETWORK FOR XML
AlphaWorks XML and Web Services Development Environment (WSDE)
IBM XML Generator with VA Java 3.5
WebSphere Development Environment
XML Apache consists of seven sub-projects, each focused on a different aspect of XML:
Cocoon - XML-based web publishing, in Java (downloaded to g:\xml\downloads)
The eXtensible Rule Markup Language (XRML)
XML Security Library Releases Tools, Resources
White Paper: XML Web Services Security
XML Key Management Specification (XKMS) (March 2001)
XML-Signature Syntax and Processing
Draft ETSI TS on XML Advanced Electronic Signatures
eXtensible Access Control Language (XACL)
SAML for Security Assertions (OASIS XML-based Security Services TC)
XACML -- A No-Nonsense Developer's Guide
XML Security Risks (PC Magazine, April 2, 2002)
Whitepaper on Data Integration with MAPFORCE 2004
XML Schema, Structures, Datatypes, and Schema Registry
Comparing DTDs and XML Schema: Are DTD’s Dead?
XML Schema: Formal Description
From “Getting Started with XML Schemas”.
Data Exchange Descriptions with XML and Java
XML Schema for Schemas:Structures and Data Types
XML Schema for Particular Topics or Disciplines
More Specialized Discipline Metadata
From LandXML, for Land management
From NetBryx, XML Schema Files (also StyleSheets)
XOBIS: the XML Organic Bibliographic Information Schema
DTD for Content Model for Electronic Archiving and Publishing of Journal Articles
DATA VALIDATION WITH XML SCHEMAS
Maintaining Schemas for Pipelined Stages
Specify Dataset Needs in XML, not Code
Publishing Dissertations in XML at UMich
The XML Family of Technologies, Technology Watch Briefing 7, DigiCULT
Schema for Specific Domains and Disciplines
Common Lisp support for the Extensible Markup Language (CL-XML)
XML Encoding of Simple Dublin Core Metadata
U.S. Congress: XML and Legislative Documents
METS Opening Day Presentations
Data Dictionary for Technical Metadata for Digital Still Images
Digital Video: Video Development Initiative (ViDe)
XFDL (Extensible Forms Description Language)
SVG (Scalable Vector Graphics)
Books, Articles, and Tutorials
Using XSL tools to publish DocBook documents
XSLT-Mediated File Format Migration as a Digital Preservation Strategy
Quick Study: Improve XML Transformations
XInclude to Promote Modularity and Reusability
Tim Bray, XML and More, May 2000.
XML: The Digital Library Hammer, Tennant, March 2001.
LEARN HOW TO CREATE WEB PAGES IN OO STYLE (from Java Builder e-mail
Application profiles: mixing and matching metadata schemas
XML: Draft Specification for DC-based Application Profile
Appendix A: XML and XML Schema Samples
In XML with link to XSL Stylesheet
Poem and Recipe: all pieces for Web Display
In XML with link to XSL Stylesheet
Metadata Server in the CARMEN Project
Sample of a Dublin Core Description
DiML (Dissertation Markup Language - 1999)
XML for Molecular Biology: eXpressML
Markup Languages (LegalXML, NewsML, Commerce XML, …)
Appendix B: BNF for XML, XSL, XML Schema, DTD, …
BNF and EBNF: What are they and how do they work?
The extended Backus-Naur format (EBNF)
W3C XML Specification DTD (“XMLspec”)
UML, XMI, and code generation, Part 1
In this first article
in a new series on UML and XML schema development, Benoit discusses the motivations
for modeling XML schema through the use of UML.
..., Part 2
In the second part
of this series on UML and XML, Benoit introduces the UML metamodel. He proceeds
to XMI, the XML-based specification for the exchange of ...
..., Part 3
In his third
article on UML modeling and XML, Benoit further refines the conversion
stylesheet with the introduction of stereotypes and tags.
..., Part 4
In this final
article in his series on UML and XML, Benoit wraps up the technique. He
discusses the need to simplify the model by burying some of the logic ...
Using XSLT for content management
This is the first
installment of Working XML, a column with companion project code
that demonstrates the evolution of full-fledged XML applications.
Link management and preparing the future
This article shows how
to use XML filters to add new functionality to XM, an open-source Web publishing
application.
Fundamentals of Web publishing with XML
As more developers
learn and experiment with XML, many have become interested in using stylesheets
to publish and manage Web sites.
Define and load extension points
In this article, Benoit
takes integration between XM, the simple content-management solution, and
Eclipse one step further. Publishing a Web site requires ...
Creating a project
Work continues to
integrate Eclipse -- IBM's open-source project to build an extensible IDE for
Java developers -- and Benoit Marchal's simple ...
A lightweight XML client
While excellent
solutions are available for large corporations that want to implement XML, few
solutions exist for smaller organizations.
Processing instructions and parameters
This month our
hardworking columnist adds support for multiple style sheets to the XM
content-management project. In so doing, he taps into TrAX URIResolver ...
Compiling the proxy
In this column, Benoit
provides the front end for the Handler Compiler, HC, and encounters unexpected
problems with the DFA. A stable but less than optimal ...
A first version of the lightweight client
Benoit continues to
develop a lightweight XML client. In this article, he shows you how to create
SOAP transactions through XSLT.
Importing text as XML with XI
This column marks the
launch of the third 'Working XML' project. This new project deals
with importing text documents in an XML publishing solution (or any ...
Wrestling with Java NIO
This column takes the
XI project to the next step. Here, Benoît reports his findings with the new
Java technology APIs -- in particular, ...
Wrapping up XM version 1
In this month's column,
developer and author Benoît Marchal adds final features to the first release of
XM, a low-cost open-source content management ...
Compiling the paths and automating tests
This month, our
columnist discusses the compilation algorithm. He also invests a bit of time
automating tests with JUnit.
Building a compiler for the SAX ContentHandler
This installment of the
column describes the requirements for the Java project
and analyzes its overall design. The new project, called HC (short for ...
Compiling XPaths
This month our columnist
describes how he implements the DFA construction algorithm, giving the first
concrete example of using the compiler to recognize ...
Map files into SOAP requests, Part 1
Many applications are
being upgraded to accommodate e-commerce transactions. In the first of two
articles on the subject, Benoit Marchal shows one simple ...
Mapping files into SOAP requests, Part 2
Many applications are
being upgraded to accommodate e-commerce transactions. In his previous column,
Benoit Marchal analyzed legacy data and showed how to ...
Wrapping up XI
Columnist Benoît
Marchal continues to shape XI, an open-source project that converts legacy text
to XML. For increased efficiency, XI now implements the SAX ...
Putting XI to good use
When it comes to user
interfaces, simplification is the key. Fewer options and fewer controls mean
less confusion and less chance for error.
XML : Technical library view
... Take advantage of lessons learned
by refactoring XM ... Working XML:A first version of the
lightweight client ...
Link management and preparing the future
In this installment of Working
XML, Benoit Marchal uses XML filters to add new functionality to XM, his
open-source Web publishing application.
Importing text as XML with XI
Importing text as XML
with XI, e-mail it! Related content:.Subscribe to the developerWorks newsletter.
Also in the XML zone:. Tutorials ...
Wrapping up XM version 1
Wrapping up XM version
1, e-mail it! Managing a list of links and the table of contents.
Compiling the proxy
... br>");
writer.println(" * A project developed for the 'Working XML'
column at developerWorks."); writer.println(" * @see <a ...
Compiling the proxy
Compiling the proxy,
e-mail it! Related content:. Subscribe to the developerWorks newsletter. Also
in the XML zone:. Tutorials ...
Compiling XPaths
The Java-based Handler
Compiler (HC) project for SAX parsing nears its alpha release. This month our
columnist describes how he implements the DFA ...
Also Standards and Interoperability
NT Explorer: Simple Object Access Protocol (SOAP): Brett Burridge investigates the use of the Simple Object Access Protocol (SOAP), the XML-based protocol that is taking a leading role in the emerging area of Web Services.
OASIS, the Organization for the Advancement of Structured Information Standards, is a non-profit, international consortium that creates interoperable industry specifications based on public standards such as XML and SGML, as well as others that are related to structured information processing.
A number of emerging Web standards will provide much of the basic architecture for digital libraries (RDF, Dublin Core, INDECS, DIENST protocol, UNICODE, XML, Z39.50, etc.). Many of these standards have just begun to move from research to deployment. Implementers will need to follow the progress of research, while researchers will need to monitor the experience of early adopters. The refinement of a stable architecture for digital libraries will require several iterations as these standards are adapted for various applications. The Standardisation Forum focuses on issues related to the deployment of standard metadata schemas and of related infrastructures for publishing, registering, and cross-linking application schemas based on standards.
Several DELOS Standardization working groups will examine the relation between standards for high-level resource description, such as Dublin Core, and standards for fine-grained description within specialised applications, and how multiple standards can be used in combination to address the unique needs of applications. This issue was the focus of a metadata workshop held in Vienna on 30 June 2000 organised by the Standardisation Forum at an EC concertation event presenting projects sponsored by the Digital Heritage and Cultural Content programme, as well as of a workshop session on "Building Digital Library Portals with Harvested Metadata" held on 8 February 2001 at the First EU-DL All Projects Concertation meeting in Luxembourg.
· Agent’s Requirements in Digital Libraries
Notes about secure data exchange
How many times have you received an XML document from an external source and the data was all there, but you knew you would need to tweak it to make it work with your system? It's a common problem that is often the result of a lack of precision. Being more precise can alleviate problems with many common data types. Let's look at some examples of how you can create precision standards for your XML elements.
One of the most common problematic elements is one that includes name information. Here are a few of the many name formats:
* SCHAFFNER, BRIAN
* Brian T. Schaffner
* Mr. Brian Schaffner
* Rev. Dr. Brian Schaffner, III
There are several different formats for someone's name, and communicating it in XML can be confusing if there's not enough precision in your elements to hold each component. Let's try to create an all-encompassing name element that you can use for nearly any name.
We'll start with the first part of the name--the prefix. Of course, there may be more than one prefix (as in the Reverend Doctor), so you'll need space for multiple prefixes, and probably something that indicates their proper order. Next is the first or proper name followed by one or more middle names (or perhaps an initial, or perhaps no middle name). Nearly last is the last name or surname followed by the suffix (as in Jr.). When put together, it might end up like this:
<Name>
<Prefix order="1">
<Abbreviated>Rev.</Abbreviated>
<LongForm>Reverend</LongForm>
</Prefix>
<Prefix order="2">
<Abbreviated>Dr.</Abbreviated>
<LongForm>Doctor</LongForm>
</Prefix>
<Proper>Brian</Proper>
<Middle>T.</Middle>
<Surname>Schaffner</Surname>
<Suffix order="1">
<Abbreviated>III</Abbreviated>
<LongForm>the third</LongForm>
</Suffix>
</Name>
Another element in XML documents that frequently suffers from format abuse is telephone numbers. Here are a few of the many ways to format telephone numbers:
* (502) 555-1212
* 502.555.1212
* 5025551212
* 502-555-1212
* +1(502)-555-1212 x334
There certainly are others that aren't listed here. You can run into the same problem with phone numbers as you do with names. The solution is to determine what are all the possible components that a telephone number may include and create a model that supports them. That way you won't lose any information in your XML documents.
We've identified five important components of a telephone number above. They start with the country code and end with the extension number. Creating a comprehensive telephone number element shouldn't be too difficult with this information:
<TelephoneNumber>
<CountryCode>1</CountryCode>
<AreaCode>502</AreaCode>
<Exchange>555</Exchange>
<Number>1212</Number>
<Extension>334</Extension>
</TelephoneNumber>
My new book "Dictionary of XML Technologies and the Semantic Web" (Springer-Verlag, 2004) is now available for sale world-wide.
This 250-page hardcover includes over 1,800 terms/acronyms and 264 illustrations. An accompanying CD-ROM contains a searchable version of the dictionary.
For more information, visit Amazon http://www.amazon.com/exec/obidos/tg/detail/-/1852337680/ or see sample pages at Springer http://www.springeronline.com/sgw/cda/frontpage/0,10735,3-40109-22-17629028-0,00.html
tutorial at w3schools <http://www.w3schools.com/dom/default.asp> will get you into the XML DOM quickly. Perl shouldn't be any different from any other language; that's the point of the DOM.
The list had some 1840 postings through July 03, 2000. See in particular the post of C. M. Sperberg-McQueen on 2000-07-03: "Moving toward a decision", [Proposed by the Plenary:] "Proposed: to deprecate the use of relative URI references in namespace declarations; that is: to say that while t