This document records all known errors in the Fifth Edition of the Extensible Markup Language (XML) 1.0 Specification ; for updates see the latest version.
The errata are numbered, classified as Substantive or Editorial, and listed in reverse chronological order of their date of publication in each category. Changes to the text of the spec are indicated thus: deleted text, new text, modified text . Substantive corrections are proposed by the XML Core Working Group, which has consensus that they are appropriate; they are not to be considered normative until approved by a Call for Review of Proposed Corrections or a Call for Review of an Edited Recommendation.
Please email error reports to [email protected].
Delete the following paragraph:
Names beginning with the string "xml", or with any string which would match (('X'|'x') ('M'|'m') ('L'|'l')), are reserved for standardization in this or future versions of this specification.
Make the following change:
PIs are not part of the document's character data,
but must be passed through to the application. The PI begins with a
target (PITarget) used to identify the application to which the
instruction is directed.
The target names " XML ", " xml ", and so
on are reserved for standardization in this or future versions of
this specification.Target
names that begin "xml-
" are reserved for
standardization in this or future specifications from the
XML Core Working Group
or its successors. The target name "xml
" is
forbidden as it introduces the XML declaration.
The XML Notation mechanism may be used for
formal declaration of PI targets. Parameter entity references must
not be recognized within processing instructions.
Make the following changes:
This specification does not constrain the application
semantics, use, or (beyond syntax) names of the element types and
attributes, except that names beginning with
a match to
(('X'|'x')('M'|'m')('L'|'l'))
"xml:
"
are reserved for standardization in this or future
versions of this specification
specifications from the
XML Core Working Group
or its successors.
Add a Note at the very end of the section as follows:
Note:
[Unicode] (conformance clause C06) says that canonically equivalent sequences of characters ought to be treated as identical. However, XML parsed entities (including document entities) that are canonically equivalent according to Unicode but which use distinct code point (character) sequences are considered distinct by XML processors. Therefore, all XML parsed entities SHOULD be created in a "fully normalized" form per [CharMod-Norm]. Otherwise the user might unknowingly create canonically equivalent but unequal sequences that appear identical to the user but which are treated as distinct by XML processors.
A document can still be well-formed, even if it is not in a normalized form. XML processors MAY verify that the document being processed is in a fully-normalized form and report to the application whether it is or not.
Add a reference to CharMod-Norm: