SDATA or UNICODE
Paul Prescod
papresco at technologist.com
Wed Jan 28 22:30:18 GMT 1998
On Wed, 28 Jan 1998, Gavin McKenzie wrote:
>
> XML provides a way for specifying the encoding of an entity with the
> ?XML pi encoding declaration. Why wouldn't this be sufficient. If the
> euro or florin symbol is available in some non-Unicode character
> encoding scheme, isn't it sufficient to encode the text which requires
> the symbol in the appropriate scheme and use the encoding declaration?
No, for the reason Tim points out. On the other hand, you might be on the
right track. A processing instruction would serve as a hack to tell the
application where to insert the euro. <?EURO>
> On a related note...I have felt that it should be possible to attach the
> encoding declaration to any element in a manner similar to xml:lang.
> Typically our customers (who often are not able to make use of Unicode)
> require the ability to switch from one character encoding scheme to
> another on the fly within the same physical document (e.g. switching
> from Shift-JIS to Latin-1 and back). Referencing an external entity
> makes it possible, but not acceptable for our customers.
Egad. This is one of those things that is a good idea at the user level,
but would make implementation prohibitive. Imagine the poor desperate
Python hacker trying to "grep" through that!
I think you should implement a language that allows this and is preprocessed
into XML. If I were you I would use marked sections and not attributes to
describe the boundaries. Marked sections are really easy to scan for.
Paul Prescod
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
More information about the Xml-dev
mailing list