Just how many types of anxiety are there, anyway? I got to thinking about this as I read a blog that mentioned "RSS Anxiety." For those of you who have not yet come face-to-face with this little acronym, it stands for Real Simple Syndication and it spreads whatever you want all over the internet, virtually creating an immortal life all its own.
Can you kill an idea once it is out on the internet? No. Can you try to correct it? Yes, but you'll never accomplish this goal. -- Patricia Farrell, author of How to Be Your Own Therapist
Earlier this week, the Press-Enterprise newspaper of Riverside, Calif., ran a business story about an industrial zone six miles from the border of North and South Korea that seeks to strengthen ties between the bitter, long-warring rivals.
The newspaper gave the article the headline, "Business park makes ties that build Korean détente."
The word détente is one of many in English that includes a diacritical mark, a symbol such as an accent, cedilla or diaeresis.
The paper included the story in a Really Simple Syndication feed. If the editors take a look at how the headline appears in some of the leading RSS software, they'll discover one of the unfortunate realities of working with the format:
RSS does not allow détente.
The Press-Enterprise gave the headline the following formatting:
<title>Business park makes ties that build Korean d&eacute;tente</title>
The "d&eacute;tente" part is an attempt to get an RSS reader to produce the output "détente". The "é" in détente is an acute accent diacritical mark, and one way to write one on the web is by using the HTML entity é.
The following screen captures show how this headline appears in eight highly popular RSS readers and web browsers:
Bloglines:
BottomFeeder 4.1:
FeedDemon 1.5:
Microsoft Internet Explorer 7:
Mozilla Firefox:
My Yahoo
Newsgator Online:
Opera 9:
As you can see, five of the eight display détente and the other three display "détente," including the two most popular web browsers on the planet. The difference occurs because the first group expects an RSS item's title to contain HTML, while the second group expects it to be plain text.
The simplest conclusion is that one group's not implementing the RSS title element properly, but there's nothing simple about the issue in the current specification. The spec states that an item's description can be HTML, but it doesn't state whether any other elements can do likewise.
One section of the draft specification attempts to solve the problem in this manner:
For all elements defined in this specification that enclose character data, the text must be interpreted as plain text with the exception of an item's description element, which must be suitable for presentation as HTML.
Dave Winer declared today that the war to clarify the spec is over and everybody won:
We live with the imperfections of RSS 2.0, because that's the way life is. Nothing and no one is exactly as we'd like them to be.
If that's supposed to be the final word on RSS, can somebody tell me how to build détente?
