Login  Register

Re: Encoding of special (German Umlaut) characters in html backend

Posted by LightGuardjp on Sep 09, 2013; 4:05pm
URL: https://discuss.asciidoctor.org/Encoding-of-special-German-Umlaut-characters-in-html-backend-tp548p550.html

If the generated files are UTF-8 encoded, why does it matter if they're transformed with the different entities? The whole idea of the different entities is so you can include characters from a different character set in something more restrictive.


On Mon, Sep 9, 2013 at 2:32 AM, ChrLipp [via Asciidoctor :: Discussion] <[hidden email]> wrote:
Hello!

I am aware of this thread: http://discuss.asciidoctor.org/German-special-Characters-incompatible-character-encodings-CP850-and-UTF-8-td175.html and therefore create my *.adoc files encoded in UTF-8 without BOM.

When I use the HTML backend I noticed (when looking at the source of the HTML) that the German Umlaut characters (ä, Ä, ö, Ö, ü, Ü, ß) are included in the HTML without any processing.
But in my opinion the chars should be transformed, e.g. ä should become &a uml;  (without the space, see http://www.w3schools.com/tags/ref_entities.asp)

In my opinion the transformation should also performed within the docbook backend (since this is XML).
What do you think?

Kind regards, Christian


If you reply to this email, your message will be added to the discussion below:
http://discuss.asciidoctor.org/Encoding-of-special-German-Umlaut-characters-in-html-backend-tp548.html
To start a new topic under Asciidoctor :: Discussion, email [hidden email]
To unsubscribe from Asciidoctor :: Discussion, click here.
NAML



--
Jason Porter
http://en.gravatar.com/lightguardjp