From: Sebastian Rose
Subject: Re: HTML export: how to delimit escaped HTML entity?
Date: Thu, 28 Aug 2008 02:00:05 +0200
Message-ID: <48B5EA85.1090401@gmx.de>
In-Reply-To: <48B5D733.60008@mtheo.net>
Reply-To: sebastian_rose@gmx.de
To: org-mode list

What's in your XHTML head section? What does your Content-Type line
look like? That is, the meta line that declares the content type and
charset.

Change the 'utf-8' there to whatever encoding your exported HTML files
actually have.

You can detect the real encoding of the files by visiting such an
exported HTML file with Emacs. In the modeline on the left, you can see
some indicators, the second of which shows whether the encoding is
multibyte ('U') or single-byte ('1'). Click it to see a full description
of the file's encoding.

  |                                         |
  |                                         |
  |_________________________________________
  |-U--
    ^
    |
    Click

Some more on this:

The world is big. Consider switching your system to use UTF-8 everywhere.
This is what I do here. Every file (and every filename) is UTF-8 on my
system, and thus things like

  ö ß á Á € ø ·

  folder
  │
  ├> file
  └> another_file

are always displayed correctly. So is the XHTML export of my Org files.
The characters are not changed to their HTML entities at all.

Switch to UTF-8 and get rid of all your local problems :-)

The easiest way to do this is just to install a modern GNU/Linux
distribution (use Debian). Just one hour and you're done, once and for
all.

Regards,

- Sebastian

mtheo wrote:
> This is probably bonehead simple, but so far that bone seems to be
> broken here.
>
> The entities in org-html-entities work fine for me as long as followed
> by a space (or another \-escaped entity), but I can't seem to discover
> how they're delimited within a word.
>
> E.g.
> "Ren\eacute " produces "René "
>
> which is just fine; but I've failed to find a way to get the correct
>
> "Ástor"
>
> out of any combination of "\Aacute" and "stor".
>
> "\Aacutestor" -> "\Aacutestor"
> "\Aacute stor" (naturally) -> "Á stor"
>
> and nothing I've tried by way of analogy with shells (e.g.
> "\{Aacute}stor") or other syntax ("\Aacute\stor"? "\Aacute$stor"? etc.)
> has produced anything but the same string in the generated code.
>
> I'm sure my ignorance of {insert bonehead lacuna here} is showing, but
> does anyone know how to enter poor old Sr Piazzolla's given name into an
> org file so that it will export?
>
> Thanks -- and repeated & continual thanks to Carsten and all other
> generous contributors.
>
> Mark
>
>
> GNU Emacs 23.0.60.1 (i386-mingw-nt5.1.2600)
> org-version 6.06b
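
For anyone who would rather script the check than click the modeline, here is a rough sketch of the comparison described above: does the charset declared in the exported file match the bytes actually on disk? It is only an illustration. The helper names `declared_charset` and `decodes_as` are made up for this example, and the meta tag in the sample string is an assumption about what a typical exported head contains, not a copy of what Org actually writes.

```python
import re


def declared_charset(html_bytes):
    """Return the charset named in the document head, or None if absent."""
    # The head is ASCII-compatible in the common encodings, so a lossy
    # ASCII decode of the first couple of KB is enough to find the tag.
    head = html_bytes[:2048].decode("ascii", errors="replace")
    match = re.search(r'charset\s*=\s*["\']?([A-Za-z0-9._:-]+)', head, re.IGNORECASE)
    return match.group(1).lower() if match else None


def decodes_as(html_bytes, encoding):
    """Return True if the bytes are valid in the given encoding."""
    try:
        html_bytes.decode(encoding)
        return True
    except (UnicodeDecodeError, LookupError):
        # LookupError covers a declared charset that Python does not know.
        return False


# Hypothetical sample: the same markup saved in two different encodings.
sample = '<meta http-equiv="Content-Type" content="text/html;charset=utf-8"/><p>Ástor</p>'
utf8_bytes = sample.encode("utf-8")
latin1_bytes = sample.encode("latin-1")

for label, data in (("utf-8 file", utf8_bytes), ("latin-1 file", latin1_bytes)):
    charset = declared_charset(data)
    print(label, "declares", charset, "and matches:", decodes_as(data, charset))
```

The second sample declares utf-8 but was saved as Latin-1, which is the kind of mismatch that makes non-ASCII characters show up wrong in the browser. Note that a successful decode does not prove the declaration is right (Latin-1 accepts any byte sequence), so this only catches the obvious cases.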