From: Daniel Szmulewicz
Subject: utf8x and org-mode 8
Date: Mon, 25 Aug 2014 08:31:41 +0300
Message-ID: <22F1E9A4-20F3-4043-9556-F8D3CF073891@gmail.com>
To: emacs-orgmode@gnu.org

Hi,

I understand that org-latex derives the encoding from the buffer.
In the ".tex" file produced by the exporter, I see this line:

\usepackage[utf8]{inputenc}

However, I would like to set it to:

\usepackage[utf8x]{inputenc}

I need to account for special French spacing characters: "espaces insécables" (non-breaking spaces).

Old tricks to achieve this with previous org-mode versions don't seem to work.
What is the best way to export an org buffer and have it set to utf8x with the latest org-mode version?

Thank you in advance.

--
Daniel Szmulewicz
daniel.szmulewicz@gmail.com

From: Sebastien Vauban
Subject: Re: utf8x and org-mode 8
Date: Mon, 25 Aug 2014 09:55:20 +0200
Message-ID: <86wq9xj95j.fsf@somewhere.org>
References: <22F1E9A4-20F3-4043-9556-F8D3CF073891@gmail.com>

Daniel Szmulewicz wrote:
> I understand that org-latex derives the encoding from the buffer.
> In the ".tex" file produced by the exporter, I see this line:
> \usepackage[utf8]{inputenc}
>
> However, I would like to set it to:
> \usepackage[utf8x]{inputenc}
>
> I need to account for special French spacing characters: "espaces insécables".
>
> Old tricks to achieve this with previous org-mode versions don't seem to work.
> What is the best way to export an org buffer and have it set to utf8x with the latest org-mode version?

AFAICT, when using the `utf8' option to `inputenc' (and not `utf8x', which
should be avoided, as it uses `ucs', which is no longer maintained), we have
two solutions to support the UTF-8 non-breaking space:

- Convert it in Org mode, when exporting (via a filter):

  #+begin_src emacs-lisp
  ;; filter for non-breaking spaces
  (defun leuven--latex-filter-nbsp (text backend info)
    "Convert non-breaking spaces when exporting to LaTeX/Beamer."
    (when (memq backend '(latex beamer))
      ;; "\u00A0" is U+00A0 (NO-BREAK SPACE); "~" is the LaTeX tie
      (replace-regexp-in-string "\u00A0" "~" text)))

  (add-to-list 'org-export-filter-plain-text-functions
               'leuven--latex-filter-nbsp)
  #+end_src

- or convert it in LaTeX:

  #+begin_src emacs-lisp :tangle no
  ;; convert `nbsp' to its LaTeX equivalent
  (add-to-list 'org-latex-packages-alist
               (concat "\\ifdefined\\DeclareUnicodeCharacter{"
                       "\\DeclareUnicodeCharacter{00A0}{~}"
                       "}\\fi") t)
  #+end_src

I think the first one is "better". Its drawback is that a filter lives on
the user side, so the Org file is no longer easy to share: recipients must
also add the filter to their own .emacs file.

Best regards,
Seb

-- 
Sebastien Vauban
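
A possible way around the sharing drawback, sketched here as an assumption
rather than something proposed in the thread: put the LaTeX-level
declaration in the Org file itself, so that it travels with the document.
The snippet below is the LaTeX that would end up in the preamble; in the
Org file, each line would be prefixed with the standard `#+LATEX_HEADER:'
keyword. It assumes pdfLaTeX with `inputenc' loaded before these lines; the
`\ifdefined' guard is carried over from the second solution above.

#+begin_src latex
% map U+00A0 (non-breaking space) to a LaTeX tie
\ifdefined\DeclareUnicodeCharacter
  \DeclareUnicodeCharacter{00A0}{~}
\fi
#+end_src

With this in the file header, no change to .emacs should be needed on the
recipient's side, at the cost of tying the fix to the LaTeX back-end only.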