From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Vladimir Alexiev" Subject: HTML export should leave IRIs as is Date: Sun, 8 Feb 2015 16:05:40 +0200 Message-ID: <056d01d043a8$53f5f8a0$fbe1e9e0$@bg> Reply-To: vladimir@sirma.bg Mime-Version: 1.0 Content-Type: text/plain; charset="windows-1251" Content-Transfer-Encoding: quoted-printable Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:49405) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YKSUj-0001zJ-Nz for emacs-orgmode@gnu.org; Sun, 08 Feb 2015 09:05:54 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YKSUe-0001uU-Nz for emacs-orgmode@gnu.org; Sun, 08 Feb 2015 09:05:53 -0500 Received: from pmx.sirma.bg ([93.123.103.134]:20590) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YKSUe-0001tO-5m for emacs-orgmode@gnu.org; Sun, 08 Feb 2015 09:05:48 -0500 Received: from pmx.sirma.bg (localhost [127.0.0.1]) by pmx.sirma.bg (Sirma mail system) with ESMTP id ED68C1400D for ; Sun, 8 Feb 2015 16:05:42 +0200 (EET) Received: from vladoa (unknown [88.128.80.149]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (Client did not present a certificate) by pmx.sirma.bg (Sirma mail system) with ESMTP id B291914002 for ; Sun, 8 Feb 2015 16:05:42 +0200 (EET) Content-Language: en-us List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org I have this in a file: - = [[https://chrome.google.com/webstore/detail/copy-url/mkhnbhdofgaendegcgbm= ndipmijhbili][CopyURL]] Chrome Plugin - Many URLs in dbpedia are International Resource Identifiers (IRIs) - When you copy from the browser's address bar, an IRI is URL-encoded - Unreadable ugliness like = http://bg.dbpedia.org/resource/%D0%9A%D0%BE%D0%BD%D1%81%D1%82%D0%B0%D0%BD= %D1%82%D0%B8%D0%BD%D0%B0_%D0%96%D0%B8%D0%B2%D0%BE%D0%B2%D0%B0 - Instead, this plugin allows you to copy & paste IRIs like = http://bg.dbpedia.org/resource/=CA=EE=ED=F1=F2=E0=ED=F2=E8=ED=E0_=C6=E8=E2= =EE=E2=E0 With default settings, the HTML export made both URLs the same = unreadable ugliness. I see no good reason for ox-html to escape IRIs, I think it should emit = them as IRIs. - I expected (setq org-html-use-unicode-chars t) to fix this, but for = some reason (mm-url-decode-entities) doesn't do it - This fixed the problem: (org-url-hexify-p nil), but I'm afraid it may = break something else. Furthermore, this fix converted BOTH URLs to an IRI. So to illustrate = the point, I had to change my org file to: - Unreadable ugliness like : = http://bg.dbpedia.org/resource/%D0%9A%D0%BE%D0%BD%D1%81%D1%82%D0%B0%D0%BD= %D1%82%D0%B8%D0%BD%D0%B0_%D0%96%D0%B8%D0%B2%D0%BE%D0%B2%D0%B0