From: Jambunathan K
Subject: Re: [patch][org-entities] More symbols
Date: Mon, 02 Sep 2013 18:17:43 +0530
Message-ID: <878uzfs7lk.fsf@gmail.com>
To: Rasmus
Cc: emacs-orgmode@gnu.org

Rasmus writes:

>> With some scripting, this pulling can be made less laborious but more
>> complete.
>
> Would you be able to get the HTML entities?  Nicolas said that Org
> "prefers" entity names due to encoding.  I can find the unicode number
> in Emacs, but not its name.  This is often the laborious part.

Why use the name when it is easier to use the numerical value?

Something like &#x2014; should be good for —.

(You can get the code value by doing C-u C-x = on the displayed
character.)

,----
| character: — (displayed as —) (codepoint 8212, #o20024, #x2014)
|                                                         ^^^^^^
|      name: EM DASH
`----

----------------------------------------------------------------

I see that the entity names are listed in

    http://www.w3.org/TR/xml-entity-names/byalpha.html

----------------------------------------------------------------

Load the above page within Emacs:

    M-x eww RET http://www.w3.org/TR/xml-entity-names/byalpha.html RET

or

    M-x browse-url-emacs RET http://www.w3.org/TR/xml-entity-names/byalpha.html RET
    M-x load-library RET shr RET
    M-x shr-render-buffer RET

Write the resulting buffer to an Org buffer or a text file.

Then C-s for the unicode codepoint, C-a to get the entity name.

You are done.
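
For anyone who would rather script the lookup than search a rendered
buffer, here is a rough sketch in Python.  It is not part of the Emacs
recipe above.  It assumes that the entity tables shipped in Python's
standard library (html.entities) agree with the W3C list for the
character you care about.  The helper name describe is made up for this
example.

```python
import unicodedata
from html.entities import codepoint2name, html5


def describe(ch: str) -> dict:
    """Collect code point, Unicode name and entity spellings for one character.

    Hypothetical helper for illustration only; it reads Python's bundled
    entity tables, not the byalpha.html page itself.
    """
    cp = ord(ch)
    # html5 maps entity names (usually with a trailing ';') to characters.
    # Invert it for this single character and drop the ';'.
    names = sorted({n.rstrip(";") for n, v in html5.items() if v == ch})
    return {
        "codepoint": cp,                          # decimal, as C-u C-x = shows
        "hex": f"#x{cp:X}",                       # hex, as C-u C-x = shows
        "unicode_name": unicodedata.name(ch, ""),
        "numeric_entity": f"&#x{cp:X};",          # works without knowing a name
        "html4_name": codepoint2name.get(cp),     # None if HTML 4 has no name
        "html5_names": names,
    }


info = describe("\u2014")  # EM DASH, the character discussed above
for key, value in info.items():
    print(f"{key}: {value}")
```

The numeric_entity field is the "just use the number" route.  The two
name fields are the scripted counterpart of searching the entity list
for the code point.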