From mboxrd@z Thu Jan 1 00:00:00 1970 From: Fabrice Popineau Subject: Re: HTML2Org ? Date: Wed, 31 Jul 2013 00:15:35 +0200 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/alternative; boundary=001a1132f7e2eaf97b04e2c1f3a5 Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:50298) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1V4Inh-0006zn-Im for emacs-orgmode@gnu.org; Tue, 30 Jul 2013 18:53:57 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1V4Inc-0001VU-Kv for emacs-orgmode@gnu.org; Tue, 30 Jul 2013 18:53:53 -0400 Received: from mail-ea0-x234.google.com ([2a00:1450:4013:c01::234]:50704) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1V4ICy-0004Kg-BZ for emacs-orgmode@gnu.org; Tue, 30 Jul 2013 18:15:56 -0400 Received: by mail-ea0-f180.google.com with SMTP id h10so1362174eaj.25 for ; Tue, 30 Jul 2013 15:15:55 -0700 (PDT) In-Reply-To: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Neil Smithline Cc: Org Mode --001a1132f7e2eaf97b04e2c1f3a5 Content-Type: text/plain; charset=ISO-8859-1 I was wondering about something doing the reverse of the exporter: get some fragment of Org text from an exported HTML fragment. However, it won't be much easier: links, macros, babel ... I guess only the basic markup could be reversed. Fabrice 2013/7/31 Neil Smithline > How would you get the document structure our of the HTML unless it only > used heading tags? > > Even something as simple as bold could be hidden within some monstrous > CSS. > > From my mobile. Please excuse abbrvs, tpyos, and auto ward correction. > On Jul 30, 2013 5:36 PM, "Fabrice Popineau" > wrote: > >> Anybody tried to write an HTML to Org parser (even a crude one) ? >> >> Best regards, >> >> Fabrice >> > --001a1132f7e2eaf97b04e2c1f3a5 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
I was wondering about something doing the reverse of = the exporter: get some fragment of Org text
from an exported HTML= fragment.
However, it won't be much easier: links, macros, b= abel ... I guess only the basic markup could be reversed.

Fabrice

2013/7/31 Neil Smithline = <em= acs-orgmode@neilsmithline.com>

How would you get the documen= t structure our of the HTML unless it only used heading tags?

Even something as simple as bold could be hidden within some= monstrous CSS.

From my mobile. Please excuse abbrvs, tpyos, and auto ward c= orrection.

On Jul 30, 2013 5:36 PM, "Fabrice Popineau&= quot; <f= abrice.popineau@gmail.com> wrote:
Anybody tried to write an HTML to Org parser (even a crude= one) ?

Best regards,

Fabrice

--001a1132f7e2eaf97b04e2c1f3a5--