emacs-orgmode@gnu.org archives
 help / color / mirror / code / Atom feed
From: torys.anderson@gmail.com (Tory S. Anderson)
To: Matt Price <moptop99@gmail.com>
Cc: Org Mode <emacs-orgmode@gnu.org>
Subject: Re: HTML --> Org-mode?
Date: Mon, 26 Jan 2015 15:31:10 -0500	[thread overview]
Message-ID: <87siex473l.fsf@gmail.com> (raw)
In-Reply-To: <CAN_Dec8m8V65epo9=tFt+sn3=3VmxAuS2Dt-Rsj_MkSMsBNCsQ@mail.gmail.com> (Matt Price's message of "Mon, 26 Jan 2015 15:15:59 -0500")

Using the magic wizard program Pandoc, I just had success with a simple little example: 

    pandoc -o test.org test.html

Input test.html:
    <html>
      <body>
          <strong>TEST strong!</strong>
        <div class='table'>
          <div class='cell'>Cell 1</div>
          <div class='cell'>Cell 2</div>
          <div class='cell'>Cell 3</div>
          <div class='cell'>Cell 4</div>
        </div>
      </body>
    </html>

Output test.org:
    *TEST strong!*
    Cell 1
    Cell 2
    Cell 3
    Cell 4

I'm not sure how sophisticated the strings you are dealing with, but pandoc might do the trick for you. 
- Tory


Matt Price <moptop99@gmail.com> writes:

> Hmm,
>
> Looks like I asked this about a year ago and didn't follow up on it.
> Does anyone know a way to generate org-mode syntax from an html
> string? I would like to extend zotxt slightly (see my last post) and
> at present zotxt can pull citations 7 bibliography entries from Zotero
> only in plain-text and HTML form. The plaintext form loses
> information, so I would like to translate the HTML into org-mode
> syntax. 
>
> Since this would have to happen in the context of an 
>
> (org-add-link-type )
>
> invocation, it would be best if this could be done directly in emacs
> somehow... 
>
> Thanks as always,
>
> Matt

  reply	other threads:[~2015-01-26 20:31 UTC|newest]

Thread overview: 10+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-01-26 20:15 HTML --> Org-mode? Matt Price
2015-01-26 20:31 ` Tory S. Anderson [this message]
     [not found]   ` <CAN_Dec_FD7ys9zoOQ3pNym+E_0=D=acHUapKYHbQWxjL=huoNA@mail.gmail.com>
     [not found]     ` <87a9154670.fsf@gmail.com>
2015-01-27  4:42       ` Matt Price
2015-01-27  8:23 ` Willem Rein Oudshoorn
2015-01-27 11:55   ` Matt Price
2015-01-27 13:58     ` Wim Oudshoorn
2015-01-27 18:59     ` Willem Rein Oudshoorn
2015-01-28  2:12       ` Matt Price
2015-01-27  8:27 ` Eric S Fraga
2015-01-27  9:51 ` Albert Krewinkel

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.orgmode.org/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87siex473l.fsf@gmail.com \
    --to=torys.anderson@gmail.com \
    --cc=emacs-orgmode@gnu.org \
    --cc=moptop99@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).