From: torys.anderson@gmail.com (Tory S. Anderson)
To: Matt Price <moptop99@gmail.com>
Cc: Org Mode <emacs-orgmode@gnu.org>
Subject: Re: HTML --> Org-mode?
Date: Mon, 26 Jan 2015 15:31:10 -0500
Message-ID: <87siex473l.fsf@gmail.com>
In-Reply-To: <CAN_Dec8m8V65epo9=tFt+sn3=3VmxAuS2Dt-Rsj_MkSMsBNCsQ@mail.gmail.com> (Matt Price's message of "Mon, 26 Jan 2015 15:15:59 -0500")
Using the magic wizard program Pandoc, I just had success with a simple little example:

    pandoc -o test.org test.html

Input test.html:

    <html>
      <body>
        <strong>TEST strong!</strong>
        <div class='table'>
          <div class='cell'>Cell 1</div>
          <div class='cell'>Cell 2</div>
          <div class='cell'>Cell 3</div>
          <div class='cell'>Cell 4</div>
        </div>
      </body>
    </html>

Output test.org:

    *TEST strong!*
    Cell 1
    Cell 2
    Cell 3
    Cell 4

I'm not sure how sophisticated the strings you are dealing with are, but pandoc might do the trick for you.
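
Since the question is about an HTML string rather than a file, here is a minimal sketch of the same conversion reading from standard input. It assumes pandoc is on the PATH; html_to_org is a made-up helper name for illustration, not something pandoc provides. The -f and -t options name the input and output formats explicitly, because there is no file extension to infer them from.

```shell
# html_to_org: hypothetical helper that converts an HTML string to Org syntax.
# Assumes the pandoc executable is installed and on the PATH.
html_to_org() {
  # Feed the string to pandoc on stdin; with no file names given,
  # pandoc reads stdin and writes the converted text to stdout.
  printf '%s' "$1" | pandoc -f html -t org
}
```

Calling html_to_org '<strong>TEST strong!</strong>' should print the same *TEST strong!* line as the file-based run above.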
- Tory
Matt Price <moptop99@gmail.com> writes:
> Hmm,
>
> Looks like I asked this about a year ago and didn't follow up on it.
> Does anyone know a way to generate org-mode syntax from an html
> string? I would like to extend zotxt slightly (see my last post) and
> at present zotxt can pull citations & bibliography entries from Zotero
> only in plain-text and HTML form. The plaintext form loses
> information, so I would like to translate the HTML into org-mode
> syntax.
>
> Since this would have to happen in the context of an
>
> (org-add-link-type )
>
> invocation, it would be best if this could be done directly in emacs
> somehow...
>
> Thanks as always,
>
> Matt