From mboxrd@z Thu Jan 1 00:00:00 1970 From: Matt Lundin Subject: Re: [WISH] Org Importers Date: Thu, 28 Oct 2010 20:00:32 -0400 Message-ID: <87aalxamvz.fsf@fastmail.fm> References: <87wrp45n2a.fsf@berkeley.edu> <878w1j65va.fsf@berkeley.edu> <877hh2235b.fsf@pinto.chemeng.ucl.ac.uk> <81d3quu2zm.fsf_-_@gmail.com> Mime-Version: 1.0 Content-Type: text/plain Return-path: Received: from [140.186.70.92] (port=40293 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PBcTe-00063a-3n for emacs-orgmode@gnu.org; Thu, 28 Oct 2010 20:05:54 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PBcOa-0006ur-Eh for emacs-orgmode@gnu.org; Thu, 28 Oct 2010 20:00:37 -0400 Received: from out2.smtp.messagingengine.com ([66.111.4.26]:54488) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PBcOa-0006tQ-Bq for emacs-orgmode@gnu.org; Thu, 28 Oct 2010 20:00:36 -0400 In-Reply-To: <81d3quu2zm.fsf_-_@gmail.com> (Jambunathan K.'s message of "Thu, 28 Oct 2010 14:07:49 +0530") List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Jambunathan K Cc: Eric S Fraga , emacs-orgmode@gnu.org Jambunathan K writes: >> However, what would be ideal would be if there were a tool which would >> take a Word document with /track changes/ and generate a patch file for >> a text version of that document... that could then provide some >> mechanism for getting changes back into an org document (modulo problems >> with line re-arrangements unfortunately). Just a pipe dream... > > If you allow me some liberty, the suggestion is in two parts > > 1. Word/Pdf/Latex->Org converter > > I have suggested or hinted elsewhere (in a babel thread) the need for > importing in to Org from other formats. FWIW, I have a home-brewed perl script that converts latex documents to org-mode files. Loosely based on latex2doc[1], it uses a latex style file (generated by the perl script) to markup the pdf output (e.g. asterisks for section headings, etc.). The org-formatted pdf is then converted to plain text with pdftotext. It's a crude and by no means comprehensive hack designed to meet my own peculiar needs. But I'd be happy to share it on Worg (with ample disclaimers) if anyone is interested. Best, Matt Footnotes: [1] http://www.dur.ac.uk/p.j.heslin/Software/Latex/latex2doc.php