From mboxrd@z Thu Jan 1 00:00:00 1970 From: Rainer M Krug Subject: Re: Extracting pdf metadata Date: Fri, 25 Mar 2011 09:34:10 +0100 Message-ID: <4D8C5382.3070809@gmail.com> References: Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Return-path: Received: from [140.186.70.92] (port=39410 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q32TL-0000ix-8w for emacs-orgmode@gnu.org; Fri, 25 Mar 2011 04:34:20 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q32TG-0003Dl-6C for emacs-orgmode@gnu.org; Fri, 25 Mar 2011 04:34:15 -0400 Received: from mail-ww0-f49.google.com ([74.125.82.49]:51649) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q32TG-0003Dc-1k for emacs-orgmode@gnu.org; Fri, 25 Mar 2011 04:34:14 -0400 Received: by wwb39 with SMTP id 39so357842wwb.30 for ; Fri, 25 Mar 2011 01:34:13 -0700 (PDT) In-Reply-To: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Marvin Doyley Cc: emacs-orgmode@gnu.org -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 25/03/11 02:21, Marvin Doyley wrote: > Hi there, > > Does anybody have a lisp code that can extract metadata from pdf. There > is an interesting program called sciplpore > (http://www.sciplore.org/software/sciplore_mindmapping/ that does this > for freemind), it might be useful if were able to do the same with org > (i.e., important pdf meta data, bookmark and stickies directly into org). As far as I remember, sciplore is not only extracting metadata embedded in th pdf, but also from the text - they submit it to a server which uses the academic article and compares it to layouts from different publishers and uses those to extract bibliographic information from the text. If that is what you want, then it might be considerably more difficult then just extracting embedded metadata. Cheers, Rainer > > > Cheers > > M > > PS I think one of my goals this summer will be to learn lisp :) - -- Rainer M. Krug, PhD (Conservation Ecology, SUN), MSc (Conservation Biology, UCT), Dipl. Phys. (Germany) Centre of Excellence for Invasion Biology Natural Sciences Building Office Suite 2039 Stellenbosch University Main Campus, Merriman Avenue Stellenbosch South Africa Tel: +33 - (0)9 53 10 27 44 Cell: +27 - (0)8 39 47 90 42 Fax (SA): +27 - (0)8 65 16 27 82 Fax (D) : +49 - (0)3 21 21 25 22 44 Fax (FR): +33 - (0)9 58 10 27 44 email: Rainer@krugs.de Skype: RMkrug -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iEYEARECAAYFAk2MU4IACgkQoYgNqgF2egqKNgCdH5J+8IOb8Sz5jjultIDXI/yU noUAnA++JSXpB7zMaY/bdNOWG8PppXGF =Fl62 -----END PGP SIGNATURE-----