From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ramon Diaz-Uriarte Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 21:17:49 +0100 Message-ID: <877floffyq.fsf@gmail.com> References: Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:51663) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwbqN-0001kL-UE for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:18:14 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zwbq7-0006ts-LQ for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:18:11 -0500 Received: from mail-wm0-x230.google.com ([2a00:1450:400c:c09::230]:37501) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwbq7-0006tm-G3 for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:17:55 -0500 Received: by wmww144 with SMTP id w144so60037404wmw.0 for ; Wed, 11 Nov 2015 12:17:54 -0800 (PST) In-reply-to: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Matt Price Cc: Org Mode Dear Matt, On Wed, 11-11-2015, at 15:42, Matt Price wrote: > I've just written up a post on my workflow for PDF's Since my blog has, I > think, a readership of 0 (surely there's a way to get emacsers to follow > me? ah well), I will post a link here in the hopes that someone will be Add another 1 :-) > interested: > > http://matt.hackinghistory.ca/2015/11/11/note-taking-with-pdf-tools/ > Really neat! A few comments/questions/ramblings: - The type of highlights you get from RepliGo contain the text itself. I mean, when in your pdf I use C-c C-a l, the buffer showing the contents of each highlight contain the highlighted text. This is not what I get from, say, EzPDF (which is what I use on Android), or from highlighting from pdf-tools itself using C-c C-a h, or from highlighting from Okular. The contents just gives the rectangle). Hummmm... Because of this, when I use your code on my pdfs, I only get things such as Highlight ([[pdfview:/home/ramon/Zotero-data/storage/ESHHD4KW/Frank_2015_Commentary.pdf::5][Frank_2015_Commentary]], 5) instead of the text. Bummer! I wonder if RepliGO gives you a lot more than the rest, or if I am doing something silly. - You have to call mwp/pdf-multi-extract on each file/set of files. I guess if I knew elisp, I'd find it trivial to iterate over a set of directories and subdirectories (and do this using a cron job at night), and also place everything in one single org file. Would this be something reasonable to do? (This might be related to your second Todo) - I know nothing about how it works, and it does not use pdf-tools, but in your first Todo you mention: "extend the pdfview link type (in org-pdfview) to permit me to specify the precise location of an annotation,". PDF.js (https://mozilla.github.io/pdf.js/), which is used for instance by zotfile (http://zotfile.com/) does that and it works out of the box with Okular (but I've not been able to get it to work with pdftools). - In case it matters, I have somewhat similar modus operandi. I do a lot of PDF reading, including note-taking and highlighting, in android tablets ---I use EzPDF, which also embeds the notes in the PDF. I have a cron job that extracts all the highlights and annotations of all the PDFs and places them in a single org file. The kludge is explained here: https://github.com/rdiaz02/Adios_Mendeley#extracting-all-pdf-annotations-and-placing-them-in-an-org-mode-file The truth is I use two mechanisms for PDF annotation and highlighting extraction, since none is fully satisfactory to me, but the one that uses Ruby (i.e., that does not depend on poppler) is able to actually extract the text of the highlights. Best, and thanks again for sharing, R. > All best! > Matt -- Ramon Diaz-Uriarte Department of Biochemistry, Lab B-25 Facultad de Medicina Universidad Autónoma de Madrid Arzobispo Morcillo, 4 28029 Madrid Spain Phone: +34-91-497-2412 Email: rdiaz02@gmail.com ramon.diaz@iib.uam.es http://ligarto.org/rdiaz