From mboxrd@z Thu Jan 1 00:00:00 1970 From: Matt Price Subject: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 09:42:13 -0500 Message-ID: Mime-Version: 1.0 Content-Type: multipart/alternative; boundary=047d7b41411eff32e6052444d409 Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:36114) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwWbH-0004YD-8v for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 09:42:16 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZwWbG-00020J-DA for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 09:42:15 -0500 Received: from mail-ig0-x236.google.com ([2607:f8b0:4001:c05::236]:36960) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwWbG-00020F-8J for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 09:42:14 -0500 Received: by igbhv6 with SMTP id hv6so22341251igb.0 for ; Wed, 11 Nov 2015 06:42:13 -0800 (PST) List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Org Mode --047d7b41411eff32e6052444d409 Content-Type: text/plain; charset=UTF-8 I've just written up a post on my workflow for PDF's Since my blog has, I think, a readership of 0 (surely there's a way to get emacsers to follow me? ah well), I will post a link here in the hopes that someone will be interested: http://matt.hackinghistory.ca/2015/11/11/note-taking-with-pdf-tools/ All best! Matt --047d7b41411eff32e6052444d409 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
I've just written up a post on my workflow f= or PDF's Since my blog has, I think, a readership of 0 (surely there= 9;s a way to get emacsers to follow me? ah well), I will post a link here i= n the hopes that someone will be interested:

http://matt.hacki= nghistory.ca/2015/11/11/note-taking-with-pdf-tools/

All be= st!
Matt
--047d7b41411eff32e6052444d409-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Kaushal Modi Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 09:59:59 -0500 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:40722) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwWtC-0003Pf-5q for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 10:00:50 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZwWt6-0006Xm-Ln for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 10:00:46 -0500 Received: from mail-ob0-x232.google.com ([2607:f8b0:4003:c01::232]:33251) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwWt6-0006Xa-By for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 10:00:40 -0500 Received: by obbww6 with SMTP id ww6so23680911obb.0 for ; Wed, 11 Nov 2015 07:00:39 -0800 (PST) In-Reply-To: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Matt Price Cc: Org Mode Have you had a look at https://github.com/rudolfochrist/interleave The interleave package works great for me: taking notes in an org-mode buffer in one side (buffer) corresponding to the pdf page open in pdf-tools on the other side (buffer). From mboxrd@z Thu Jan 1 00:00:00 1970 From: Xebar Saram Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 17:06:06 +0200 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/alternative; boundary=001a114939046d93170524452afc Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:41692) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwWyO-0004GS-IX for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 10:06:09 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZwWyN-0007hJ-Ha for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 10:06:08 -0500 Received: from mail-yk0-x22b.google.com ([2607:f8b0:4002:c07::22b]:36051) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwWyN-0007hF-Ah for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 10:06:07 -0500 Received: by ykdr82 with SMTP id r82so52307519ykd.3 for ; Wed, 11 Nov 2015 07:06:06 -0800 (PST) In-Reply-To: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Matt Price Cc: Org Mode --001a114939046d93170524452afc Content-Type: text/plain; charset=UTF-8 thanks for the interesting post! you now have 0+1 readers (just subscribed via rss) ;-) best Z On Wed, Nov 11, 2015 at 4:42 PM, Matt Price wrote: > I've just written up a post on my workflow for PDF's Since my blog has, I > think, a readership of 0 (surely there's a way to get emacsers to follow > me? ah well), I will post a link here in the hopes that someone will be > interested: > > http://matt.hackinghistory.ca/2015/11/11/note-taking-with-pdf-tools/ > > All best! > Matt > --001a114939046d93170524452afc Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
thanks for the interesting post!

you no= w have 0+1 readers (just subscribed via rss) =C2=A0;-)

=
best

Z
=
On Wed, Nov 11, 2015 at 4:42 PM, Matt Price = <moptop99@gmail.com> wrote:
I've just written up a post on my workflo= w for PDF's Since my blog has, I think, a readership of 0 (surely there= 's a way to get emacsers to follow me? ah well), I will post a link her= e in the hopes that someone will be interested:

http://matt.hackinghistory.ca/2015/11/11/note-taking-with-pdf-tools/

All best!
=
Matt
=

--001a114939046d93170524452afc-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Russell Adams Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 16:10:39 +0100 Message-ID: <20151111151039.GG4207@cardamom.adamsinfoserv.com> References: Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:42843) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwX3X-0006qI-Ll for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 10:11:28 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZwX3T-00009l-Em for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 10:11:27 -0500 Received: from kefka.asoshared.com ([65.99.237.198]:32851) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwX3T-0008Vd-AR for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 10:11:23 -0500 Received: from 5ed252e6.cm-7-3b.dynamic.ziggo.nl ([94.210.82.230]:42688 helo=localhost) by kefka.asoshared.com with esmtpsa (TLSv1.2:DHE-RSA-AES128-SHA256:128) (Exim 4.86) (envelope-from ) id 1ZwX2n-0000LU-GE for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 10:10:41 -0500 Content-Disposition: inline In-Reply-To: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org On Wed, Nov 11, 2015 at 09:42:13AM -0500, Matt Price wrote: > I've just written up a post on my workflow for PDF's Since my blog has, I > think, a readership of 0 (surely there's a way to get emacsers to follow > me? ah well), I will post a link here in the hopes that someone will be > interested: > > http://matt.hackinghistory.ca/2015/11/11/note-taking-with-pdf-tools/ > > All best! > Matt In the past I have just had the PDF open in emacs (docview?), made a bookmark, and inserted that into Org. It only links to the page though, not the text. Thanks. ------------------------------------------------------------------ Russell Adams RLAdams@AdamsInfoServ.com PGP Key ID: 0x1160DCB3 http://www.adamsinfoserv.com/ Fingerprint: 1723 D8CA 4280 1EC9 557F 66E8 1154 E018 1160 DCB3 From mboxrd@z Thu Jan 1 00:00:00 1970 From: Peter Davis Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 10:15:44 -0500 Message-ID: <6sw6c7io58efdr.fsf@pfdstudio.com> Mime-Version: 1.0 Content-Type: text/plain Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:44401) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwX8F-0002C2-Ux for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 10:16:45 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZwX7j-0001i8-Tl for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 10:16:19 -0500 Received: from out4-smtp.messagingengine.com ([66.111.4.28]:44275) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwX7j-0001hq-Mk for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 10:15:47 -0500 Received: from compute5.internal (compute5.nyi.internal [10.202.2.45]) by mailout.nyi.internal (Postfix) with ESMTP id 0C54A20A9D for ; Wed, 11 Nov 2015 10:15:47 -0500 (EST) List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Matt Price Cc: Org Mode Matt Price writes: > I've just written up a post on my workflow for PDF's Since my blog > has, I think, a readership of 0 (surely there's a way to get emacsers > to follow me? ah well), I will post a link here in the hopes that > someone will be interested: > > http://matt.hackinghistory.ca/2015/11/11/note-taking-with-pdf-tools/ Thanks, Matt. This looks interesting. I've run into poppler before, but was not familiar with pdf-tools. I'll have to take a closer look. -pd From mboxrd@z Thu Jan 1 00:00:00 1970 From: Memnon Anon Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 17:09:40 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain Return-path: Received: from eggs.gnu.org ([208.118.235.92]:53912) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwZ1C-00018H-3r for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 12:19:49 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZwYz1-0000bg-0q for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 12:17:10 -0500 Received: from plane.gmane.org ([80.91.229.3]:47412) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwYz0-0000bZ-Qv for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 12:14:54 -0500 Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1ZwYyz-0007mw-2L for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 18:14:53 +0100 Received: from odin.sdf-eu.org ([178.63.35.194]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 11 Nov 2015 18:14:53 +0100 Received: from memnon+usenet by odin.sdf-eu.org with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 11 Nov 2015 18:14:53 +0100 List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org Matt Price writes: > [...] Since my blog > has, I think, a readership of 0 (surely there's a way to get emacsers > to follow me? ah well), I will post a link here [...] http://planet.emacsen.org/ -- /-------------------------------------------\ | SDF and SDF-EU Public Access UNIX System | | http://sdf.org || http://sdf-eu.org | ============================================= From mboxrd@z Thu Jan 1 00:00:00 1970 From: Jeffrey DeLeo Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 08:40:13 -0800 Message-ID: <87lha4xzf6.fsf@JUbuntu.dharma.lan> References: Mime-Version: 1.0 Content-Type: text/plain Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:52336) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwaEh-00076X-OL for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 13:35:13 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZwaEe-0005j7-Iu for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 13:35:11 -0500 Received: from plane.gmane.org ([80.91.229.3]:52691) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwaEe-0005ie-CY for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 13:35:08 -0500 Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1ZwaEc-00034c-Mz for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 19:35:06 +0100 Received: from 173-228-85-118.dsl.dynamic.fusionbroadband.com ([173.228.85.118]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 11 Nov 2015 19:35:06 +0100 Received: from JeffreyDeLeo by 173-228-85-118.dsl.dynamic.fusionbroadband.com with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 11 Nov 2015 19:35:06 +0100 List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org Matt Price writes: > I've just written up a post on my workflow for PDF's Thanks, this is helpful. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ramon Diaz-Uriarte Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 21:17:49 +0100 Message-ID: <877floffyq.fsf@gmail.com> References: Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:51663) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwbqN-0001kL-UE for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:18:14 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zwbq7-0006ts-LQ for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:18:11 -0500 Received: from mail-wm0-x230.google.com ([2a00:1450:400c:c09::230]:37501) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwbq7-0006tm-G3 for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:17:55 -0500 Received: by wmww144 with SMTP id w144so60037404wmw.0 for ; Wed, 11 Nov 2015 12:17:54 -0800 (PST) In-reply-to: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Matt Price Cc: Org Mode Dear Matt, On Wed, 11-11-2015, at 15:42, Matt Price wrote: > I've just written up a post on my workflow for PDF's Since my blog has, I > think, a readership of 0 (surely there's a way to get emacsers to follow > me? ah well), I will post a link here in the hopes that someone will be Add another 1 :-) > interested: > > http://matt.hackinghistory.ca/2015/11/11/note-taking-with-pdf-tools/ > Really neat! A few comments/questions/ramblings: - The type of highlights you get from RepliGo contain the text itself. I mean, when in your pdf I use C-c C-a l, the buffer showing the contents of each highlight contain the highlighted text. This is not what I get from, say, EzPDF (which is what I use on Android), or from highlighting from pdf-tools itself using C-c C-a h, or from highlighting from Okular. The contents just gives the rectangle). Hummmm... Because of this, when I use your code on my pdfs, I only get things such as Highlight ([[pdfview:/home/ramon/Zotero-data/storage/ESHHD4KW/Frank_2015_Commentary.pdf::5][Frank_2015_Commentary]], 5) instead of the text. Bummer! I wonder if RepliGO gives you a lot more than the rest, or if I am doing something silly. - You have to call mwp/pdf-multi-extract on each file/set of files. I guess if I knew elisp, I'd find it trivial to iterate over a set of directories and subdirectories (and do this using a cron job at night), and also place everything in one single org file. Would this be something reasonable to do? (This might be related to your second Todo) - I know nothing about how it works, and it does not use pdf-tools, but in your first Todo you mention: "extend the pdfview link type (in org-pdfview) to permit me to specify the precise location of an annotation,". PDF.js (https://mozilla.github.io/pdf.js/), which is used for instance by zotfile (http://zotfile.com/) does that and it works out of the box with Okular (but I've not been able to get it to work with pdftools). - In case it matters, I have somewhat similar modus operandi. I do a lot of PDF reading, including note-taking and highlighting, in android tablets ---I use EzPDF, which also embeds the notes in the PDF. I have a cron job that extracts all the highlights and annotations of all the PDFs and places them in a single org file. The kludge is explained here: https://github.com/rdiaz02/Adios_Mendeley#extracting-all-pdf-annotations-and-placing-them-in-an-org-mode-file The truth is I use two mechanisms for PDF annotation and highlighting extraction, since none is fully satisfactory to me, but the one that uses Ruby (i.e., that does not depend on poppler) is able to actually extract the text of the highlights. Best, and thanks again for sharing, R. > All best! > Matt -- Ramon Diaz-Uriarte Department of Biochemistry, Lab B-25 Facultad de Medicina Universidad Autónoma de Madrid Arzobispo Morcillo, 4 28029 Madrid Spain Phone: +34-91-497-2412 Email: rdiaz02@gmail.com ramon.diaz@iib.uam.es http://ligarto.org/rdiaz From mboxrd@z Thu Jan 1 00:00:00 1970 From: Matt Price Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 15:18:35 -0500 Message-ID: References: <87lha4xzf6.fsf@JUbuntu.dharma.lan> Mime-Version: 1.0 Content-Type: multipart/alternative; boundary=001a1141cf8cf4fc280524498713 Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:51890) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwbqn-0002KS-Hx for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:18:41 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zwbqm-00072q-IX for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:18:37 -0500 Received: from mail-io0-x233.google.com ([2607:f8b0:4001:c06::233]:36818) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwbqm-00072g-73 for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:18:36 -0500 Received: by iofh3 with SMTP id h3so44818954iof.3 for ; Wed, 11 Nov 2015 12:18:35 -0800 (PST) In-Reply-To: <87lha4xzf6.fsf@JUbuntu.dharma.lan> List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Cc: Org Mode --001a1141cf8cf4fc280524498713 Content-Type: text/plain; charset=UTF-8 FWIW, I've just updated the post with - links that jump directly to the highlighted text - a new custom link type that opens highlights in a temporary buffer instead. http://matt.hackinghistory.ca/2015/11/11/note-taking-with-pdf-tools/ On Wed, Nov 11, 2015 at 11:40 AM, Jeffrey DeLeo wrote: > > Matt Price writes: > > I've just written up a post on my workflow for PDF's > > Thanks, this is helpful. > > > --001a1141cf8cf4fc280524498713 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable

On Wed, Nov 11, 2015 at 11:40 AM, Jeffrey DeLeo <JeffreyDeLeo@gmai= l.com> wrote:

Matt Price <moptop99@gmail.com= > writes:
> I've just written up a post on my workflow for PDF's

Thanks, this is helpful.



--001a1141cf8cf4fc280524498713-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Matt Price Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 15:33:52 -0500 Message-ID: References: <877floffyq.fsf@gmail.com> Mime-Version: 1.0 Content-Type: multipart/alternative; boundary=001a113f8f3c9ef925052449befc Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:55505) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwc5b-0000Ar-8N for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:33:57 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zwc5Z-0002Fw-DB for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:33:55 -0500 Received: from mail-io0-x236.google.com ([2607:f8b0:4001:c06::236]:32830) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwc5Z-0002Fq-6p for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:33:53 -0500 Received: by iouu10 with SMTP id u10so37031725iou.0 for ; Wed, 11 Nov 2015 12:33:52 -0800 (PST) In-Reply-To: <877floffyq.fsf@gmail.com> List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Ramon Diaz-Uriarte Cc: Org Mode --001a113f8f3c9ef925052449befc Content-Type: text/plain; charset=UTF-8 On Wed, Nov 11, 2015 at 3:17 PM, Ramon Diaz-Uriarte wrote: > Dear Matt, > > > On Wed, 11-11-2015, at 15:42, Matt Price wrote: > > I've just written up a post on my workflow for PDF's Since my blog has, I > > think, a readership of 0 (surely there's a way to get emacsers to follow > > me? ah well), I will post a link here in the hopes that someone will be > > Add another 1 :-) > > > interested: > > > > http://matt.hackinghistory.ca/2015/11/11/note-taking-with-pdf-tools/ > > > > Really neat! A few comments/questions/ramblings: > > - The type of highlights you get from RepliGo contain the text itself. I > mean, when in your pdf I use C-c C-a l, the buffer showing the contents > of each highlight contain the highlighted text. > > This is not what I get from, say, EzPDF (which is what I use on Android), > or from highlighting from pdf-tools itself using C-c C-a h, or from > highlighting from Okular. The contents just gives the rectangle). > Hummmm... > > > Because of this, when I use your code on my pdfs, I only get things > such as > > Highlight > > ([[pdfview:/home/ramon/Zotero-data/storage/ESHHD4KW/Frank_2015_Commentary.pdf::5][Frank_2015_Commentary]], > 5) > > > instead of the text. Bummer! I wonder if RepliGO gives you a lot more > than the rest, or if I am doing something silly. > > I think that there is no standard way of storing the highlight contents. I chose Repligo over EZPDF because it gives you access to the text of the highlights! Okular, I think, stores your annotations in its own database, rather than in the pdf. You can (I think!) attach the annotations to the pdf from inside Okular. At leasts, that's what I remember from when I was looking around. Repligo stores the highlighted text in the "subject" field of the annotation. It's possible that the content of the annotation is stored in some other field, like "content". Maybe you can try: M-: (pdf-annot-get-annots) and look at the output in the *Messages* buffer. Can you see any evidence of the the text? Can you share what you learned? Politza and I are discussing this here: https://github.com/politza/pdf-tools/issues/137 that might be a good place to ocntinue the conversation. > > - You have to call mwp/pdf-multi-extract on each file/set of files. I guess > if I knew elisp, I'd find it trivial to iterate over a set of directories > and subdirectories (and do this using a cron job at night), and also > place everything in one single org file. Would this be something > reasonable to do? > > for sure. My elisp sucks too but I bet someone will answer you here on the list. > (This might be related to your second Todo) > > well, wasn't what I was planning but would still be useful. > > - I know nothing about how it works, and it does not use pdf-tools, but in > your first Todo you mention: "extend the pdfview link type (in > org-pdfview) to permit me to specify the precise location of an > annotation,". PDF.js (https://mozilla.github.io/pdf.js/), which is > used for instance by zotfile (http://zotfile.com/) does that and it > works > out of the box with Okular (but I've not been able to get it to work with > pdftools). > > Until I found pdf-tools, I had planned to write a node wrapper for pdf.js and grab the annotations that way. But I don't really know how to do that, so this turned out to be easier :-) Anyway, I've judated the post, and it's now possible to create links to individualt annotations, though you will have to use my updated version of org-pdfview, until/unless Markus accepts my patch. > > - In case it matters, I have somewhat similar modus operandi. I do a lot > of PDF reading, including note-taking and highlighting, in android > tablets ---I use EzPDF, which also embeds the notes in the PDF. I have a > cron job that extracts all the highlights and annotations of all the PDFs > and places them in a single org file. The kludge is explained here: > > https://github.com/rdiaz02/Adios_Mendeley#extracting-all-pdf-annotations-and-placing-them-in-an-org-mode-file > The truth is I use two mechanisms for PDF annotation and highlighting > extraction, since none is fully satisfactory to me, but the one that uses > Ruby (i.e., that does not depend on poppler) is able to actually extract > the text of the highlights. > > ah, man, that looks really cool and I'm sorry I didn't know about it earlier! I haven't read through your whole document but looks like there's a lot useful stuff there. > > Best, and thanks again for sharing, > > you're welcome & thank you! m --001a113f8f3c9ef925052449befc Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable


On Wed, Nov 11, 2015 at 3:17 PM, Ramon Diaz-Uriarte &= lt;rdiaz02@gmail.com= > wrote:
Dear Matt,


On Wed, 11-11-2015, at 15:42, Matt Price <moptop99@gmail.com> wrote:
> I've just written up a post on my workflow for PDF's Since my = blog has, I
> think, a readership of 0 (surely there's a way to get emacsers to = follow
> me? ah well), I will post a link here in the hopes that someone will b= e

Add another 1 :-)

> interested:
>
> http://matt.hackinghistory.= ca/2015/11/11/note-taking-with-pdf-tools/
>

Really neat! A few comments/questions/ramblings:

- The type of highlights you get from RepliGo contain the text itself. I =C2=A0 mean, when in your pdf I use C-c C-a l, the buffer showing the conte= nts
=C2=A0 of each highlight contain=C2=A0 the highlighted text.

=C2=A0 This is not what I get from, say, EzPDF (which is what I use on Andr= oid),
=C2=A0 or from highlighting from pdf-tools itself using C-c C-a h, or from<= br> =C2=A0 highlighting from Okular. The contents just gives the rectangle). Hu= mmmm...


=C2=A0 Because of this, when I use your code on my pdfs, I only get things<= br> =C2=A0 such as

Highlight
([[pdfview:/home/ramon/Zotero-data/storage/ESHHD4KW/Frank_2015_Commentary.p= df::5][Frank_2015_Commentary]],
5)


=C2=A0 instead of the text. Bummer! I wonder if RepliGO gives you a lot mor= e
=C2=A0 than the rest, or if I am doing something silly.

I think that there is no standard way of storing the = highlight contents. I chose Repligo over EZPDF because it gives you access = to the text of the highlights!=C2=A0

Okular, I think, st= ores your annotations in its own database, rather than in the pdf. You can = (I think!) attach the annotations to the pdf from inside Okular.=C2=A0 At l= easts, that's what I remember from when I was looking around.

Repligo stores the highlighted text in the "subject" fie= ld of the annotation. It's possible that the content of the annotation = is stored in some other field, like "content".=C2=A0 Maybe you ca= n try:

M-: (pdf-annot-get-annots) and look at the output in the *Mes= sages* buffer.=C2=A0 Can you see any evidence of the the text? Can you shar= e what you learned?

Politza and I are discussing this her= e:
https://g= ithub.com/politza/pdf-tools/issues/137

th= at might be a good place to ocntinue the conversation.
=C2= =A0

- You have to call mwp/pdf-multi-extract on each file/set of files. I guess=
=C2=A0 if I knew elisp, I'd find it trivial to iterate over a set of di= rectories
=C2=A0 and subdirectories (and do this using a cron job at night), and also=
=C2=A0 place everything in one single org file. Would this be something
=C2=A0 reasonable to do?

for sure.=C2=A0 My elisp sucks too but I bet someone = will answer you here on the list.
=C2=A0
=C2=A0 (This might be related to your second Todo)

well, wasn't what I was planning but would still = be useful.

- I know nothing about how it works, and it does not use pdf-tools, but in<= br> =C2=A0 your first Todo you mention: "extend the pdfview link type (in<= br> =C2=A0 org-pdfview) to permit me to specify the precise location of an
=C2=A0 annotation,".=C2=A0 PDF.js (https://mozilla.github.io/p= df.js/), which is
=C2=A0 used for instance by zotfile (http://zotfile.com/) does that and it wo= rks
=C2=A0 out of the box with Okular (but I've not been able to get it to = work with
=C2=A0 pdftools).

Until I found pdf-tools, I had planned to write a nod= e wrapper for pdf.js and grab the annotations that way.=C2=A0 But I don'= ;t really know how to do that, so this turned out to be easier :-)

<= /div>
Anyway, I've judated the post, and it's now possible to c= reate links to individualt annotations, though you will have to use my upda= ted version of org-pdfview, until/unless Markus accepts my patch. =C2=A0

- In case it matters, I have somewhat similar modus operandi.=C2=A0 I do a = lot
=C2=A0 of PDF reading, including note-taking and highlighting, in android =C2=A0 tablets ---I use EzPDF, which also embeds the notes in the PDF. I ha= ve a
=C2=A0 cron job that extracts all the highlights and annotations of all the= PDFs
=C2=A0 and places them in a single org file. The kludge is explained here:<= br> =C2=A0 https://github.com/rdiaz02/Adios_Mendeley#extracting-all-pd= f-annotations-and-placing-them-in-an-org-mode-file
=C2=A0 The truth is I use two mechanisms for PDF annotation and highlightin= g
=C2=A0 extraction, since none is fully satisfactory to me, but the one that= uses
=C2=A0 Ruby (i.e., that does not depend on poppler) is able to actually ext= ract
=C2=A0 the text of the highlights.

ah, man, that looks really cool and I'm sorry I d= idn't know about it earlier! I haven't read through your whole docu= ment but looks like there's a lot useful stuff there.=C2=A0

=C2= =A0

Best, and thanks again for sharing,

you're welcome & thank you!
m
--001a113f8f3c9ef925052449befc-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Matt Price Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 15:34:57 -0500 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/alternative; boundary=001a1141cf8c75c96a052449c260 Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:55756) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwc6e-0001gH-Ck for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:35:02 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zwc6b-0002ad-Np for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:35:00 -0500 Received: from mail-io0-x22c.google.com ([2607:f8b0:4001:c06::22c]:35893) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwc6b-0002Zu-JM for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:34:57 -0500 Received: by iofh3 with SMTP id h3so45231688iof.3 for ; Wed, 11 Nov 2015 12:34:57 -0800 (PST) In-Reply-To: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Org Mode --001a1141cf8c75c96a052449c260 Content-Type: text/plain; charset=UTF-8 On Wed, Nov 11, 2015 at 12:09 PM, Memnon Anon wrote: > Matt Price writes: > > > [...] Since my blog > > has, I think, a readership of 0 (surely there's a way to get emacsers > > to follow me? ah well), I will post a link here [...] > > http://planet.emacsen.org/\ But I can't figure out how to add my blog to the list there! Can you tell me the secret? --001a1141cf8c75c96a052449c260 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable


On Wed, Nov 11, 2015 at 12:09 PM, Memnon Anon <memnon+usenet= @freeshell.org> wrote:
Matt= Price <moptop99@gmail.com>= writes:

> [...] Since my blog
> has, I think, a readership of 0 (surely there's a= way to get emacsers
> to follow me? ah well), I will post a link here [...]

http://planet.emacsen.org/\
But I can't figure out how to add my blog to the list there! Can = you tell me the secret?

--001a1141cf8c75c96a052449c260-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Matt Price Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 15:38:43 -0500 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/alternative; boundary=001a113f8f3cf896bb052449cfed Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:56679) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwcAL-00045e-G4 for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:38:52 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZwcAG-0003dv-HL for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:38:49 -0500 Received: from mail-io0-x231.google.com ([2607:f8b0:4001:c06::231]:35673) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwcAG-0003dj-Cw for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:38:44 -0500 Received: by ioc74 with SMTP id 74so45185999ioc.2 for ; Wed, 11 Nov 2015 12:38:44 -0800 (PST) In-Reply-To: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Cc: Org Mode --001a113f8f3cf896bb052449cfed Content-Type: text/plain; charset=UTF-8 On Wed, Nov 11, 2015 at 9:59 AM, Kaushal Modi wrote: > Have you had a look at https://github.com/rudolfochrist/interleave not till now! > > > The interleave package works great for me: taking notes in an org-mode > buffer in one side (buffer) corresponding to the pdf page open in > pdf-tools on the other side (buffer). > Huh. Looks pretty neat. Doesn't seem to work with pdfview yet, and I'm sort of addicted now. But it seems like a great idea. I can think of some changes, e.g., would be nice to keep all my notes for all books/articles/etc. in a single file; if the interleave options could be set on a per-heading basis that owuld be really cool. And if it worked with pdfview, one could add highlights to the pdf text while writing notes in the org window. Huh. Interesting. Thanks for sharing. --001a113f8f3cf896bb052449cfed Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable


On Wed, Nov 11, 2015 at 9:59 AM, Kaushal Modi <kaushal.modi@gmail= .com> wrote:
Have you had a= look at https://github.com/rudolfochrist/interleave<= /blockquote>

not till now!


The interleave package works great for me: taking notes in an org-mode
buffer in one side (buffer) corresponding to the pdf page open in
pdf-tools on the other side (buffer).
Huh.=C2=A0 Looks pretty neat.=C2=A0 Doesn't seem to = work with pdfview yet, and I'm sort of addicted now. But it seems like = a great idea.=C2=A0 I can think of some changes, e.g., would be nice to kee= p all my notes for all books/articles/etc. in a single file; if the interle= ave options could be set on a per-heading basis that owuld be really cool. = And if it worked with pdfview, one could add highlights to the pdf text whi= le writing notes in the org window. Huh. Interesting. Thanks for sharing.
--001a113f8f3cf896bb052449cfed-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Kaushal Modi Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 15:48:15 -0500 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:59049) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwcKA-0000g5-J1 for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:48:59 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZwcK8-0006Cg-23 for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:48:58 -0500 Received: from mail-ob0-x22a.google.com ([2607:f8b0:4003:c01::22a]:35056) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwcK7-0006CN-Tg for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:48:55 -0500 Received: by obbnk6 with SMTP id nk6so31192628obb.2 for ; Wed, 11 Nov 2015 12:48:55 -0800 (PST) In-Reply-To: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Matt Price Cc: Org Mode > Doesn't seem to work with pdfview yet, and I'm sort of addicted now. I use pdf-view too (the major mode for viewing PDF files that comes with pdf-tools, right?) and interleave works great! > I can think of some changes, e.g., would be nice to keep all my notes for all books/articles/etc. in a single file Yeah, right now it is one org file per pdf. > if the interleave options could be set on a per-heading basis that would be really cool. What sort of options are you thinking about? Right now it creates one org heading for each set of notes corresponding to a page. May be you can work with the interleave.el dev to add in your ideas and code. I had worked with him to make interleave support pdf-tools and it was a pleasure to work with him. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Matt Price Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 15:58:24 -0500 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/alternative; boundary=001a11c3c6be5baf0205244a1602 Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:33148) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwcTK-0005ye-Cw for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:58:27 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZwcTJ-0008Sw-CY for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:58:26 -0500 Received: from mail-ig0-x22e.google.com ([2607:f8b0:4001:c05::22e]:34760) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwcTJ-0008Sr-7m for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 15:58:25 -0500 Received: by igvg19 with SMTP id g19so1644176igv.1 for ; Wed, 11 Nov 2015 12:58:24 -0800 (PST) In-Reply-To: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Cc: Org Mode --001a11c3c6be5baf0205244a1602 Content-Type: text/plain; charset=UTF-8 On Wed, Nov 11, 2015 at 3:48 PM, Kaushal Modi wrote: > > Doesn't seem to work with pdfview yet, and I'm sort of addicted now. > > I use pdf-view too (the major mode for viewing PDF files that comes > with pdf-tools, right?) and interleave works great! > ah cool, i didn't know that. > > > I can think of some changes, e.g., would be nice to keep all my notes > for all books/articles/etc. in a single file > > Yeah, right now it is one org file per pdf. > > > if the interleave options could be set on a per-heading basis that would > be really cool. > > What sort of options are you thinking about? Right now it creates one > org heading for each set of notes corresponding to a page. > > May be you can work with the interleave.el dev to add in your ideas > and code. I had worked with him to make interleave support pdf-tools > and it was a pleasure to work with him. > I meant INTERLEAVE_PDF. I just opened an issue: https://github.com/rudolfochrist/interleave/issues/13 so who knows, maybe... --001a11c3c6be5baf0205244a1602 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable


On Wed, Nov 11, 2015 at 3:48 PM, Kaushal Modi <kaushal.modi@gmail= .com> wrote:
>=C2=A0 Doesn't seem to work with pdfview yet,= and I'm sort of addicted now.

I use pdf-view too (the major mode for viewing PDF files that comes<= br> with pdf-tools, right?) and interleave works great!
ah= cool, i didn't know that.=C2=A0

> I can think of some changes, e.g., would be nice to keep all my notes = for all books/articles/etc. in a single file

Yeah, right now it is one org file per pdf.

> if the interleave options could be set on a per-heading basis that wou= ld be really cool.

What sort of options are you thinking about? Right now it creates one
org heading for each set of notes corresponding to a page.=C2=A0

May be you can work with the interleave.el dev to add in your ideas
and code. I had worked with him to make interleave support pdf-tools
and it was a pleasure to work with him.
I meant INTERL= EAVE_PDF. I just opened an issue: https://github.com/rudolfochrist/interleave/issues= /13=C2=A0 so who knows, maybe...

--001a11c3c6be5baf0205244a1602-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Matt Lundin Subject: Re: Org Mode and PDF Notes! Date: Wed, 11 Nov 2015 16:43:52 -0600 Message-ID: <87lha46tsn.fsf@fastmail.fm> References: <877floffyq.fsf@gmail.com> Mime-Version: 1.0 Content-Type: text/plain Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:33565) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwe7Z-0003lH-Ay for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 17:44:11 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zwe7P-0001dI-1I for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 17:44:05 -0500 Received: from out4-smtp.messagingengine.com ([66.111.4.28]:56890) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwe7O-0001cc-SN for emacs-orgmode@gnu.org; Wed, 11 Nov 2015 17:43:54 -0500 In-Reply-To: (Matt Price's message of "Wed, 11 Nov 2015 15:33:52 -0500") List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Matt Price Cc: Ramon Diaz-Uriarte , Org Mode Matt Price writes: > > I think that there is no standard way of storing the highlight > contents. I chose Repligo over EZPDF because it gives you access to > the text of the highlights! > Okular, I think, stores your annotations in its own database, rather > than in the pdf. You can (I think!) attach the annotations to the pdf > from inside Okular. At leasts, that's what I remember from when I was > looking around. > Repligo stores the highlighted text in the "subject" field of the > annotation. It's possible that the content of the annotation is stored > in some other field, like "content". Maybe you can try: Repligo (as you say) grabs the text in the highlighted region and adds it to the "subject" connected to the highlight. I think it is rather unique in doing so. AFAIK, Okular, pdf-tools, or other android tools (mupdf, xodo reader) do not not do this. In other words, it is thanks of Repligo's previous extraction that the function on Matt's blog is able get the text of the highlighted region. Otherwise, you would need a means of going back into the region highlighted and extracting the embedded text. Pdf-tools doesn't currently extract the text, but it might not be that difficult to modify it to do so, since it already has the bounds of the highlighted region and can select and copy text embedded in the pdf. Best, Matt From mboxrd@z Thu Jan 1 00:00:00 1970 From: Karl Voit Subject: Re: Org Mode and PDF Notes! Date: Thu, 12 Nov 2015 12:30:35 +0100 Message-ID: <2015-11-12T12-19-00@devnull.Karl-Voit.at> References: Reply-To: Karl Voit Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:59240) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwq5c-0005FJ-P0 for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 06:30:53 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zwq5Z-00045a-IB for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 06:30:52 -0500 Received: from plane.gmane.org ([80.91.229.3]:50323) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwq5Z-00045H-1S for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 06:30:49 -0500 Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1Zwq5X-00083R-1o for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 12:30:47 +0100 Received: from friends.grml.info ([136.243.234.19]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 12 Nov 2015 12:30:47 +0100 Received: from news1142 by friends.grml.info with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 12 Nov 2015 12:30:47 +0100 List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org * Matt Price wrote: > --047d7b41411eff32e6052444d409 > Content-Type: text/plain; charset=UTF-8 > > I've just written up a post on my workflow for PDF's Since my blog has, I > think, a readership of 0 (surely there's a way to get emacsers to follow > me? ah well), I will post a link here in the hopes that someone will be > interested: > > http://matt.hackinghistory.ca/2015/11/11/note-taking-with-pdf-tools/ Thanks for your blog post and the URL! I did something very similar a couple of years ago while writing my PhD thesis: https://github.com/novoid/extract_pdf_annotations_to_orgmode And I also have a blog which I use for Emacs/Org-mode stuff from time to time: http://karl-voit.at/ This reminds me of implementing a tag-based RSS/Atom-feed for Emacs-related stuff to my blogging system ... -- mail|git|SVN|photos|postings|SMS|phonecalls|RSS|CSV|XML to Org-mode: > get Memacs from https://github.com/novoid/Memacs < https://github.com/novoid/extract_pdf_annotations_to_orgmode + more on github From mboxrd@z Thu Jan 1 00:00:00 1970 From: Sebastian Christ Subject: Re: Org Mode and PDF Notes! Date: Thu, 12 Nov 2015 12:58:25 +0100 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:38894) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwqWY-0003AO-3C for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 06:58:42 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZwqWU-0002yn-3O for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 06:58:42 -0500 Received: from plane.gmane.org ([80.91.229.3]:51968) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwqWT-0002yd-T8 for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 06:58:38 -0500 Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1ZwqWN-00043X-Pm for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 12:58:31 +0100 Received: from ip5f591fba.dynamic.kabel-deutschland.de ([95.89.31.186]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 12 Nov 2015 12:58:31 +0100 Received: from rudolfo.christ by ip5f591fba.dynamic.kabel-deutschland.de with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 12 Nov 2015 12:58:31 +0100 List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org >>>>> On Wed, 11 Nov 2015 15:48:15 -0500, Kaushal Modi said: > May be you can work with the interleave.el dev to add in your ideas > and code. I had worked with him to make interleave support pdf-tools > and it was a pleasure to work with him. You're so kind! It was also a pleasure for me. Best, Sebastian From mboxrd@z Thu Jan 1 00:00:00 1970 From: Sebastian Christ Subject: Re: Org Mode and PDF Notes! Date: Thu, 12 Nov 2015 13:02:21 +0100 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:41259) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwqcu-0005Yj-M8 for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 07:05:22 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zwqcq-0004qP-MO for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 07:05:16 -0500 Received: from plane.gmane.org ([80.91.229.3]:52485) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwqcq-0004qK-GI for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 07:05:12 -0500 Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1Zwqck-0002dZ-0Z for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 13:05:06 +0100 Received: from ip5f591fba.dynamic.kabel-deutschland.de ([95.89.31.186]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 12 Nov 2015 13:05:05 +0100 Received: from rudolfo.christ by ip5f591fba.dynamic.kabel-deutschland.de with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 12 Nov 2015 13:05:05 +0100 List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org >>>>> On Wed, 11 Nov 2015 15:58:24 -0500, Matt Price said: > I meant INTERLEAVE_PDF. I just opened an issue: > https://github.com/rudolfochrist/interleave/issues/13 so who knows, > maybe... Sound like a good idea. I'm currently a little short on time,but perhaps I can look into that this weekend. -Sebastian From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ramon Diaz-Uriarte Subject: Re: Org Mode and PDF Notes! Date: Thu, 12 Nov 2015 13:23:43 +0100 Message-ID: <87wptnqucw.fsf@gmail.com> References: <877floffyq.fsf@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:46758) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwqur-0004Wv-Qn for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 07:23:51 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zwquo-0001Ot-Jp for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 07:23:49 -0500 Received: from mail-wm0-x232.google.com ([2a00:1450:400c:c09::232]:33098) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwquo-0001Om-BY for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 07:23:46 -0500 Received: by wmec201 with SMTP id c201so30203369wme.0 for ; Thu, 12 Nov 2015 04:23:45 -0800 (PST) In-reply-to: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Matt Price Cc: Ramon Diaz-Uriarte , Org Mode On Wed, 11-11-2015, at 21:33, Matt Price wrote: >> >> >> instead of the text. Bummer! I wonder if RepliGO gives you a lot more >> than the rest, or if I am doing something silly. >> >> I think that there is no standard way of storing the highlight contents. I > chose Repligo over EZPDF because it gives you access to the text of the > highlights! I'll try to see if I can get repligo (I had it a few years ago) > > Okular, I think, stores your annotations in its own database, rather than > in the pdf. You can (I think!) attach the annotations to the pdf from > inside Okular. At leasts, that's what I remember from when I was looking > around. Actually, Okular stores the annotations in the PDF itself if you do "Save As". (It still keeps an internal db, but I never use it anymore). It is easy to check by doing that and then opening the file with another reader in another machine (e.g., in an android). > > Repligo stores the highlighted text in the "subject" field of the > annotation. It's possible that the content of the annotation is stored in > some other field, like "content". Maybe you can try: > > M-: (pdf-annot-get-annots) and look at the output in the *Messages* > buffer. Can you see any evidence of the the text? Can you share what you > learned? Nope, no evidence of the text. I get things such as (((buffer . #) (page . 13) (edges 0.113553 0.31717 0.868657 0.361746) (type . highlight) (id . annot-13-0) (flags . 4) (color . "#ffff00") (contents . "") (modified 22081 45188) (label . "TF201") (subject . "Highlight") (opacity . 1.0) ...) so we get the location of the highlight (and its properties), but not the textual contents. And this is the case whether I make the annotation with EzPDF or Okular or, for that matter, with pdf-tools itself. So it seems RepliGO is actually giving you a lot more by default :-) > > Politza and I are discussing this here: > https://github.com/politza/pdf-tools/issues/137 > > that might be a good place to ocntinue the conversation. > I'll do. In the meantime, I think this is a limitation coming from poppler. Other people have mentioned similar things (e.g., http://coda.caseykuhlman.com/entries/2014/pdf-extract.html) and using other tools that depend on poppler (such as Leela: https://github.com/TrilbyWhite/Leela) also will not give us the text itself. >> >> Until I found pdf-tools, I had planned to write a node wrapper for pdf.js > and grab the annotations that way. But I don't really know how to do that, > so this turned out to be easier :-) > > Anyway, I've judated the post, and it's now possible to create links to > individualt annotations, though you will have to use my updated version of > org-pdfview, until/unless Markus accepts my patch. I just updated packages, and things are working perfectly: I am jumping to the page and location. Thanks, R. -- Ramon Diaz-Uriarte Department of Biochemistry, Lab B-25 Facultad de Medicina Universidad Autónoma de Madrid Arzobispo Morcillo, 4 28029 Madrid Spain Phone: +34-91-497-2412 Email: rdiaz02@gmail.com ramon.diaz@iib.uam.es http://ligarto.org/rdiaz From mboxrd@z Thu Jan 1 00:00:00 1970 From: Matt Price Subject: Re: Org Mode and PDF Notes! Date: Thu, 12 Nov 2015 08:11:23 -0500 Message-ID: References: <877floffyq.fsf@gmail.com> <87wptnqucw.fsf@gmail.com> Mime-Version: 1.0 Content-Type: multipart/alternative; boundary=001a1141cf8cfee8e1052457ad9b Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:58237) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwrev-0006Nc-Vj for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 08:11:27 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zwreu-00047q-5e for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 08:11:25 -0500 Received: from mail-io0-x22d.google.com ([2607:f8b0:4001:c06::22d]:36393) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwreu-00047a-0l for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 08:11:24 -0500 Received: by iofh3 with SMTP id h3so64162938iof.3 for ; Thu, 12 Nov 2015 05:11:23 -0800 (PST) In-Reply-To: <87wptnqucw.fsf@gmail.com> List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Ramon Diaz-Uriarte Cc: Org Mode --001a1141cf8cfee8e1052457ad9b Content-Type: text/plain; charset=UTF-8 On Thu, Nov 12, 2015 at 7:23 AM, Ramon Diaz-Uriarte wrote: > > > > On Wed, 11-11-2015, at 21:33, Matt Price wrote: > >> > >> > >> instead of the text. Bummer! I wonder if RepliGO gives you a lot more > >> than the rest, or if I am doing something silly. > >> > >> I think that there is no standard way of storing the highlight > contents. I > > chose Repligo over EZPDF because it gives you access to the text of the > > highlights! > > > I'll try to see if I can get repligo (I had it a few years ago) > > > > > Okular, I think, stores your annotations in its own database, rather than > > in the pdf. You can (I think!) attach the annotations to the pdf from > > inside Okular. At leasts, that's what I remember from when I was looking > > around. > > Actually, Okular stores the annotations in the PDF itself if you do "Save > As". (It still keeps an internal db, but I never use it anymore). It is > easy to check by doing that and then opening the file with another reader > in another machine (e.g., in an android). > > > my bad, thx. > > > > > Repligo stores the highlighted text in the "subject" field of the > > annotation. It's possible that the content of the annotation is stored in > > some other field, like "content". Maybe you can try: > > > > M-: (pdf-annot-get-annots) and look at the output in the *Messages* > > buffer. Can you see any evidence of the the text? Can you share what you > > learned? > > Nope, no evidence of the text. I get things such as > > (((buffer . #) (page . 13) (edges > 0.113553 0.31717 0.868657 0.361746) (type . highlight) (id . annot-13-0) > (flags . 4) (color . "#ffff00") (contents . "") (modified 22081 45188) > (label . "TF201") (subject . "Highlight") (opacity . 1.0) ...) > > > so we get the location of the highlight (and its properties), but not the > textual contents. And this is the case whether I make the annotation with > EzPDF or Okular or, for that matter, with pdf-tools itself. > > So it seems RepliGO is actually giving you a lot more by default :-) > > Try replacing (text (assoc-default 'subject annot)) with (text (pdf-info-gettext page (assoc-default 'edges annot))) in the lambda function in pdf-annot-markups-as-org-text. This will fail on cropped pdfs if you have added highlights using the most recent pdf-tools, which stores negative values in the 'edges field, but I've found it works otherwise. I'd love to hear if it works for you too. (I know you're following the relevant bug report on the pdf-tools github repo). > > > > > Politza and I are discussing this here: > > https://github.com/politza/pdf-tools/issues/137 > > > > that might be a good place to ocntinue the conversation. > > > > I'll do. In the meantime, I think this is a limitation coming from > poppler. Other people have mentioned similar things (e.g., > http://coda.caseykuhlman.com/entries/2014/pdf-extract.html) and using > other > tools that depend on poppler (such as Leela: > https://github.com/TrilbyWhite/Leela) also will not give us the text > itself. > > > > >> > >> Until I found pdf-tools, I had planned to write a node wrapper for > pdf.js > > and grab the annotations that way. But I don't really know how to do > that, > > so this turned out to be easier :-) > > > > Anyway, I've judated the post, and it's now possible to create links to > > individualt annotations, though you will have to use my updated version > of > > org-pdfview, until/unless Markus accepts my patch. > > > I just updated packages, and things are working perfectly: I am jumping to > the page and location. > > > > Thanks, > > > R. > > > > > --001a1141cf8cfee8e1052457ad9b Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable


On Thu, Nov 12, 2015 at 7:23 AM, Ramon Diaz-Uriarte &= lt;rdiaz02@gmail.com= > wrote:



On Wed, 11-11-2015, at 21:33, Matt Price <moptop99@gmail.com> wrote:
>>
>>
>>=C2=A0 =C2=A0instead of the text. Bummer! I wonder if RepliGO gives= you a lot more
>>=C2=A0 =C2=A0than the rest, or if I am doing something silly.
>>
>> I think that there is no standard way of storing the highlight con= tents. I
> chose Repligo over EZPDF because it gives you access to the text of th= e
> highlights!


I'll try to see if I can get repligo (I had it a few years ago)<= br>
>
> Okular, I think, stores your annotations in its own database, rather t= han
> in the pdf. You can (I think!) attach the annotations to the pdf from<= br> > inside Okular.=C2=A0 At leasts, that's what I remember from when I= was looking
> around.

Actually, Okular stores the annotations in the PDF itself if you do = "Save
As". (It still keeps an internal db, but I never use it anymore). It i= s
easy to check by doing that and then opening the file with another reader in another machine (e.g., in an android).


my bad, thx.=C2=A0

>
> Repligo stores the highlighted text in the "subject" field o= f the
> annotation. It's possible that the content of the annotation is st= ored in
> some other field, like "content".=C2=A0 Maybe you can try: >
> M-: (pdf-annot-get-annots) and look at the output in the *Messages* > buffer.=C2=A0 Can you see any evidence of the the text? Can you share = what you
> learned?

Nope, no evidence of the text. I get things such as

(((buffer . #<buffer Frank_2015_Commentary.pdf>) (page . 13) (edges 0.113553 0.31717 0.868657 0.361746) (type . highlight) (id . annot-13-0) (flags . 4) (color . "#ffff00") (contents . "") (modifi= ed 22081 45188)
(label . "TF201") (subject . "Highlight") (opacity . 1.= 0) ...)


so we get the location of the highlight (and its properties), but not the textual contents. And this is the case whether I make the annotation with EzPDF or Okular or, for that matter, with pdf-tools itself.

So it seems RepliGO is actually giving you a lot more by default :-)


Try replacing
(text (assoc-default 'subject annot))

with=

(text (pdf-info-gettext page (assoc-default 'edges annot)))
in the lambda function in pdf-annot-markups-as-org-text.= =C2=A0 This will fail on cropped pdfs if you have added highlights using th= e most recent pdf-tools, which stores negative values in the 'edges fie= ld, but I've found it works otherwise.=C2=A0 I'd love to hear if it= works for you too. (I know you're following the relevant bug report on= the pdf-tools github repo).
=C2=A0

>
> Politza and I are discussing this here:
> https://github.com/politza/pdf-tools/issues/137<= /a>
>
> that might be a good place to ocntinue the conversation.
>

I'll do. In the meantime, I think this is a limitation coming fr= om
poppler. Other people have mentioned similar things (e.g.,
http://coda.caseykuhlman.com/entries/2014= /pdf-extract.html) and using other
tools that depend on poppler (such as Leela:
https://github.com/TrilbyWhite/Leela) also will not give us= the text
itself.



>>
>> Until I found pdf-tools, I had planned to write a node wrapper for= pdf.js
> and grab the annotations that way.=C2=A0 But I don't really know h= ow to do that,
> so this turned out to be easier :-)
>
> Anyway, I've judated the post, and it's now possible to create= links to
> individualt annotations, though you will have to use my updated versio= n of
> org-pdfview, until/unless Markus accepts my patch.


I just updated packages, and things are working perfectly: I am jump= ing to
the page and location.



Thanks,


R.





--001a1141cf8cfee8e1052457ad9b-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Matt Lundin Subject: Re: Org Mode and PDF Notes! Date: Thu, 12 Nov 2015 08:28:44 -0600 Message-ID: <87k2pn70mb.fsf@fastmail.fm> References: <877floffyq.fsf@gmail.com> <87wptnqucw.fsf@gmail.com> Mime-Version: 1.0 Content-Type: text/plain Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:48588) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwsrs-0007yZ-LD for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 09:28:53 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zwsrn-0006i2-St for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 09:28:52 -0500 Received: from out4-smtp.messagingengine.com ([66.111.4.28]:48854) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zwsrn-0006hZ-Mt for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 09:28:47 -0500 In-Reply-To: <87wptnqucw.fsf@gmail.com> (Ramon Diaz-Uriarte's message of "Thu, 12 Nov 2015 13:23:43 +0100") List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Ramon Diaz-Uriarte Cc: Org Mode Ramon Diaz-Uriarte writes: > > so we get the location of the highlight (and its properties), but not the > textual contents. And this is the case whether I make the annotation with > EzPDF or Okular or, for that matter, with pdf-tools itself. > > So it seems RepliGO is actually giving you a lot more by default :-) > >> >> Politza and I are discussing this here: >> https://github.com/politza/pdf-tools/issues/137 >> >> that might be a good place to ocntinue the conversation. >> > > I'll do. In the meantime, I think this is a limitation coming from > poppler. Other people have mentioned similar things (e.g., > http://coda.caseykuhlman.com/entries/2014/pdf-extract.html) and using other > tools that depend on poppler (such as Leela: > https://github.com/TrilbyWhite/Leela) also will not give us the text > itself. I don't think this is a limitation of poppler so much as the way that pdf annotations work. Typically, the subject/text field is not populated by the text of the highlighted region. Rather, a highlight annotation specifies bounds, color, style, etc. Basically what Repligo does (I wouldn't recommend using it, as it is closed source and severely out of date) is to grab the text *at the time of highlighting* and add it to the notes field. I don't know of any other annotation tool that does the same thing. Applications built on poppler could do it, though they currently do not. For extracting the text of highlighted regions *after the fact*, I've had good luck with this script that relies on the pdf-reader gem for ruby: https://gist.github.com/danlucraft/5277732 Matt From mboxrd@z Thu Jan 1 00:00:00 1970 From: Memnon Anon Subject: Re: Org Mode and PDF Notes! Date: Thu, 12 Nov 2015 17:31:45 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:51188) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwvoJ-0008KV-0s for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 12:37:23 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZwvoF-0001f4-HA for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 12:37:22 -0500 Received: from plane.gmane.org ([80.91.229.3]:33960) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZwvoF-0001eX-BF for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 12:37:19 -0500 Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1Zwvo9-0003kC-7y for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 18:37:13 +0100 Received: from odin.sdf-eu.org ([178.63.35.194]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 12 Nov 2015 18:37:13 +0100 Received: from memnon+usenet by odin.sdf-eu.org with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 12 Nov 2015 18:37:13 +0100 List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org Matt Price writes: > On Wed, Nov 11, 2015 at 12:09 PM, Memnon Anon < > memnon+usenet@freeshell.org> wrote: > > http://planet.emacsen.org/\ > > But I can't figure out how to add my blog to the list there! Can you > tell me the secret? http://edward.oconnor.cx/2009/11/planet-emacsen-turns-3 ,---- | If you’d like me to add your Emacs feed to Planet Emacsen, send me an | email [mailto:hober0@gmail.com] and I’ll get right on it. `---- While you are at it, please ask him to put that info somewhere more visible on the site itself... -- /-------------------------------------------\ | SDF and SDF-EU Public Access UNIX System | | http://sdf.org || http://sdf-eu.org | ============================================= From mboxrd@z Thu Jan 1 00:00:00 1970 From: Matt Price Subject: Re: Org Mode and PDF Notes! Date: Thu, 12 Nov 2015 17:52:45 -0500 Message-ID: References: <877floffyq.fsf@gmail.com> <87wptnqucw.fsf@gmail.com> <87k2pn70mb.fsf@fastmail.fm> Mime-Version: 1.0 Content-Type: multipart/alternative; boundary=001a11c3c6be21cb7305245fcdcb Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:58495) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zx0jY-0003bD-0Q for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 17:52:49 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zx0jW-0006q4-JO for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 17:52:47 -0500 Received: from mail-ig0-x235.google.com ([2607:f8b0:4001:c05::235]:37759) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zx0jW-0006pw-D2 for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 17:52:46 -0500 Received: by igbhv6 with SMTP id hv6so3921015igb.0 for ; Thu, 12 Nov 2015 14:52:45 -0800 (PST) In-Reply-To: <87k2pn70mb.fsf@fastmail.fm> List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Cc: Org Mode --001a11c3c6be21cb7305245fcdcb Content-Type: text/plain; charset=UTF-8 On Thu, Nov 12, 2015 at 9:28 AM, Matt Lundin wrote: > Ramon Diaz-Uriarte writes: > > > > I'll do. In the meantime, I think this is a limitation coming from > > poppler. Other people have mentioned similar things (e.g., > > http://coda.caseykuhlman.com/entries/2014/pdf-extract.html) and using > other > > tools that depend on poppler (such as Leela: > > https://github.com/TrilbyWhite/Leela) also will not give us the text > > itself. > > I don't think this is a limitation of poppler so much as the way that > pdf annotations work. Typically, the subject/text field is not populated > by the text of the highlighted region. Rather, a highlight annotation > specifies bounds, color, style, etc. Basically what Repligo does (I > wouldn't recommend using it, as it is closed source and severely out of > date) is to grab the text *at the time of highlighting* and add it to > the notes field. I don't know of any other annotation tool that does the > same thing. Applications built on poppler could do it, though they > currently do not. > > For extracting the text of highlighted regions *after the fact*, I've > had good luck with this script that relies on the pdf-reader gem for > ruby: > > https://gist.github.com/danlucraft/5277732 > > This looks interesting. It searches for file "./markup_receiver", but doesn't provide that file, which does not appear to be a gem. Any hints? With politza's help am getting close to being able to extract annotation text from within pdf-tools, but am not quite there yet. > Matt > --001a11c3c6be21cb7305245fcdcb Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable


On Thu, Nov 12, 2015 at 9:28 AM, Matt Lundin <mdl@imapmail.org><= /span> wrote:
Ramon Diaz= -Uriarte <rdiaz02@gmail.com>= writes:
>
> I'll do. In the meantime, I think this is a limitation coming from=
> poppler. Other people have mentioned similar things (e.g.,
> http://coda.caseykuhlman.com/entries/= 2014/pdf-extract.html) and using other
> tools that depend on poppler (such as Leela:
> https://github.com/TrilbyWhite/Leela) also will not giv= e us the text
> itself.

I don't think this is a limitation of poppler so much as the way= that
pdf annotations work. Typically, the subject/text field is not populated by the text of the highlighted region. Rather, a highlight annotation
specifies bounds, color, style, etc. Basically what Repligo does (I
wouldn't recommend using it, as it is closed source and severely out of=
date) is to grab the text *at the time of highlighting* and add it to
the notes field. I don't know of any other annotation tool that does th= e
same thing. Applications built on poppler could do it, though they
currently do not.

For extracting the text of highlighted regions *after the fact*, I've had good luck with this script that relies on the pdf-reader gem for
ruby:

https://gist.github.com/danlucraft/5277732

This looks interesting. It searches for file "./markup_receive= r", but doesn't provide that file, which does not appear to be a g= em.=C2=A0 Any hints?=C2=A0

With politza's help am ge= tting close to being able to extract annotation text from within pdf-tools,= but am not quite there yet.=C2=A0
=C2=A0
Matt

--001a11c3c6be21cb7305245fcdcb-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ramon Diaz-Uriarte Subject: Re: Org Mode and PDF Notes! Date: Fri, 13 Nov 2015 00:51:41 +0100 Message-ID: <874mgqg4j6.fsf@gmail.com> References: <877floffyq.fsf@gmail.com> <87wptnqucw.fsf@gmail.com> <87k2pn70mb.fsf@fastmail.fm> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:47979) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zx1ee-0003QK-UW for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 18:51:50 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zx1eb-00071L-2t for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 18:51:48 -0500 Received: from mail-wm0-x233.google.com ([2a00:1450:400c:c09::233]:34499) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zx1ea-000711-TF for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 18:51:45 -0500 Received: by wmvv187 with SMTP id v187so57932421wmv.1 for ; Thu, 12 Nov 2015 15:51:44 -0800 (PST) In-reply-to: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Matt Price Cc: Org Mode On Thu, 12-11-2015, at 23:52, Matt Price wrote: > On Thu, Nov 12, 2015 at 9:28 AM, Matt Lundin wrote: > >> Ramon Diaz-Uriarte writes: >> > >> > I'll do. In the meantime, I think this is a limitation coming from >> > poppler. Other people have mentioned similar things (e.g., >> > http://coda.caseykuhlman.com/entries/2014/pdf-extract.html) and using >> other >> > tools that depend on poppler (such as Leela: >> > https://github.com/TrilbyWhite/Leela) also will not give us the text >> > itself. >> >> I don't think this is a limitation of poppler so much as the way that >> pdf annotations work. Typically, the subject/text field is not populated >> by the text of the highlighted region. Rather, a highlight annotation >> specifies bounds, color, style, etc. Basically what Repligo does (I >> wouldn't recommend using it, as it is closed source and severely out of >> date) is to grab the text *at the time of highlighting* and add it to >> the notes field. I don't know of any other annotation tool that does the >> same thing. Applications built on poppler could do it, though they >> currently do not. >> >> For extracting the text of highlighted regions *after the fact*, I've >> had good luck with this script that relies on the pdf-reader gem for >> ruby: >> >> https://gist.github.com/danlucraft/5277732 >> >> This looks interesting. It searches for file "./markup_receiver", but > doesn't provide that file, which does not appear to be a gem. Any hints? I think I got it from https://www.omniref.com/github/danlucraft/pyranine/HEAD/files/lib/pyranine/markup_receiver.rb > > With politza's help am getting close to being able to extract annotation > text from within pdf-tools, but am not quite there yet. Neat! R. > > >> Matt >> -- Ramon Diaz-Uriarte Department of Biochemistry, Lab B-25 Facultad de Medicina Universidad Autónoma de Madrid Arzobispo Morcillo, 4 28029 Madrid Spain Phone: +34-91-497-2412 Email: rdiaz02@gmail.com ramon.diaz@iib.uam.es http://ligarto.org/rdiaz From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ramon Diaz-Uriarte Subject: Re: Org Mode and PDF Notes! Date: Fri, 13 Nov 2015 00:55:14 +0100 Message-ID: <8737wag4d9.fsf@gmail.com> References: <877floffyq.fsf@gmail.com> <87wptnqucw.fsf@gmail.com> <87k2pn70mb.fsf@fastmail.fm> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:48816) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zx1i4-000530-Oi for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 18:55:21 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zx1i0-00084w-Q6 for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 18:55:20 -0500 Received: from mail-wm0-x22c.google.com ([2a00:1450:400c:c09::22c]:35223) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zx1i0-00084c-KB for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 18:55:16 -0500 Received: by wmdw130 with SMTP id w130so8606847wmd.0 for ; Thu, 12 Nov 2015 15:55:16 -0800 (PST) In-reply-to: <87k2pn70mb.fsf@fastmail.fm> List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Matt Lundin Cc: Ramon Diaz-Uriarte , Org Mode On Thu, 12-11-2015, at 15:28, Matt Lundin wrote: > Ramon Diaz-Uriarte writes: > >> >> so we get the location of the highlight (and its properties), but not the >> textual contents. And this is the case whether I make the annotation with >> EzPDF or Okular or, for that matter, with pdf-tools itself. >> >> So it seems RepliGO is actually giving you a lot more by default :-) >> >>> >>> Politza and I are discussing this here: >>> https://github.com/politza/pdf-tools/issues/137 >>> >>> that might be a good place to ocntinue the conversation. >>> >> >> I'll do. In the meantime, I think this is a limitation coming from >> poppler. Other people have mentioned similar things (e.g., >> http://coda.caseykuhlman.com/entries/2014/pdf-extract.html) and using other >> tools that depend on poppler (such as Leela: >> https://github.com/TrilbyWhite/Leela) also will not give us the text >> itself. > > I don't think this is a limitation of poppler so much as the way that > pdf annotations work. Typically, the subject/text field is not populated > by the text of the highlighted region. Rather, a highlight annotation > specifies bounds, color, style, etc. Basically what Repligo does (I > wouldn't recommend using it, as it is closed source and severely out of > date) is to grab the text *at the time of highlighting* and add it to > the notes field. I don't know of any other annotation tool that does the > same thing. Applications built on poppler could do it, though they > currently do not. I stand corrected. You are right; sorry for the sloppiness in the wording and ideas. > > For extracting the text of highlighted regions *after the fact*, I've > had good luck with this script that relies on the pdf-reader gem for > ruby: > > https://gist.github.com/danlucraft/5277732 That is also what I use for extracting the text from the highlighted regions. R. > > Matt -- Ramon Diaz-Uriarte Department of Biochemistry, Lab B-25 Facultad de Medicina Universidad Autónoma de Madrid Arzobispo Morcillo, 4 28029 Madrid Spain Phone: +34-91-497-2412 Email: rdiaz02@gmail.com ramon.diaz@iib.uam.es http://ligarto.org/rdiaz From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ramon Diaz-Uriarte Subject: Re: Org Mode and PDF Notes! Date: Fri, 13 Nov 2015 01:39:30 +0100 Message-ID: <871tbug2bh.fsf@gmail.com> References: <877floffyq.fsf@gmail.com> <87wptnqucw.fsf@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:56726) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zx2Ox-0000vG-3E for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 19:39:40 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zx2Os-0001ZG-9K for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 19:39:39 -0500 Received: from mail-wm0-x22a.google.com ([2a00:1450:400c:c09::22a]:33027) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zx2Os-0001ZB-33 for emacs-orgmode@gnu.org; Thu, 12 Nov 2015 19:39:34 -0500 Received: by wmec201 with SMTP id c201so58996368wme.0 for ; Thu, 12 Nov 2015 16:39:33 -0800 (PST) In-reply-to: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Matt Price Cc: Ramon Diaz-Uriarte , Org Mode On Thu, 12-11-2015, at 14:11, Matt Price wrote: >> > >> > M-: (pdf-annot-get-annots) and look at the output in the *Messages* >> > buffer. Can you see any evidence of the the text? Can you share what you >> > learned? >> >> Nope, no evidence of the text. I get things such as >> >> (((buffer . #) (page . 13) (edges >> 0.113553 0.31717 0.868657 0.361746) (type . highlight) (id . annot-13-0) >> (flags . 4) (color . "#ffff00") (contents . "") (modified 22081 45188) >> (label . "TF201") (subject . "Highlight") (opacity . 1.0) ...) >> >> >> so we get the location of the highlight (and its properties), but not the >> textual contents. And this is the case whether I make the annotation with >> EzPDF or Okular or, for that matter, with pdf-tools itself. >> >> So it seems RepliGO is actually giving you a lot more by default :-) >> >> > Try replacing > > (text (assoc-default 'subject annot)) > > with > > (text (pdf-info-gettext page (assoc-default 'edges annot))) > > > in the lambda function in pdf-annot-markups-as-org-text. This will fail on > cropped pdfs if you have added highlights using the most recent pdf-tools, > which stores negative values in the 'edges field, but I've found it works > otherwise. I'd love to hear if it works for you too. (I know you're > following the relevant bug report on the pdf-tools github repo). It works! I get the highlighted text (I have just tried with a pdf I highlighted with EzPDF, the one that, above, was giving just the location and properties). And it still does the correct thing in your pdf (the one from repligo). Awesome! Thanks. R. -- Ramon Diaz-Uriarte Department of Biochemistry, Lab B-25 Facultad de Medicina Universidad Autónoma de Madrid Arzobispo Morcillo, 4 28029 Madrid Spain Phone: +34-91-497-2412 Email: rdiaz02@gmail.com ramon.diaz@iib.uam.es http://ligarto.org/rdiaz From mboxrd@z Thu Jan 1 00:00:00 1970 From: Matti Minkkinen Subject: Re: Org Mode and PDF Notes! Date: Fri, 13 Nov 2015 10:04:47 +0200 Message-ID: <87pozepbog.fsf@utu.fi> References: <20@gmane.emacs.orgmode.nnrss> Mime-Version: 1.0 Content-Type: text/plain Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:57500) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zx9Lv-0000a9-Og for emacs-orgmode@gnu.org; Fri, 13 Nov 2015 03:05:00 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Zx9Lq-0000qG-Li for emacs-orgmode@gnu.org; Fri, 13 Nov 2015 03:04:59 -0500 Received: from smtp-03.utu.fi ([130.232.207.3]:65423) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Zx9Lq-0000iq-FJ for emacs-orgmode@gnu.org; Fri, 13 Nov 2015 03:04:54 -0500 Received: from mmin-utu (dyn70-104.kk.net.utu.fi [130.232.70.104]) by smtp-03.utu.fi (Oracle Communications Messaging Server 7.0.5.29.0 64bit (built Jul 9 2013)) with ESMTPSA id <0NXQ0075NUFZKB20@smtp-03.utu.fi> for emacs-orgmode@gnu.org; Fri, 13 Nov 2015 10:04:47 +0200 (EET) In-reply-to: <20@gmane.emacs.orgmode.nnrss> List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org Ramon Diaz-Uriarte writes: > On Thu, 12-11-2015, at 14:11, Matt Price gmail.com> wrote: > It works! I get the highlighted text (I have just tried with a pdf I > highlighted with EzPDF, the one that, above, was giving just the location > and properties). And it still does the correct thing in your pdf (the one > from repligo). > Awesome! Thanks. > R. > > link I also have a strong interest in getting PDF annotations from Android to org-mode notes easily, so I've been following this thread with great interest. Very recently, on 11 November, there were a couple of commits to this pdf-tools fork which also let you export highlighted text: https://github.com/pinguim06/pdf-tools Matt Price already referred to the pull request from that fork, but those commits are more recent than the original pull request. I think it uses the same principle as the solution mentioned here but maybe it's more elaborate. I don't know enough elisp to confirm that. -- Matti From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ramon Diaz-Uriarte Subject: Re: Org Mode and PDF Notes! Date: Mon, 16 Nov 2015 11:07:15 +0100 Message-ID: <87fv06qmuk.fsf@gmail.com> References: <20@gmane.emacs.orgmode.nnrss> <87pozepbog.fsf@utu.fi> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:48110) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZyGh0-0002BR-K7 for emacs-orgmode@gnu.org; Mon, 16 Nov 2015 05:07:23 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZyGgw-0005vc-I5 for emacs-orgmode@gnu.org; Mon, 16 Nov 2015 05:07:22 -0500 Received: from mail-wm0-x233.google.com ([2a00:1450:400c:c09::233]:35639) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZyGgw-0005vW-Av for emacs-orgmode@gnu.org; Mon, 16 Nov 2015 05:07:18 -0500 Received: by wmdw130 with SMTP id w130so103600377wmd.0 for ; Mon, 16 Nov 2015 02:07:17 -0800 (PST) In-reply-to: <87pozepbog.fsf@utu.fi> List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Matti Minkkinen Cc: emacs-orgmode@gnu.org Thanks for the pointer. On Fri, 13-11-2015, at 09:04, Matti Minkkinen wrote: > Ramon Diaz-Uriarte writes: > >> On Thu, 12-11-2015, at 14:11, Matt Price gmail.com> wrote: >> It works! I get the highlighted text (I have just tried with a pdf I >> highlighted with EzPDF, the one that, above, was giving just the location >> and properties). And it still does the correct thing in your pdf (the one >> from repligo). >> Awesome! Thanks. >> R. >> >> link > > I also have a strong interest in getting PDF annotations from Android to > org-mode notes easily, so I've been following this thread with great > interest. > > Very recently, on 11 November, there were a couple of commits to this > pdf-tools fork which also let you export highlighted text: > https://github.com/pinguim06/pdf-tools > > Matt Price already referred to the pull request from that fork, but > those commits are more recent than the original pull request. I think it > uses the same principle as the solution mentioned here but maybe it's > more elaborate. I don't know enough elisp to confirm that. -- Ramon Diaz-Uriarte Department of Biochemistry, Lab B-25 Facultad de Medicina Universidad Autónoma de Madrid Arzobispo Morcillo, 4 28029 Madrid Spain Phone: +34-91-497-2412 Email: rdiaz02@gmail.com ramon.diaz@iib.uam.es http://ligarto.org/rdiaz