From: Alan Schmitt
Subject: Re: How do you store web pages for reference?
Date: Mon, 16 Jan 2017 16:41:43 +0100
In-Reply-To: <2017-01-16T15-41-12@devnull.Karl-Voit.at> (Karl Voit's message of "Mon, 16 Jan 2017 15:43:25 +0100")
To: Karl Voit
Cc: Karl Voit, emacs-orgmode@gnu.org

Hello Karl,

On 2017-01-16 15:43, Karl Voit writes:

> On Mon, Jan 16 2017 at 09:48:38 am, Alan Schmitt wrote:
>
>> I'm looking for a workflow that allows me to save a web page for
>> reference, ideally from Firefox. I know of org-protocol-capture-html
>> (https://github.com/alphapapa/org-protocol-capture-html), which is
>> perfect for pure-text pages, but I'm also looking for a solution for
>> images-heavy pages. I've tried to simply save the page to PDF, but it
>> does not preserve the links.
>
> I am using the Firefox plugin Shelve[1] which stores all of my web
> pages visited. Those HTML files are written with an ISO time-stamp
> in their file name. Therefore, my Memacs filename module (see sig)
> is indexing all visited URLs and they appear on my agenda.
>
> So I do have a direct link between my agenda and the HTML files of
> all web pages I have visited.
>
> [1] https://addons.mozilla.org/en-US/firefox/addon/shelve/

This plugin looks interesting, but it seems to rely on the existing
functionality of Firefox to save web pages. As I want to save a page
with its pictures and CSS, I would need to choose “Web page, complete”,
but the FF documentation says “This choice allows you to view it as
originally shown with pictures, but it may not keep the HTML link
structure of the original page”, which worries me a little.

Do you save only the HTML, or the pictures as well? If it's the latter,
have you had any issues with links not being preserved?

Thanks,

Alan

-- 
OpenPGP Key ID : 040D0A3B4ED2E5C7
Monthly Atmospheric CO₂, Mauna Loa Obs. 2016-12: 404.48, 2015-12: 401.85