From: charles@aurox.ch (Charles A. Roelli)
Subject: Re: How do you store web pages for reference?
Date: Mon, 16 Jan 2017 11:38:38 +0100
To: Alan Schmitt
Cc: emacs-orgmode@gnu.org

Hi Alan,

On Mon, Jan 16 2017 at 09:48:38 am, Alan Schmitt wrote:

> I'm looking for a workflow that allows me to save a web page for
> reference, ideally from Firefox. I know of org-protocol-capture-html
> (https://github.com/alphapapa/org-protocol-capture-html), which is
> perfect for pure-text pages, but I'm also looking for a solution for
> images-heavy pages. I've tried to simply save the page to PDF, but it
> does not preserve the links.

You might also want to try org-board:

https://github.com/scallywag/org-board

It offers archiving, diffing between archives, and anything `wget' can
do (see its manual for more details).  I haven't integrated it with
Firefox, but if you can

a) Get the current URL from Firefox,
b) Send it to Emacs,
c) Open a dedicated web bookmark file buffer, create a heading for the
   URL, and run `org-board-archive',

then org-board would take care of archiving the site exactly as you see
it in the browser.

Cheers,
Charles
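
A minimal sketch of steps b) and c) above, not a tested integration. It assumes an Emacs server is running so that `emacsclient --eval` can reach it, that org-board is installed, that org-board picks up the address from a `URL` property on the heading, and that `org-board-archive` can be called without arguments from Lisp. The helper names and the bookmark file path are hypothetical. Step a), getting the URL out of Firefox, is left to whatever mechanism you prefer.

```python
import subprocess


def elisp_string(text):
    """Quote TEXT as an Emacs Lisp string literal."""
    return '"' + text.replace("\\", "\\\\").replace('"', '\\"') + '"'


def build_archive_form(url, bookmark_file):
    """Return an Emacs Lisp form that files URL under a new heading
    in BOOKMARK_FILE and then asks org-board to archive it."""
    parts = [
        "(progn",
        "(require 'org-board)",
        # Step c), part 1: open the dedicated bookmark file.
        "(find-file " + elisp_string(bookmark_file) + ")",
        # Step c), part 2: append a top-level heading named after the URL.
        "(goto-char (point-max))",
        '(insert "\\n* " ' + elisp_string(url) + ' "\\n")',
        # Assumption: org-board reads the address from the URL property.
        '(org-entry-put nil "URL" ' + elisp_string(url) + ")",
        # Step c), part 3: run the archiving command named in the mail.
        "(org-board-archive))",
    ]
    return " ".join(parts)


def emacsclient_command(url, bookmark_file):
    """Build the argv for step b): hand the form to a running Emacs."""
    return ["emacsclient", "--eval", build_archive_form(url, bookmark_file)]


def archive_url(url, bookmark_file):
    """Send URL to Emacs and trigger the archive; raises if emacsclient fails."""
    subprocess.run(emacsclient_command(url, bookmark_file), check=True)
```

Calling `archive_url("https://example.org/", "~/bookmarks.org")` would then append the heading and start the archive. Building the form separately from running it makes it easy to print the Lisp and evaluate it by hand first.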