From mboxrd@z Thu Jan 1 00:00:00 1970 From: charles@aurox.ch (Charles A. Roelli) Subject: Re: org-board -- bookmarking and archival Date: Fri, 16 Sep 2016 20:40:25 +0200 Message-ID: References: <87oa3pdrca.fsf@alphapapa.net> Mime-Version: 1.0 Content-Type: text/plain Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:55301) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bky44-0005mT-MV for emacs-orgmode@gnu.org; Fri, 16 Sep 2016 14:40:45 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1bky40-0003jK-Eo for emacs-orgmode@gnu.org; Fri, 16 Sep 2016 14:40:43 -0400 Received: from [2a03:2040:2:1b::1] (port=37478 helo=sinyavsky.aurox.ch) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bky3z-0003eI-Sn for emacs-orgmode@gnu.org; Fri, 16 Sep 2016 14:40:40 -0400 Received: from sinyavsky.aurox.ch (sinyavsky.aurox.ch [127.0.0.1]) by sinyavsky.aurox.ch (Postfix) with ESMTP id DD5002220A for ; Fri, 16 Sep 2016 18:39:47 +0000 (UTC) Received: from sinyavsky.aurox.ch ([127.0.0.1]) by sinyavsky.aurox.ch (sinyavsky.aurox.ch [127.0.0.1]) (amavisd-new, port 10026) with ESMTP id LkwqmWmZGkFv for ; Fri, 16 Sep 2016 18:39:45 +0000 (UTC) In-Reply-To: <87oa3pdrca.fsf@alphapapa.net> (Adam Porter's message of "Thu, 15 Sep 2016 12:07:33 -0500") List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: "Emacs-orgmode" To: Adam Porter Cc: emacs-orgmode@gnu.org Adam Porter writes: > Hi Charles, > > Thanks for sharing that, I will check it out. As was mentioned, it > seems ripe for integrating with browser capture. On that note, have you > seen org-protocol-capture-html? For articles that are primarily text, > I've been capturing articles directly in Org format, but your package > sounds good for capturing pages as-is. Thanks for letting me know about org-protocol-capture-html, I had not seen it. Capturing text directly to an Org file sounds more manageable. > By the way, you might want to consider integrating something like > Readability or the Python package python-readability (aka > readability-lxml) for reducing web pages to the primary content. It's > worked out well in org-protocol-capture-html. Great idea, maybe as part of a post-processing hook? Then we could save the HTML as a backup (for later web browsing) and then include the primary text in the Org file for easy viewing straight from Emacs. Seems your package is already well-suited to that part. :) I also wanted to keep the design relatively abstract so that things like this could be added later. One other feature idea that could be implemented as a post-processing hook is responding to "downloadable" links (like links to YouTube videos) by running a backend program (in this case, "youtube-dl") to go take care of fetching the apprapriate content. > By the way, here's some code I've been using to read and/or capture >web > pages from URLs on the clipboard: > [...] It's helpful to see an example of org-capture in use, I still have more to learn about it. I'll put a little example in the README for org-board. Cheers, Charles