From: Matthew Lundin
To: Ihor Radchenko, numbchild@gmail.com, Org Mode
Subject: Re: [Feature] add a new org-attach dispatcher command to offline save web page
In-Reply-To: <87r1v4bodg.fsf@localhost>
References: <87sgflu2gw.fsf@gmail.com> <87r1v4wyy4.fsf@fastmail.fm> <87r1v4bodg.fsf@localhost>
Date: Thu, 28 May 2020 11:19:35 -0500
Message-ID: <874ks0vxpk.fsf@fastmail.fm>
List-Id: "General discussions about Org-mode."

Ihor Radchenko writes:

> It does not mean that attaching URL directly is not worth including
> into org. This sounds pretty common use case, especially considering
> the number of packages providing similar feature. You mentioned
> org-board, but there is also org-download and org-web-tools.

My uneasiness has more to do with the specificity of the dependence on
monolith and the way that is hard-coded into the patch.
When it comes to patches, I think priority should go to those that are
configurable, accessible, and useful for everyone, as opposed to those
that have hard-coded workflows or highly specific user configurations.

> I agree that monolith is completely uncommon tool and I would not expect
> the majority of users to have it installed, but the same functionality
> utilising built-in url.el (as a default) should be acceptable.

The question is: which functionality? A simple downloading tool or a
full archival tool? Achieving similar functionality to org-board or
monolith would be a big task, since they aim to download an archival
version of a webpage (including all resources). In addition, with
archiving you also quickly run into the complexity of versioning based
on time archived. There's also the challenge of mapping the downloaded
files to metadata (specifically the original url). Org-board currently
handles both of these very well.

I suppose there would be a few options depending on what the aims are:

1. At the simple end, include little more than a quick and dirty way
   of downloading a single resource (html, pdf, jpeg) using url.el or
   wget (or optionally, monolith) and putting that in the attachment
   folder. Those who want full archiving of all resources could use
   other tools like org-board or org-web-tools.

2. At the (much) more complex end, code out a robust archiving
   solution on top of url.el or wget.

3. Another, possibly simpler option... Add a command to the dispatcher
   that allows the user to invoke a custom function that is called
   with the attachment directory as the default-directory. This would
   enable more end-user flexibility, such as the ability to use
   wkhtmltoimage/wkhtmltopdf, monolith, phantom.js, archive.is, etc.

Best,

Matt
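
P.S. To make option 3 a bit more concrete, here is a rough, untested sketch. It assumes a recent Org in which `org-attach-commands` is a user option holding (KEYS COMMAND DOCSTRING) entries. All of the `my/` names and the `?x` key are made up for illustration; none of this is from the patch under discussion.

```elisp
(require 'org-attach)
(require 'url-handlers)

;; Hypothetical user option: a function of no arguments that gets
;; called with `default-directory' bound to the attachment directory.
(defvar my/org-attach-custom-function #'my/org-attach-fetch-url
  "Function run inside the attachment directory of the current entry.")

;; Example custom function (option 1 in miniature): fetch a single
;; resource with url.el.  `read-file-name' starts in `default-directory',
;; so the file lands in the attachment directory unless the user
;; navigates elsewhere.
(defun my/org-attach-fetch-url ()
  "Prompt for a URL and save it into the current directory."
  (let ((url (read-string "URL to attach: ")))
    (url-copy-file url (read-file-name "Save as: "))))

(defun my/org-attach-run-custom ()
  "Call `my/org-attach-custom-function' in the attachment directory."
  (interactive)
  ;; A non-nil first argument asks `org-attach-dir' to create the
  ;; directory if it does not exist yet.
  (let ((default-directory
         (file-name-as-directory (org-attach-dir 'create))))
    (funcall my/org-attach-custom-function))
  ;; Let org-attach notice files that were added externally.
  (org-attach-sync))

;; Register the command in the dispatcher under the (assumed free) key "x".
(add-to-list 'org-attach-commands
             '((?x) my/org-attach-run-custom
               "Run a custom function in the attachment directory.")
             t)
```

Someone who prefers monolith or wkhtmltopdf would then only need to point `my/org-attach-custom-function` at a function that shells out to that tool, and nothing tool-specific would have to live in Org itself.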