From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp10.migadu.com ([2001:41d0:2:bcc0::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms0.migadu.com with LMTPS id SHfHMgaY5mFePwEAgWs5BA (envelope-from ) for ; Tue, 18 Jan 2022 11:35:50 +0100 Received: from aspmx1.migadu.com ([2001:41d0:2:bcc0::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp10.migadu.com with LMTPS id qAJwKwaY5mF//AAAG6o9tA (envelope-from ) for ; Tue, 18 Jan 2022 11:35:50 +0100 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id 6C1B13A7D0 for ; Tue, 18 Jan 2022 11:35:50 +0100 (CET) Received: from localhost ([::1]:36180 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1n9lq8-0000Yp-Gk for larch@yhetil.org; Tue, 18 Jan 2022 05:35:48 -0500 Received: from eggs.gnu.org ([209.51.188.92]:33098) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1n9lpE-0000YY-P9 for emacs-orgmode@gnu.org; Tue, 18 Jan 2022 05:34:52 -0500 Received: from ciao.gmane.io ([116.202.254.214]:39216) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1n9lpB-0000O0-Vq for emacs-orgmode@gnu.org; Tue, 18 Jan 2022 05:34:51 -0500 Received: from list by ciao.gmane.io with local (Exim 4.92) (envelope-from ) id 1n9lp7-0004pF-Ip for emacs-orgmode@gnu.org; Tue, 18 Jan 2022 11:34:45 +0100 X-Injected-Via-Gmane: http://gmane.org/ To: emacs-orgmode@gnu.org From: Max Nikulin Subject: Re: Yet another browser extension for capturing notes - LinkRemark Date: Tue, 18 Jan 2022 17:34:36 +0700 Message-ID: References: <20201225142604.GC1140@maokai> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.14.0 X-Mozilla-News-Host: news://news.gmane.io In-Reply-To: Content-Language: en-US Received-SPF: pass client-ip=116.202.254.214; envelope-from=geo-emacs-orgmode@m.gmane-mx.org; helo=ciao.gmane.io X-Spam_score_int: 28 X-Spam_score: 2.8 X-Spam_bar: ++ X-Spam_report: (2.8 / 5.0 requ) BAYES_00=-1.9, DKIM_ADSP_CUSTOM_MED=0.001, FORGED_GMAIL_RCVD=1, FORGED_MUA_MOZILLA=2.309, FREEMAIL_FORGED_FROMDOMAIN=0.249, FREEMAIL_FROM=0.001, HEADER_FROM_DIFFERENT_DOMAINS=0.248, NICE_REPLY_A=-0.001, NML_ADSP_CUSTOM_MED=0.9, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=no autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-orgmode@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+larch=yhetil.org@gnu.org Sender: "Emacs-orgmode" X-Migadu-Flow: FLOW_IN X-Migadu-Country: US ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=yhetil.org; s=key1; t=1642502150; h=from:from:sender:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:list-id:list-help: list-unsubscribe:list-subscribe:list-post; bh=u7fkpeRF7WFh/G9/9hPtaydbSn2o0Sd+a9dcGDJtvLc=; b=uTAZUYwMsyNt9lCLAKdcS0KxC94hNS50QGXRd3O7MVWHKwxQIEIdyPnotgTsCSd5WOHVuP cBNmptgCkX2kei0YpM18PMK3B7egM5IblespTDjXwbghvng+tjDyaSNrqvA935afQGRY3l /BVxAfOtuwuFWGeqZdbIWHmEMRnuUgvUsAwPMhgHfI7LgYev9ldneSOiyyXaCFrPKElnvv wbFqQHzsi+ascET9KR9xo+CRP88pp3knRc/K1/Ohj0R+YZJKYjOQXAKnJxde+THlTIgPCJ XOLmIv6D8NZ1wfV5x9QH9lZEna3kNF2zumQzOsGPlAsYv3cZG+MzT8gDq7s0IA== ARC-Seal: i=1; s=key1; d=yhetil.org; t=1642502150; a=rsa-sha256; cv=none; b=Bq0HfdQ+wxfsZi+NqLOeOoOfVGS7jxFJAZdvAW0q05c3XlquvvYeNw8IbFXVtnQrqmMC4w t2kFXrtR8n+S7oC92iWm/O5L2h2LQC0jMHIvyyznVJBVbyUIjM2QLtBZ6A0VmlM3DJt2OR Vd+X20rbrD2x6w4MfFmPPObhKHagSZVfGMTWzZX72rc2CnQsRBZ+9DcPkk7FQGwOolSp3N Ipyt9gqMEfD8GAhyswlBZMDP16miC0gp2zIgqiEF6+WH1/2bo1oAFCmxtObLwaZqiUVU/p 67Sv1nhZE3L+F4uZnB8V3PNa9woAqJVnuJyvLn4tGKXyffHbrnZ5fL2TC60ySw== ARC-Authentication-Results: i=1; aspmx1.migadu.com; dkim=none; dmarc=fail reason="SPF not aligned (relaxed), No valid DKIM" header.from=gmail.com (policy=none); spf=pass (aspmx1.migadu.com: domain of "emacs-orgmode-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="emacs-orgmode-bounces+larch=yhetil.org@gnu.org" X-Migadu-Spam-Score: -2.82 Authentication-Results: aspmx1.migadu.com; dkim=none; dmarc=fail reason="SPF not aligned (relaxed), No valid DKIM" header.from=gmail.com (policy=none); spf=pass (aspmx1.migadu.com: domain of "emacs-orgmode-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="emacs-orgmode-bounces+larch=yhetil.org@gnu.org" X-Migadu-Queue-Id: 6C1B13A7D0 X-Spam-Score: -2.82 X-Migadu-Scanner: scn0.migadu.com X-TUID: sUJLOeBR/vYr Samuel, since significant part of your message is dedicated to capturing of tab groups I should ask if you have tried version of LinkRemark add-on currently available from browser extension catalogues: - https://addons.mozilla.org/firefox/addon/linkremark/ - https://chrome.google.com/webstore/detail/mgmcoaemjnaehlliifkgljdnbpedihoe Groups of tabs or selected (highlighted) tabs are supported for Chromium, Firefox has no built-in tab groups, but it is still possible to capture selected tabs. Your feature requests: - Clean-up URLs. I have such idea, but I have not approached to implementation of it. Maybe URLs should be sent to another extension that excels in such task. If you have come comments which add-ons are great and which work rather poor, the suggestions my be helpful. - Deduplicate URLs from tab groups. It requires some work to merge selected text, links, or nested frames from each tab. The complication is that some sites use internal navigation not reflected in location, so the same URL may have completely different content. Some sites have their top pages as canonical URLs, so some measures against false positives is required. Currently the extension may check if URL already present in org files. It requires https://github.com/maxnikulin/burl helper application that is in proof-of concept stage. - Restore set of tabs. It requires some elisp code to iterate over subtree and to pick first "Link URL" or "URL" from description lists. Currently I am thinking on some changes of interface since sometimes I just want to check if some URL is in my notes already. I would prefer to avoid adding more context menu items. Additional details are inline. On 17/01/2022 09:29, Samuel Wales wrote: > On 12/26/20, Maxim Nikulin wrote: >> On 26/12/2020, Samuel Wales wrote: >> >>> [... i can imagine great things possible with such extensions. for >>> example, you could have sets of tabs, selected by right click in >>> firefox, to save to a bunch of org entries. then you could load that >>> particular set of entries into firefox whenever you want. > > interesting. i do note tab selection features in recent firefox-esr > and i was just assuming something like that. There is no a ready to use recipe for loading saved tabs, but saving should work to some extent. >>> You can do this with the "Copy all URLs" extension (ID: >>> djdmadneanknadilpjiknlnanaolmbfk). Use this as the custom format (note >>> the linebreak): >> >> I am almost sure that similar extension should exist for Firefox as well. > > i think this is for copying all tabs, not selected ones. ... > also i think this extension does not exist any more in firefox. I have not tried them: - https://github.com/piroor/copy-selected-tabs-to-clipboard/ - https://github.com/yorkxin/copy-as-markdown >> - Are you going to capture reviews of "rice cookers" that could be >> considered as ordinary pages or you are going to save items from online >> stores? ... >> Could you inspect head element of pages in >> your favorite stores contains desired metadata using page source or >> inspect element tools? > > my web knowledge is too limited to understand your question, but i am > just hoping it would capture ordinary amazon links, review sites, and > so on. It seems that quality of metadata in marketplaces like amazon severely depends on particular seller. The extension attempts to treat some data specially if there are microdata or JSON-LD with Product schema.org type. If I remember correctly, Amazon does not expose canonical link explicitly. >>> [now if i can only debug the extra-blank-lines-in-capture problem.] >> >> Fully agree that it is really annoying. It is among high priority items >> in my TODO list. > > we might be talking about different thinks. i am referring to > something in org that adds blank lines when my particular org capture > templates are used. See info "(org) Template elements" https://orgmode.org/manual/Template-elements.html :empty-lines, :empty-lines-after, :empty-lines-before however I can not say that I really understand their meaning. Actually I do not mind to have empty line before next heading when refile is completed. My impression that it depends on number of empty lines at the end of capture buffer. I usually add some comments to captured pages. On 18/01/2022 08:03, Samuel Wales wrote: > my amazon example was silly and confusing. the point isn't shopping > for something; it's anything. science papers, news outlets, nerd > blogs. Scientific papers require more work, it is necessary to make them available to org-cite somehow. Some nerds use quite peculiar blog engines and strange setting of metadata. So shopping on some sites might work better than other cases.