From mboxrd@z Thu Jan 1 00:00:00 1970 From: Bill Jacobson Subject: Unwanted percent-encoding capturing data from Firefox Date: Sun, 19 Jun 2011 11:43:49 -0500 Message-ID: Mime-Version: 1.0 Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 8bit Return-path: Received: from eggs.gnu.org ([140.186.70.92]:58414) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1QYLCP-0002U5-0A for emacs-orgmode@gnu.org; Sun, 19 Jun 2011 12:50:14 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1QYLCM-00060p-Qw for emacs-orgmode@gnu.org; Sun, 19 Jun 2011 12:50:12 -0400 Received: from lo.gmane.org ([80.91.229.12]:48354) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1QYLCM-0005zU-DD for emacs-orgmode@gnu.org; Sun, 19 Jun 2011 12:50:10 -0400 Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1QYLCI-0005r9-72 for emacs-orgmode@gnu.org; Sun, 19 Jun 2011 18:50:06 +0200 Received: from c-67-176-217-52.hsd1.il.comcast.net ([67.176.217.52]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Sun, 19 Jun 2011 18:50:06 +0200 Received: from gabedad by c-67-176-217-52.hsd1.il.comcast.net with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Sun, 19 Jun 2011 18:50:06 +0200 List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org For some months now, I've been successfully capturing Firefox data as explained here: http://orgmode.org/worg/org-contrib/org-protocol.html But the results below demonstrate a problem. The 2nd and 3rd represent what was captured into Emacs, the quoted line in each case being what was highlighted in Firefox. Emacs version is: GNU Emacs 24.0.50.1 (x86_64-unknown-linux-gnu, GTK+ Version 2.22.0) of 2011-05-03 Manually copied: [[http://www.tbray.org/ongoing/When/201x/2011/05/31/Browsers][ongoing by Tim Bray · Me and My Browsers]] "I like that Chrome’s fast, and I really like that it’s robust" Org-mode version 7.4: [[http://www.tbray.org/ongoing/When/201x/2011/05/31/Browsers][ongoing by Tim Bray · Me and My Browsers]] "I like that Chrome’s fast, and I re­-ally like that it’s ro­-bust" Org-mode version 7.4 (release_7.5.409.g4f3a3) == latest [[http%3A%2F%2Fwww.tbray.org%2Fongoing%2FWhen%2F201x%2F2011%2F05%2F31%2FBrowsers][ongoing by Tim Bray %C2%B7 Me and My Browsers]] "I like that Chromeâ%80%99s fast%2C and I re%C2%ADally like that itâ%80%99s ro%C2%ADbust" I don't yet have the Elisp chops to see where this should be mended. (The bogus hyphens are only a minor concern but they're a puzzle. Apparently some Javascript is sprinkling the text with soft hyphens which are removed when copying via the browser but not when capture copies using window.getSelection()) Thanks, Bill