From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mathias Bauer Subject: Bug: text export and multi-word link descriptions with line breaks Date: Thu, 3 Apr 2014 16:28:34 +0200 Message-ID: <20140403142834.GA27238@gmx.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:43094) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WVidJ-0006pb-6v for emacs-orgmode@gnu.org; Thu, 03 Apr 2014 10:28:50 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1WVidB-00028h-UQ for emacs-orgmode@gnu.org; Thu, 03 Apr 2014 10:28:45 -0400 Received: from mout.gmx.net ([212.227.17.22]:50374) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WVidB-00028T-Kd for emacs-orgmode@gnu.org; Thu, 03 Apr 2014 10:28:37 -0400 Received: from mail.internal ([87.175.187.230]) by mail.gmx.com (mrgmx103) with ESMTPSA (Nemesis) id 0LaXmV-1WtDDw1UPn-00mKdU for ; Thu, 03 Apr 2014 16:28:35 +0200 Received: from localhost by localhost with ESMTP id 59987117860 for ; Thu, 3 Apr 2014 16:28:34 +0200 (CEST) Received: from localhost by localhost with LMTP id xiSkJucAHNiR for ; Thu, 3 Apr 2014 16:28:34 +0200 (CEST) Content-Disposition: inline List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org Dear Maintainers, I just stumbled over Org's plain text export and how it works on links with descriptions consisting of multiple words and line breaks between them. I'm running Org stable version 8.2.5h. Org source (spaces at the end of line 1 and 2 don't matter): --------------------snip-------------------- "OpenPGP Message Format" ([[https://tools.ietf.org/html/rfc4880][RFC 4880]] which obsoletes [[https://tools.ietf.org/html/rfc1991][RFC 1991]] and [[https://tools.ietf.org/html/rfc2440][RFC 2440]])... ... foo [[https://tools.ietf.org/html/rfc4880][RFC 4880]] bar baz [[https://tools.ietf.org/html/rfc1991][RFC 1991]] foo bar [[https://tools.ietf.org/html/rfc2440][RFC 2440]] baz --------------------snip-------------------- Text export result: --------------------snip-------------------- "OpenPGP Message Format" ([RFC 4880] which obsoletes [RFC 1991] and [RFC 2440])... ... foo [RFC 4880] bar baz [RFC 1991] foo bar [RFC 2440] baz [RFC 4880] https://tools.ietf.org/html/rfc4880 [RFC 1991] https://tools.ietf.org/html/rfc1991 [RFC 2440] https://tools.ietf.org/html/rfc2440 [RFC 4880] https://tools.ietf.org/html/rfc4880 [RFC 1991] https://tools.ietf.org/html/rfc1991 --------------------snip-------------------- These multiple references look quite bad. Is it possible to "normalize" the descriptions in some way *before* checking them for uniqueness and output them thereafter? Thanks for considering this issue. Kind regards Mathias