From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp12.migadu.com ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms5.migadu.com with LMTPS id MEskEOrPb2IMIAEAbAwnHQ (envelope-from ) for ; Mon, 02 May 2022 14:34:50 +0200 Received: from aspmx1.migadu.com ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp12.migadu.com with LMTPS id SNYhEOrPb2Jb0QAAauVa8A (envelope-from ) for ; Mon, 02 May 2022 14:34:50 +0200 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id CC5CA4E7C for ; Mon, 2 May 2022 14:34:49 +0200 (CEST) Received: from localhost ([::1]:56128 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1nlVGJ-0005EC-TQ for larch@yhetil.org; Mon, 02 May 2022 08:34:48 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:56990) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1nlVEc-0005E3-UI for emacs-orgmode@gnu.org; Mon, 02 May 2022 08:33:03 -0400 Received: from ciao.gmane.io ([116.202.254.214]:40104) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1nlVEb-0007zj-9w for emacs-orgmode@gnu.org; Mon, 02 May 2022 08:33:02 -0400 Received: from list by ciao.gmane.io with local (Exim 4.92) (envelope-from ) id 1nlVEZ-000221-2r for emacs-orgmode@gnu.org; Mon, 02 May 2022 14:32:59 +0200 X-Injected-Via-Gmane: http://gmane.org/ To: emacs-orgmode@gnu.org From: Max Nikulin Subject: Re: [BUG] Exporting italic link with bang inside to html fails to parse the link [9.5.2 (N/A @ /gnu/store/89yvbijwnvsbpa5h33mvbgh1gy9w30n2-emacs-org-9.5.2/share/emacs/site-lisp/org-9.5.2/)] Date: Mon, 2 May 2022 19:32:50 +0700 Message-ID: References: <87v8vng70x.fsf@web.de> <87a6c29aj6.fsf@localhost> <956f9324-a8ea-4766-b0fd-e4dcd364edd7@gmail.com> <8735hu92dh.fsf@localhost> <87czgy6ifx.fsf@localhost> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.7.0 Content-Language: en-US In-Reply-To: <87czgy6ifx.fsf@localhost> Received-SPF: pass client-ip=116.202.254.214; envelope-from=geo-emacs-orgmode@m.gmane-mx.org; helo=ciao.gmane.io X-Spam_score_int: 28 X-Spam_score: 2.8 X-Spam_bar: ++ X-Spam_report: (2.8 / 5.0 requ) BAYES_00=-1.9, DKIM_ADSP_CUSTOM_MED=0.001, FORGED_GMAIL_RCVD=1, FORGED_MUA_MOZILLA=2.309, FREEMAIL_FORGED_FROMDOMAIN=0.249, FREEMAIL_FROM=0.001, HEADER_FROM_DIFFERENT_DOMAINS=0.249, NICE_REPLY_A=-0.001, NML_ADSP_CUSTOM_MED=0.9, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=no autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-orgmode@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+larch=yhetil.org@gnu.org Sender: "Emacs-orgmode" X-Migadu-Flow: FLOW_IN X-Migadu-To: larch@yhetil.org X-Migadu-Country: US ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=yhetil.org; s=key1; t=1651494890; h=from:from:sender:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:list-id:list-help: list-unsubscribe:list-subscribe:list-post; bh=ti8b+Qf37x7m5m9FnC/53aNa7i7YVFlrGFEsey3dnpI=; b=bz/L65MQ0aXBhON/wOKDxmzrOqGFZoiS8bTFX+CogdakDVT4Jary2liHqPCmvXLHT5W6t3 h8yhSkV2wjGfLspg5B4aWMl3Y3cejemmS068iakVGLwqhmwIDGDl3LLZavsez7BocyVf1I bft1bDQ7uNbhi8ATh+H9EmDdb5iQ2tmdT4TJiERXqYYdgQlPYYLLtG/QozmymY4ZvNT0Ou Yc/yLKXwwCIGhq4WTObFtutjw7wB20c1wqN7NYCgQecUYOCWIh3+5b9/Wd1F1sH2F2/tYV gH5/jsmqdDJOrQ+20qsRmEldMV6ar8KFGDHMkWVRw5AohW53CF03UIbCuIlNCQ== ARC-Seal: i=1; s=key1; d=yhetil.org; t=1651494890; a=rsa-sha256; cv=none; b=EBCojqKq1hhp93HV2qLfRwjndJ/98I9HGOJ8+/WC40h3Uukjzy6nYXlvaUYyJXqrrSBoP+ IAiZ2tdO8rpPwH1phUsb3MB1Ikp2N4hA+LlMIhj8ia19bR8SExJ3XvyirqTue7izIBFCS9 pptvYXfeO5h8VjMWTMP40WS++dQcudr6blkzzqSxEk0ghOplkwfYg8vuY3QeaZEGCUWsUK PEWPCRLnS4YTWW7ulSjRiIKgho1GKofdg0wNRApEiUDRKkzCDWLNPRUefwH4RApuNJHGx2 AGy9lEjdzWCAkKCxR/I9hjFdUffFgrd8bGJ+BHtAkz1q1CISpy9u5FwZJ8BXRQ== ARC-Authentication-Results: i=1; aspmx1.migadu.com; dkim=none; dmarc=fail reason="SPF not aligned (relaxed), No valid DKIM" header.from=gmail.com (policy=none); spf=pass (aspmx1.migadu.com: domain of "emacs-orgmode-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="emacs-orgmode-bounces+larch=yhetil.org@gnu.org" X-Migadu-Spam-Score: 4.21 Authentication-Results: aspmx1.migadu.com; dkim=none; dmarc=fail reason="SPF not aligned (relaxed), No valid DKIM" header.from=gmail.com (policy=none); spf=pass (aspmx1.migadu.com: domain of "emacs-orgmode-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="emacs-orgmode-bounces+larch=yhetil.org@gnu.org" X-Migadu-Queue-Id: CC5CA4E7C X-Spam-Score: 4.21 X-Migadu-Scanner: scn1.migadu.com X-TUID: VXj+1xij/ZtJ On 01/05/2022 10:27, Ihor Radchenko wrote: > Max Nikulin writes: > >>>> 2021-09-03 5:17 Dr. Arne Babenhauserheide Bug: PDF Export of Link fails >>>> https://list.orgmode.org/87pmtqp79s.fsf@web.de/T/#u >> >> Then the older bug may be cancelled as a duplicate. > > Not sure. Even a fix to org-insert-link would not solve the problem with > unexpected export if the link is typed in manually. So, I'd rather keep > both the reports for the time being. I would not insist any more. My point was: the same reporter, the same case of punctuation after slash in link target, the same idea to make `org-insert-link' more smart. > Or someone may go through all the related bugs and create a single giant > discussion to avoid scattering things around. In my notes, I have at > least 6 discussions related to edge cases of Org markup. I have some notes as well. Though I think it should be either FAQ entry or a separate document describing limitations of the parser (and test data set for the parser). > I think it is already kind of official. At least, we directly suggest > using zero width spaces in > https://orgmode.org/manual/Escape-Character.html#Escape-Character Things are more complicated. Without a filter (that it is not mentioned) it may cause undesired line breaks (the primary purpose of zero width space). Fortunately PdfLaTeX ignores them. Tom Gillespie. On zero width spaces and Org syntax. Fri, 3 Dec 2021 20:04:28 -0800. https://list.orgmode.org/CA+G3_PM4cxHa8bU+3QG541UiOauLNAQFZQu-+UKczx3itOeTHg@mail.gmail.com suggested word joiner U+2060, but this character is not a space for regular expressions. I experimented a bit, but I can not provide a summary yet, my notes are in early draft stage. the "Escape Character" section should be expanded to discuss more use cases. >> In the case of links I still prefer breaking emphasis at the link >> borders. `org-insert-link' may check after inserting the markup if it >> is parsed as a link and add more markers if necessary. >> ... >> /inter/[[https://orgmode.org/?oops=1][/word/]]/link/ > > I do not like this idea. It is fine when inserting a link into existing > emphasis, but what if an emphasis is applied around link later? We would > also need to update org-emphasize and still have an issue because many > users simply type the emphasis markers manually. Emphasis around other inline objects anyway can be easily broken. Try to make the whole string bold: begin =middle* verbatim= end It may be useful to add a checker to `org-lint' that issues warnings for confusing link targets. I believe that zero width space does not belong to "plain text markup" since it is invisible (at least by default). I see that printable ASCII characters are already in use, but I still think that U+200B should be used as rare as possible. You are aware of my opinion now and I do not need more. You are free to ignore it since I can not offer anything better.