From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp0 ([2001:41d0:8:6d80::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms0.migadu.com with LMTPS id qBsbNVEeqWHH+wAAgWs5BA (envelope-from ) for ; Thu, 02 Dec 2021 20:28:17 +0100 Received: from aspmx1.migadu.com ([2001:41d0:8:6d80::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp0 with LMTPS id 8FDAMFEeqWH3GgAA1q6Kng (envelope-from ) for ; Thu, 02 Dec 2021 19:28:17 +0000 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id 992A88AFE for ; Thu, 2 Dec 2021 20:28:17 +0100 (CET) Received: from localhost ([::1]:60606 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1msrke-0003eB-PP for larch@yhetil.org; Thu, 02 Dec 2021 14:28:16 -0500 Received: from eggs.gnu.org ([209.51.188.92]:44950) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1msrT1-0007VG-B9 for emacs-orgmode@gnu.org; Thu, 02 Dec 2021 14:10:03 -0500 Received: from mout01.posteo.de ([185.67.36.65]:36759) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1msrSy-0006OS-Mg for emacs-orgmode@gnu.org; Thu, 02 Dec 2021 14:10:03 -0500 Received: from submission (posteo.de [89.146.220.130]) by mout01.posteo.de (Postfix) with ESMTPS id 95BF324002A for ; Thu, 2 Dec 2021 20:09:58 +0100 (CET) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=posteo.net; s=2017; t=1638472198; bh=X76mQjRxJx/aO/W3tKYoPVC14lCnT+dcnDLzdfaJmvY=; h=From:To:Cc:Subject:Date:From; b=ql3TaAT2/44UnIxS6gsXTAU7iZTut0s7XxsuxdFlE1h7+JdgvJAFuZhyZ8E6GfdHG 9PfPFQ7a/p7rPCwt+6CdqnzhsUDBplNZn+srOoa3+L89LyrUQr5ReXXD66FcRhpj6r rs1VVqyDlnXN7Xhy1wKhzAI3d4uoxBaraZs2/QZol5I9lMDioHywvfWfqmFcRaNpPu F9is6yNwHgrjKmgRFfHqIcEZyBkcSpHF5makYK0Zao75EseskxUyXwp+dRrMCeJOcq YiDuI+XidBxHyRLnV7Hhf0vjMjzmWRJtaxlXeizsk6D86IeZIkiZ7n2vHI/Z2E6iyx hWzQrB/AnMBPQ== Received: from customer (localhost [127.0.0.1]) by submission (posteo.de) with ESMTPSA id 4J4lrj73P0z9rxM; Thu, 2 Dec 2021 20:09:57 +0100 (CET) From: =?utf-8?Q?Juan_Manuel_Mac=C3=ADas?= To: Tom Gillespie Subject: Re: Org-syntax: Intra-word markup References: <4897bc60-b74f-ccfd-e13e-9b89a1194fdf@mailbox.org> <87fsrbp673.fsf@gmail.com> <1ef0e093-c165-2a5f-954d-6a33b64c8ee9@mailbox.org> <87r1avgnpi.fsf@localhost> Date: Thu, 02 Dec 2021 19:09:55 +0000 In-Reply-To: (Tom Gillespie's message of "Thu, 2 Dec 2021 10:11:14 -0800") Message-ID: <87czmekem4.fsf@posteo.net> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Received-SPF: pass client-ip=185.67.36.65; envelope-from=maciaschain@posteo.net; helo=mout01.posteo.de X-Spam_score_int: -43 X-Spam_score: -4.4 X-Spam_bar: ---- X-Spam_report: (-4.4 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_MED=-2.3, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-orgmode@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: orgmode Errors-To: emacs-orgmode-bounces+larch=yhetil.org@gnu.org Sender: "Emacs-orgmode" X-Migadu-Flow: FLOW_IN X-Migadu-Country: US ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=yhetil.org; s=key1; t=1638473297; h=from:from:sender:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:list-id:list-help: list-unsubscribe:list-subscribe:list-post:dkim-signature; bh=97jSmcoIsI/91aPN7e5p5XmNbGTLbN/MaaBqcQhXbqc=; b=SO30xan+rd6tkHhZ76M/nJyK5QcYfkRP7aNHPUBfVf+ghROSwD82y1oxfM78sCkGqLGOOA IOyp36h7GmcOK08iUTTAK1wdjjxliId5RXdgnArVABXi2zwTX0aEDH/QlVhh/1frIKEKty KlXurBYIlEXaNaZb9z7/r9kmBjoHu6OAr1zGwFqMdu+D+CvEcung0F4GbZnb59u35+Dxws Kzj4bBtQhhdd5t85sjRXn3Q+1brX67EjXdpzvMA6XxbuSFic2uH/E9+6TlqLecgIrqhZQU 0QhAbbb8vXF/aUw+54515RWWDh6ivGnYDzvgrf3JZoPB76IIY5YySeiYPn55Xg== ARC-Seal: i=1; s=key1; d=yhetil.org; t=1638473297; a=rsa-sha256; cv=none; b=c8fa5Ql3704JlItGvY8Qp+5dsH84rc0yPzMwKj+R4YXgrf3W0vVBZ0tMxvUwn3wwrgLm9A QTbIrSnlWWkkbvZs2dhdtmIPtH6ikqj4h1UcmXwy0VWs3Va5EOirIWku3qJFrp8BfEv8Vo 0qOZo1fE3sq+gdMDMTHvD7hifBsYGV9+tULcXlPrFJyar9ZMHnTUd7CB3gY1AfvBgotG2L eFzA0CqoZQ6Ht2AU4TbYhxVIJuO0LlN5to2dHj3U63Rjrc1ecsCtN0TFhMkhLES2Z6y2bu yqJDH4E1949evgb6J/weWW4Pdi1KXu4VImC1XCPXJBI5r1zFWgoeWo61jxePrg== ARC-Authentication-Results: i=1; aspmx1.migadu.com; dkim=pass header.d=posteo.net header.s=2017 header.b=ql3TaAT2; dmarc=pass (policy=none) header.from=posteo.net; spf=pass (aspmx1.migadu.com: domain of "emacs-orgmode-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="emacs-orgmode-bounces+larch=yhetil.org@gnu.org" X-Migadu-Spam-Score: -4.32 Authentication-Results: aspmx1.migadu.com; dkim=pass header.d=posteo.net header.s=2017 header.b=ql3TaAT2; dmarc=pass (policy=none) header.from=posteo.net; spf=pass (aspmx1.migadu.com: domain of "emacs-orgmode-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="emacs-orgmode-bounces+larch=yhetil.org@gnu.org" X-Migadu-Queue-Id: 992A88AFE X-Spam-Score: -4.32 X-Migadu-Scanner: scn0.migadu.com X-TUID: CWr7I8CRbPPG Tom Gillespie writes: > I don't mean to be a wet blanket, but the edge cases for > the current markup syntax are already hard enough to > implement correctly, to the point where different parts of > Org mode are inconsistent. Intra-word markup isn't viable > because there simply isn't any sane way to parse something > like *hello world*/hrm/oh no*. The other issue is that this will > degrade parsing performance because almost every > character could precede the start of a markup section. > > I recommend anyone suggesting solutions try to implement > something that can parse the markup unambiguously with > lots of nasty test cases. You will likely find that it is impossible > to consistently tokenize markup, and that you have to hand > write a whole bunch of heuristics, making Org syntax even > harder to implement correctly. > > Any solution that suggests extending how =3D/*~+_ can be > used gets a hard no from me. I could see teaching other > exporters how to interpret \emph{hello}world, but trying for > to have any sane behavior for something like > why *hello*world oh no a wild askterisk* > is not worth it. I believe, that emphasis marks are a part of Org that can be very shocking to new users. I mean, there is a series of behaviors that seem obvious and trivial in the emphasized text, but that in Org are not possible out of the box, unless you configure `org-emphasis-regexp-components'. Three quick examples. This in Org is not possible out of the box: #+begin_example [/emphasis/] =C2=A1/emphasis/! =C2=BF/Emphasis/? #+end_example Nor is it possible ---out of the box--- to extend emphasis beyond a certain number of lines. New users who come from other forms of markup maybe expect the obvious to be something like: some-text begin-emphasis whatever-is-in-between end-emphasis more-text Over time one ends up seeing these things more as a feature than as a bug :-) But those little inconsistencies make the Org syntax a bit ugly, IMHO. I can't think of how to improve that, though. Best regards, Juan Manuel=20