From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp11.migadu.com ([2001:41d0:403:4789::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms9.migadu.com with LMTPS id CJ7PDHZftmQv0gAASxT56A (envelope-from ) for ; Tue, 18 Jul 2023 11:46:30 +0200 Received: from aspmx1.migadu.com ([2001:41d0:403:4789::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp11.migadu.com with LMTPS id aFquDHZftmSH2AAA9RJhRA (envelope-from ) for ; Tue, 18 Jul 2023 11:46:30 +0200 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id 0E72C42459 for ; Tue, 18 Jul 2023 11:46:30 +0200 (CEST) Authentication-Results: aspmx1.migadu.com; dkim=pass header.d=posteo.net header.s=2017 header.b=b3fW7Igp; spf=pass (aspmx1.migadu.com: domain of "emacs-orgmode-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="emacs-orgmode-bounces+larch=yhetil.org@gnu.org"; dmarc=pass (policy=none) header.from=posteo.net ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=yhetil.org; s=key1; t=1689673590; h=from:from:sender:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type:in-reply-to:in-reply-to: references:references:list-id:list-help:list-unsubscribe: list-subscribe:list-post:dkim-signature; bh=t8+zfUb4OoLx55NCucdFJ1bLJUOywu6JBXkcr474cic=; b=dunfLKjSPWVGRCZNLjR/1/Mbt9AsUO/yn8qcO+//dHZ+P6Zt4K5IuNg+ForTJxK+zsOB5v IyPWhXj3SPYdyduZUgqqWYDdH/MOAndU5p4f1JYysADH6SZC+bJVvrMPrNNjaBv/SQq95N 2cdkbk5rjN1gfN+kYqyd+57OlkRRqHsez10u9iZXJDLiSNaYtxHFzg4foiF5VxIbpdD/vJ /lTgcbCQNYy46BqeTewNAAMoKMEXewv7Fedr+zcqx9BRvr0mKVh89bl/YtHsgwDl7BIKyx 6/bLOUpE0QpCp1upJ01hatFS8mXmpxEXv1x/rxvs1nGLPPSxm0S2JjFyukJTPw== ARC-Seal: i=1; s=key1; d=yhetil.org; t=1689673590; a=rsa-sha256; cv=none; b=akZAWvfYXM/2/Uk/X9EPrTUZ8Uj0t4xgUp8QVNmgvEWUMlN3wNZAknu/drs0O8seOcM6H2 JaaXR1rC2VMU1uVuB+AInMSW9R6jEBG5Unf3Y0nH7qDisiWHCKJPHv3g8KzDoR4/mqPbJ2 ACXqhXSuJkQio/sbBj7OYipC70onrxEBTpVxQQvXk7Wne+asuR2DrIHEYx5UBrew2EWOKI Vv1NVyHpRGfXIScQzNBscdaTgVVxooUuFBTmYpY3ofIYyPiTYlA6XrId63SoxQhtNKsP/w igT6VoCzaHfKWHwgbzo4tqezU+47cNP8YkfGOTH7YnI60ol7NGea0DhhtT5Vrg== ARC-Authentication-Results: i=1; aspmx1.migadu.com; dkim=pass header.d=posteo.net header.s=2017 header.b=b3fW7Igp; spf=pass (aspmx1.migadu.com: domain of "emacs-orgmode-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="emacs-orgmode-bounces+larch=yhetil.org@gnu.org"; dmarc=pass (policy=none) header.from=posteo.net Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1qLhGx-0004sP-9Q; Tue, 18 Jul 2023 05:45:35 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1qLhGt-0004ro-4O for emacs-orgmode@gnu.org; Tue, 18 Jul 2023 05:45:31 -0400 Received: from mout01.posteo.de ([185.67.36.65]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1qLhGp-0000Vy-13 for emacs-orgmode@gnu.org; Tue, 18 Jul 2023 05:45:30 -0400 Received: from submission (posteo.de [185.67.36.169]) by mout01.posteo.de (Postfix) with ESMTPS id 2C996240028 for ; Tue, 18 Jul 2023 11:45:23 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=posteo.net; s=2017; t=1689673523; bh=88hlSv2IKx0cmD0N6M6VE3EACb62VRnps6YtNCAnn04=; h=From:To:Cc:Subject:Date:Message-ID:MIME-Version:From; b=b3fW7Igp4IBbWCyT9a77COaj3IBMItVPFCK9Ai/XfSRvxDiY5wElZFZHn+ykeA0ke NZwTrYEy+KbJPX11E4VrV1tBT8ip93x6VRQ5QHTbA38FopJuEajmTqezMcNPCo1B7d gpe2usEDYNjA8wRxo1uVfdgorKrJuieLBsq8rb8gwQjulBUAUlj1Kza3oB6PWFCiCX E0YI5Hjn96bzdvdrZDSFBdIh0vbd0V1td1pDpUQo+tY+z6CcT44pyESYeS033Eq/y8 NBmqeI19FHFjhPOBRbPLB9ju8e32205b1DpPQjRvoVntvp+0G0MZagjmCsEsAn8nnN LKzh5KXHWm3SA== Received: from customer (localhost [127.0.0.1]) by submission (posteo.de) with ESMTPSA id 4R4vFZ2FPmz6v09; Tue, 18 Jul 2023 11:45:22 +0200 (CEST) From: Ihor Radchenko To: Tom Gillespie Cc: Max Nikulin , emacs-orgmode@gnu.org, Timothy , Bastien Subject: Re: Org markup and non-ASCII punctuation (was: org parser and priorities of inline elements) In-Reply-To: References: <87o86mw86r.fsf@localhost> <87fsrxkahq.fsf@nicolasgoaziou.fr> <87fsrxa1j5.fsf@localhost> <878rxoa6lk.fsf@localhost> <87tug93b2a.fsf@localhost> <87y25l8wvs.fsf@nicolasgoaziou.fr> <87r1bd39ny.fsf@localhost> <8735nsv9qo.fsf@nicolasgoaziou.fr> <87mtm09xzf.fsf@localhost> <87zgq02ueq.fsf@nicolasgoaziou.fr> <87h7c89rqr.fsf@localhost> <874k86y997.fsf@nicolasgoaziou.fr> <87v90lzwkm.fsf@localhost> <874jm2kb7x.fsf@localhost> <87ttu13j08.fsf@localhost> Date: Tue, 18 Jul 2023 09:45:32 +0000 Message-ID: <87pm4pv9hf.fsf@localhost> MIME-Version: 1.0 Content-Type: text/plain Received-SPF: pass client-ip=185.67.36.65; envelope-from=yantar92@posteo.net; helo=mout01.posteo.de X-Spam_score_int: -43 X-Spam_score: -4.4 X-Spam_bar: ---- X-Spam_report: (-4.4 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_MED=-2.3, RCVD_IN_MSPIKE_H5=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-orgmode@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+larch=yhetil.org@gnu.org Sender: emacs-orgmode-bounces+larch=yhetil.org@gnu.org X-Migadu-Country: US X-Migadu-Flow: FLOW_IN X-Migadu-Spam-Score: -5.18 X-Spam-Score: -5.18 X-Migadu-Queue-Id: 0E72C42459 X-Migadu-Scanner: mx1.migadu.com X-TUID: MmVrIzkPxv4e Tom Gillespie writes: >> We might probably generalize to >> PRE = Zs Zl Pc Pd Ps Pi ' " >> POST = Zs Zl Pc Pd Pe Pf . ; : ! ? ' " \ [ > > If this works I think it is reasonable. We might want to > specify what to do in cases where an org implementation > might not fully support unicode, Just fall back to ASCII subset? If the implementation does not support unicode, it probably cannot properly work with UTF-encoded documents anyway. > ...and might want to do a > review of related issues in syntax with respect to ascii > vs unicode, because iirc there is some ambiguity in > the current syntax doc. > For example, I'm pretty sure that I'm mixing and matching > unicode and ascii whitespace in the tokenizer I have in Racket. Feel free to open new bug reports about such ambiguities. -- Ihor Radchenko // yantar92, Org mode contributor, Learn more about Org mode at . Support Org development at , or support my work at