From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Colin Fraizer" Subject: question about org-latex-regexps Date: Mon, 27 Feb 2012 12:32:48 -0500 Message-ID: <00c101ccf575$d3c20b70$7b462250$@cfraizer.com> Mime-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit Return-path: Received: from eggs.gnu.org ([208.118.235.92]:35671) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1S24RX-0002dV-ST for emacs-orgmode@gnu.org; Mon, 27 Feb 2012 12:33:03 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1S24RW-0005Je-Ts for emacs-orgmode@gnu.org; Mon, 27 Feb 2012 12:32:59 -0500 Received: from caibbdcaaaaf.dreamhost.com ([208.113.200.5]:38722 helo=homiemail-a55.g.dreamhost.com) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1S24RW-0005JH-Kx for emacs-orgmode@gnu.org; Mon, 27 Feb 2012 12:32:58 -0500 Received: from homiemail-a55.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a55.g.dreamhost.com (Postfix) with ESMTP id 9CD3012C0D8 for ; Mon, 27 Feb 2012 09:32:56 -0800 (PST) Received: from colinfhp (wks201.inter-intelli.com [209.43.1.201]) (Authenticated sender: orgmode@cfraizer.com) by homiemail-a55.g.dreamhost.com (Postfix) with ESMTPA id 4F8CA12C048 for ; Mon, 27 Feb 2012 09:32:56 -0800 (PST) Content-Language: en-us List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org The variable org-latex-regexps has an element ("$" "\\([^$]\\|^\\)\\(\\(\\$\\([^ \r\n,;.$][^$\n\r]*?\\(\n[^$\n\r]*?\\)\\{0,2\\}[^ \r\n,.$]\\)\\$\\)\\)\\([- .,?;:'\")\000]\\|$\\)" 2 nil) The 6th nested subexpression (starts with "\\([-") prevents conversion of some latex math-mode stuff. Is there a reason for that subexpression to be so restrictive? Because I mostly write UTF-8 org-mode documents, I often have "$$" followed immediately by an EN DASH or EM DASH (U+2013 or U+2014). Is there a reason why those characters could not be inserted in that character class? That is, could I change that subexpression to be "\\([--- .,?;:'\")\000]\\|$\\)" ? [I've tried it with a couple of files and didn't notice any problems. Thanks, --Colin