From mboxrd@z Thu Jan  1 00:00:00 1970
From: Christopher Genovese <genovese@cmu.edu>
Subject: Re: full parser implementation for tag queries (parentheses, fast
 heading match, and more)
Date: Sat, 4 Aug 2012 06:07:18 -0400
Message-ID: <CAPum5FgdJzb8vnanwhS7WD5BCVHCY94jX-7RiXp=WesBrviiEQ@mail.gmail.com>
References: <CAPum5FhMWm0rsHyk=VB4EoiPAqfFMxw37_znDFsRX+rE=9My3Q@mail.gmail.com>
Mime-Version: 1.0
Content-Type: multipart/alternative; boundary=047d7b10d1f789ea4204c66dd08e
Return-path: <emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org>
Received: from eggs.gnu.org ([208.118.235.92]:47579)
	by lists.gnu.org with esmtp (Exim 4.71)
	(envelope-from <genovese.cr@gmail.com>) id 1SxbGq-0008QJ-1j
	for emacs-orgmode@gnu.org; Sat, 04 Aug 2012 06:07:46 -0400
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
	(envelope-from <genovese.cr@gmail.com>) id 1SxbGm-0003mk-LT
	for emacs-orgmode@gnu.org; Sat, 04 Aug 2012 06:07:43 -0400
Received: from mail-pb0-f41.google.com ([209.85.160.41]:43372)
	by eggs.gnu.org with esmtp (Exim 4.71)
	(envelope-from <genovese.cr@gmail.com>) id 1SxbGm-0003mZ-4v
	for emacs-orgmode@gnu.org; Sat, 04 Aug 2012 06:07:40 -0400
Received: by pbbrp2 with SMTP id rp2so2984561pbb.0
	for <emacs-orgmode@gnu.org>; Sat, 04 Aug 2012 03:07:39 -0700 (PDT)
In-Reply-To: <CAPum5FhMWm0rsHyk=VB4EoiPAqfFMxw37_znDFsRX+rE=9My3Q@mail.gmail.com>
List-Id: "General discussions about Org-mode." <emacs-orgmode.gnu.org>
List-Unsubscribe: <https://lists.gnu.org/mailman/options/emacs-orgmode>,
	<mailto:emacs-orgmode-request@gnu.org?subject=unsubscribe>
List-Archive: <http://lists.gnu.org/archive/html/emacs-orgmode>
List-Post: <mailto:emacs-orgmode@gnu.org>
List-Help: <mailto:emacs-orgmode-request@gnu.org?subject=help>
List-Subscribe: <https://lists.gnu.org/mailman/listinfo/emacs-orgmode>,
	<mailto:emacs-orgmode-request@gnu.org?subject=subscribe>
Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org
Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org
To: emacs-orgmode@gnu.org

--047d7b10d1f789ea4204c66dd08e
Content-Type: text/plain; charset=ISO-8859-1

A small addendum:

Right after I posted (of course!), I noticed both a small mistake and an
opportunity for
simplification, relating to detecting and processing the todo expressions
after a /.
Specifically, the approximate fix I proposed to the bug in the 7.8 code is
insufficient
to handle regexp matching in todo expressions. The full solution using the
new function org-find-todo-query is needed in the code as I have it, and
that can just
be plugged in instead of the string-match. (See Note h in the original
post, specifically
the string-match given there, and org-find-todo-query.)

But I realized that all that is unnecessary as the todo processing can be
easily built
into the new parser. I've mostly done that just now and will test and post
an update
Saturday, er...today.

Sorry to muddy the waters by not catching that earlier, and sorrier still
if I'm rambling.
Off to bed...

 -- Christopher


On Sat, Aug 4, 2012 at 3:50 AM, Christopher Genovese <genovese@cmu.edu>wrote:

> I am writing an application layer on top of org that uses the
> entry mapping API, but I needed both negation of complex
> selections and heading searches. Because the current tag query
> parser does not handle parenthesized expressions, it does not
> allow negating complex queries. At first, I wrote a workaround
> solution that mimics/specializes the mapping API, but that
> approach seemed inelegant and harder to maintain.
>
> So instead I implemented a full parser for tag queries with a
> number of useful features (see the labeled Notes at the bottom
> for further comments on these features):
>
>   1. Parenthesized expressions to arbitrary depth are allowed.
>   2. A '-' can be used to negate a parenthesized term.             [Note a]
>   3. Regex's in {} can contain braces escaped by doubling: {{ }}.  [Note b]
>   4. Supports fast property search on HEADING and PRIORITY.        [Note c]
>   5. Handles hyphens in property names properly.                   [Note
> d,h]
>   6. Allows only the proper comparison operators, including ==.    [Note
> e,h]
>   7. Allows spaces around operators and terms for readability.     [Note f]
>   8. Matchers use the original expression order; not a big
>      deal, but free.
>   9. The error messages during parsing are reasonably helpful.
>   10. Several bug fixes and a cleaner `org-make-tags-matcher'.     [Note h]
>
> I'm submitting the code for your consideration, with the
> goal of eventually incorporating this into org.el. I would be
> happy to hear any comments or suggestions you have. As I'll describe
> below, this involves relatively minor changes to two existing
> functions and adding a few new support functions. I've attached two
> files org-tag-query-parse.el (the code) and tag-query-tests.el (a
> collection of tests built on a simple framework). I've also
> put the files in http://www.stat.cmu.edu/~genovese/emacs/. The
> comments in both files will I hope be helpful.
>
> At the risk of going on too long, I'd like to add a few comments
> about the code and tests. First, the two existing functions that
> are affected in the code are `org-make-tags-matcher' and
> `org-scan-tags'. In the new version of the former, I've extracted
> out both kinds of query parsing, leading to a shorter and cleaner
> function. The new version of the latter differs in only a couple
> *very minor* places that capture two values that were already
> being computed anyway (see the diff reproduced in the comments).
> Btw, I'm working from the 7.8.11 code.
>
> Loading org-tag-query-parse.el does not change the original
> functions. Instead, I've added a `-NEW' to the names of these
> functions and saved the originals also with a `-ORIGINAL' added.
> After loading the file, you can choose a version to try by doing
>
>     (org-tmp-use-tag-parser 'new)
> and
>     (org-tmp-use-tag-parser 'original)
>
> or do (org-tmp-use-tag-parser) to toggle between versions.
> You can also just use the names with suffixes directly.
> I'd also suggest byte-compiling the file.
>
> I think the place to start looking at the code is the new version
> of `org-make-tags-matcher'. The main entry function for the new
> parser is `org-tag-query-parse', though the real workhorse is
> actually the function `org-tag-query-parse-1'. There is also a
> new function `org-todo-query-parse' which just extracts the
> existing todo matching method. (I didn't do anything with that
> method as the manual makes it clear that it is of secondary
> importance.) I think the modularity here makes
> `org-make-tags-matcher' and each separate parser easier to read
> and understand.
>
> The other substantial piece (in terms of lines of code) is a utility
> macro `org-match-cond' that is used throughout and makes the main
> parser much more readable IMHO. Admittedly, I went a bit
> overboard in optimizing it; the first version worked fine
> but this one produces really nice code. I'd suggest ignoring this
> code (in section "Parsing utility for readable matchers") on
> first pass. The docstring is pretty complete, and its use is more
> or less self-explanatory. Most of its work is done at compile time.
>
> To run the tests, load org-tag-query-parse.el and tag-query-tests.el
> and do
>
>    (tag-test-run :results) ; use :summary for a brief summary of all runs
>    (tag-test-other-tests)  ; miscellaneous other tests, including scanning
>
> or name individual suites. They are at the moment:
>
>    (tag-test-run :results 'org-comparison-1)  ; or use :summary
>    (tag-test-run :results 'org-comparison-2)
>    (tag-test-run :results 'match-results-1)
>    (tag-test-run :results 'match-results-2)
>    (tag-test-run :results 'should-error-1)
>
> If you have other ideas for tests or find any bugs, please let me
> know. Sorry for the homegrown framework; it just sort of grew and
> then I was too tired to rewrite the tests. One complication here
> is that the original and new algorithms produce different term
> orders and use a few different functions. The function
> tag-test-transform transforms original results to the new
> algorithms conventions, but it does not handle PRIORITY or
> HEADING matches at the moment. Use the tree form of the tess (see
> match-results-1 for example) on these. Btw, I've run the tests on
> GNU Emacs 23.2 and 24.1 (running on OS X lion).
>
> Notes:
>    a. There is no need to introduce a new character such as ! for
>       negation because the semantics of the - are clear and are
>       consistent with its use for tags. A - binds more tightly
>       than & which in turn binds more tightly than |. A +
>       selector can also be used for positive selection of a
>       parenthesized term but it is equivalent to using no
>       selector, just as for tags.
>
>    b. Because \'s are so heavily used in regex's and because they
>       have to be doubled in strings, using \'s for an additional
>       escape layer would be messy, ambiguous, and hard to read.
>       Only the {}'s need to be escaped and the doubling escapes
>       {{ -> { and }} -> } are simple, readable, and fast to
>       parse. For example: "+{abc\\{{3,7\\}}}" gives the regex
>       "abc\\{3,7\\}". Parity makes correctness clear at a glance.
>
>    c. Because headline (and priority) searches can be useful and
>       powerful, and because the information on those fields is
>       *already processed* in `org-scan-tags', we get those
>       special searches *essentially for free*, requiring only two
>       minor changes to `org-scan-tags'. See the unified diff in
>       comments. The special PRIORITY property already exists; I
>       added the special HEADING property for these purposes. I'm
>       open to changing the name of course, but I do think the
>       feature is both useful and elegant. (I'm using it in my
>       application, for instance.)
>
>    d. I did not see it in the manual, but I think that property names
>       with hyphens should have these \-escaped -'s in the query
>       string, with the escaping slashes removed in the produced
>       matcher. This is not currently done, but the new version does.
>       See Note h for details.
>
>    e. It seems desirable to support both = and == as equality operators
>       since the latter is so common by habit. The new version allows
>       this explicitly. The original version does as well, but the
>       regex for the comparison operator also allows other operators
>       <<, ><, >>, =>, and >= as well, which can produce bad matchers.
>       See Note h for details.
>
>    f. Currently, spaces are ignored around &, |, the implicit & between
>       terms, around the comparison operators in property searches,
>       and around +/- selectors. Spaces are not ignored inside {}'s
>       for a regexp match.
>
>    g. The current code also allows +/- selectors before property
>       comparisons. I don't really like this because
>       +PROP<>"something" and -PROP="something" have the same
>       meaning but look very different. But the new code does
>       support this. As a side note, there's really no need for
>       the & characters as +/- serve the and/and-not function
>       completely. But again, no prob.
>
>    h. A few bugs detected in the 7.8.11 code:
>
>       + Faulty test for todo matcher in org-make-tags-matcher
>         (string-match "/+" match)
>
>         Ex: (org-make-tags-matcher "PROP={^\\s-*// .*$}") produces
>         an erroneous matcher:
>
>             ("PROP={^\\s-*// .*$}" progn
>              (setq org-cached-props nil)
>              (member "PROP" tags-list))
>
>         For all practical purposes it will be enough to do:
>
>          (string-match "\\(/\\(!\\)?\\s-*\\)[^{}\"]*$" match)
>
>         instead of the current test in org-make-tags-matcher.
>         This works as long as the TODO keywords do not contain a
>         right brace or quotation marks. (In most other cases, the
>         new parser should give an error message at parse time.)
>
>         A technicality: this is /not/ a complete solution because
>         arbitrary strings can be TODO keywords. For instance,
>         both PROP={/!} and PROP="/!{/!}" are valid TODO keywords
>         (it works!) *and* valid property comparisons. So, a pattern
>         alone is insufficient. We want to find the first slash
>         that is not enclosed in {}'s or ""'s; if found, a todo
>         match is needed. The function `org-find-todo-query' does
>         this and (org-find-todo-query match) can be plugged in
>         directly replacing the above (string-match ...) in then
>         new `org-make-tags-matcher'.
>
>         But because the todo parsing uses {}'s for regex matches,
>         TODO keywords with {}'s are ignored anyway. So there's
>         no need to go beyond the fixed string-match above.
>         The function `org-todo-query-parse', which handles todo
>         parsing in the new version, makes this explicit.
>
>       + Property names with -'s are not handled properly (cf. Note d)
>
>         Specifically, the escapes are not removed. Example:
>         (org-make-tags-matcher "PROP\\-WITH\\-HYPHENS=2")
>         produces
>
>         ("PROP\\-WITH\\-HYPHENS=2" and
>          (progn
>          (setq org-cached-props nil)
>          (=
>           (string-to-number
>            (or (org-cached-entry-get nil "PROP\\-WITH\\-HYPHENS")
>            ""))
>           2))
>          t)
>
>         The original code /does/ instead remove -'s from tag
>         names, which shouldn't have them anyway. I suspect that
>         this was intended for property names rather than tag
>         names. The new version fixes up property names but does
>         not allow -'s in tags.
>
>       + Incorrect comparison operators allowed (cf. Note e)
>
>         The regular expression used is "[<=>]\\{1,2\\}" is used to
>         detect the comparison operators. But this can produce bad
>         matchers that fail opaquely at match time rather than
>         giving an appropriate error message at parse time.
>
>         Ex: (org-make-tags-matcher "P<<2") produces
>
>          ("P<<2" and
>           (progn
>             (setq org-cached-props nil)
>             (nil
>              (string-to-number (or (org-cached-entry-get nil "P") "")) 2))
>           t)
>
>         This is fixed in the new version and delivers an error
>         message at parse time.
>
>       + missing org-re (line 7179 in org.el) with posix classes
>
>         Minor consistency issue.  This line does not occur in the new
>         code.
>
>
> Thanks and regards,
>
>    Christopher Genovese
>
>
>

--047d7b10d1f789ea4204c66dd08e
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

A small addendum:<br><br>Right after I posted (of course!), I noticed both =
a small mistake and an opportunity for<br>simplification, relating to detec=
ting and processing the todo expressions after a /.<br>Specifically, the ap=
proximate fix I proposed to the bug in the 7.8 code is insufficient<br>

to handle regexp matching in todo expressions. The full solution using the<=
br>new function org-find-todo-query is needed in the code as I have it, and=
 that can just<br>be plugged in instead of the string-match. (See Note h in=
 the original post, specifically <br>

the string-match given there, and org-find-todo-query.) <br><br>But I reali=
zed that all that is unnecessary as the todo processing can be easily built=
 <br>into the new parser. I&#39;ve mostly done that just now and will test =
and post an update <br>

Saturday, er...today. <br><br>Sorry to muddy the waters by not catching tha=
t earlier, and sorrier still if I&#39;m rambling.=A0 <br>Off to bed...<br><=
br>=A0-- Christopher<br><br><br><div class=3D"gmail_quote">On Sat, Aug 4, 2=
012 at 3:50 AM, Christopher Genovese <span dir=3D"ltr">&lt;<a href=3D"mailt=
o:genovese@cmu.edu" target=3D"_blank">genovese@cmu.edu</a>&gt;</span> wrote=
:<br>

<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><span style=3D"font-family:courier new,monos=
pace">I am writing an application layer on top of org that uses the</span><=
br style=3D"font-family:courier new,monospace">

<span style=3D"font-family:courier new,monospace">entry mapping API, but I =
needed both negation of complex</span><br style=3D"font-family:courier new,=
monospace">
<span style=3D"font-family:courier new,monospace">selections and heading se=
arches. Because the current tag query</span><br style=3D"font-family:courie=
r new,monospace"><span style=3D"font-family:courier new,monospace">parser d=
oes not handle parenthesized expressions, it does not</span><br style=3D"fo=
nt-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">allow negating complex qu=
eries. At first, I wrote a workaround</span><br style=3D"font-family:courie=
r new,monospace"><span style=3D"font-family:courier new,monospace">solution=
 that mimics/specializes the mapping API, but that</span><br style=3D"font-=
family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">approach seemed inelegant=
 and harder to maintain.</span><br style=3D"font-family:courier new,monospa=
ce"><br style=3D"font-family:courier new,monospace"><span style=3D"font-fam=
ily:courier new,monospace">So instead I implemented a full parser for tag q=
ueries with a</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">number of useful features=
 (see the labeled Notes at the bottom</span><br style=3D"font-family:courie=
r new,monospace"><span style=3D"font-family:courier new,monospace">for furt=
her comments on these features):</span><br style=3D"font-family:courier new=
,monospace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">=A0 1. Parenthesized expressions to arbitrary depth =
are allowed.</span><br style=3D"font-family:courier new,monospace"><span st=
yle=3D"font-family:courier new,monospace">=A0 2. A &#39;-&#39; can be used =
to negate a parenthesized term.=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 [Note a=
]</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0 3. Regex&#39;s in {} =
can contain braces escaped by doubling: {{ }}.=A0 [Note b]</span><br style=
=3D"font-family:courier new,monospace"><span style=3D"font-family:courier n=
ew,monospace">=A0 4. Supports fast property search on HEADING and PRIORITY.=
=A0=A0=A0=A0=A0=A0=A0 [Note c]</span><br style=3D"font-family:courier new,m=
onospace">


<span style=3D"font-family:courier new,monospace">=A0 5. Handles hyphens in=
 property names properly.=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0 [Note d,h]</span><br style=3D"font-family:courier new,monospace"><sp=
an style=3D"font-family:courier new,monospace">=A0 6. Allows only the prope=
r comparison operators, including =3D=3D.=A0=A0=A0 [Note e,h]</span><br sty=
le=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0 7. Allows spaces arou=
nd operators and terms for readability.=A0=A0=A0=A0 [Note f]</span><br styl=
e=3D"font-family:courier new,monospace"><span style=3D"font-family:courier =
new,monospace">=A0 8. Matchers use the original expression order; not a big=
</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0 deal, but fr=
ee.</span><br style=3D"font-family:courier new,monospace"><span style=3D"fo=
nt-family:courier new,monospace">=A0 9. The error messages during parsing a=
re reasonably helpful.</span><br style=3D"font-family:courier new,monospace=
">


<span style=3D"font-family:courier new,monospace">=A0 10. Several bug fixes=
 and a cleaner `org-make-tags-matcher&#39;.=A0=A0=A0=A0 [Note h]</span><br =
style=3D"font-family:courier new,monospace"><br style=3D"font-family:courie=
r new,monospace">


<span style=3D"font-family:courier new,monospace">I&#39;m submitting the co=
de for your consideration, with the <br>goal of eventually incorporating th=
is into org.el. I would be </span><br style=3D"font-family:courier new,mono=
space">


<span style=3D"font-family:courier new,monospace">happy to hear any comment=
s or suggestions you have. As I&#39;ll describe</span><br style=3D"font-fam=
ily:courier new,monospace"><span style=3D"font-family:courier new,monospace=
">below, this involves relatively minor changes to two existing</span><br s=
tyle=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">functions and adding a fe=
w new support functions. I&#39;ve attached two</span><br style=3D"font-fami=
ly:courier new,monospace"><span style=3D"font-family:courier new,monospace"=
>files org-tag-query-parse.el (the code) and tag-query-tests.el (a</span><b=
r style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">collection of tests built=
 on a simple framework). I&#39;ve also</span><br style=3D"font-family:couri=
er new,monospace"><span style=3D"font-family:courier new,monospace">put the=
 files in <a href=3D"http://www.stat.cmu.edu/%7Egenovese/emacs/" target=3D"=
_blank">http://www.stat.cmu.edu/~genovese/emacs/</a>. The</span><br style=
=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">comments in both files wi=
ll I hope be helpful.</span><br style=3D"font-family:courier new,monospace"=
><br style=3D"font-family:courier new,monospace"><span style=3D"font-family=
:courier new,monospace">At the risk of going on too long, I&#39;d like to a=
dd a few comments</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">about the code and tests.=
 First, the two existing functions that</span><br style=3D"font-family:cour=
ier new,monospace"><span style=3D"font-family:courier new,monospace">are af=
fected in the code are `org-make-tags-matcher&#39; and</span><br style=3D"f=
ont-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">`org-scan-tags&#39;. In t=
he new version of the former, I&#39;ve extracted</span><br style=3D"font-fa=
mily:courier new,monospace"><span style=3D"font-family:courier new,monospac=
e">out both kinds of query parsing, leading to a shorter and cleaner</span>=
<br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">function. The new version=
 of the latter differs in only a couple</span><br style=3D"font-family:cour=
ier new,monospace"><span style=3D"font-family:courier new,monospace">*very =
minor* places that capture two values that were already</span><br style=3D"=
font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">being computed anyway (se=
e the diff reproduced in the comments).</span><br style=3D"font-family:cour=
ier new,monospace"><span style=3D"font-family:courier new,monospace">Btw, I=
&#39;m working from the 7.8.11 code.</span><br style=3D"font-family:courier=
 new,monospace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">Loading org-tag-query-parse.el does not change the o=
riginal</span><br style=3D"font-family:courier new,monospace"><span style=
=3D"font-family:courier new,monospace">functions. Instead, I&#39;ve added a=
 `-NEW&#39; to the names of these</span><br style=3D"font-family:courier ne=
w,monospace">


<span style=3D"font-family:courier new,monospace">functions and saved the o=
riginals also with a `-ORIGINAL&#39; added.</span><br style=3D"font-family:=
courier new,monospace"><span style=3D"font-family:courier new,monospace">Af=
ter loading the file, you can choose a version to try by doing</span><br st=
yle=3D"font-family:courier new,monospace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">=A0=A0=A0 (org-tmp-use-tag-parser &#39;new)</span><b=
r style=3D"font-family:courier new,monospace"><span style=3D"font-family:co=
urier new,monospace">and</span><br style=3D"font-family:courier new,monospa=
ce">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0 (org-tmp-use-ta=
g-parser &#39;original)</span><br style=3D"font-family:courier new,monospac=
e"><br style=3D"font-family:courier new,monospace"><span style=3D"font-fami=
ly:courier new,monospace">or do (org-tmp-use-tag-parser) to toggle between =
versions.</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">You can also just use the=
 names with suffixes directly. </span><br style=3D"font-family:courier new,=
monospace"><span style=3D"font-family:courier new,monospace">I&#39;d also s=
uggest byte-compiling the file.</span><br style=3D"font-family:courier new,=
monospace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">I think the place to start looking at the code is th=
e new version</span><br style=3D"font-family:courier new,monospace"><span s=
tyle=3D"font-family:courier new,monospace">of `org-make-tags-matcher&#39;. =
The main entry function for the new</span><br style=3D"font-family:courier =
new,monospace">


<span style=3D"font-family:courier new,monospace">parser is `org-tag-query-=
parse&#39;, though the real workhorse is</span><br style=3D"font-family:cou=
rier new,monospace"><span style=3D"font-family:courier new,monospace">actua=
lly the function `org-tag-query-parse-1&#39;. There is also a</span><br sty=
le=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">new function `org-todo-qu=
ery-parse&#39; which just extracts the</span><br style=3D"font-family:couri=
er new,monospace"><span style=3D"font-family:courier new,monospace">existin=
g todo matching method. (I didn&#39;t do anything with that</span><br style=
=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">method as the manual make=
s it clear that it is of secondary</span><br style=3D"font-family:courier n=
ew,monospace"><span style=3D"font-family:courier new,monospace">importance.=
) I think the modularity here makes</span><br style=3D"font-family:courier =
new,monospace">


<span style=3D"font-family:courier new,monospace">`org-make-tags-matcher=
9; and each separate parser easier to read</span><br style=3D"font-family:c=
ourier new,monospace"><span style=3D"font-family:courier new,monospace">and=
 understand.</span><br style=3D"font-family:courier new,monospace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">The other substantial piece (in terms of lines of co=
de) is a utility</span><br style=3D"font-family:courier new,monospace"><spa=
n style=3D"font-family:courier new,monospace">macro `org-match-cond&#39; th=
at is used throughout and makes the main</span><br style=3D"font-family:cou=
rier new,monospace">


<span style=3D"font-family:courier new,monospace">parser much more readable=
 IMHO. Admittedly, I went a bit</span><br style=3D"font-family:courier new,=
monospace"><span style=3D"font-family:courier new,monospace">overboard in o=
ptimizing it; the first version worked fine</span><br style=3D"font-family:=
courier new,monospace">


<span style=3D"font-family:courier new,monospace">but this one produces rea=
lly nice code. I&#39;d suggest ignoring this</span><br style=3D"font-family=
:courier new,monospace"><span style=3D"font-family:courier new,monospace">c=
ode (in section &quot;Parsing utility for readable matchers&quot;) on</span=
><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">first pass. The docstring=
 is pretty complete, and its use is more</span><br style=3D"font-family:cou=
rier new,monospace"><span style=3D"font-family:courier new,monospace">or le=
ss self-explanatory.</span> <span style=3D"font-family:courier new,monospac=
e">Most of its work is done at compile time.</span><br style=3D"font-family=
:courier new,monospace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">To run the tests, load org-tag-query-parse.el and ta=
g-query-tests.el</span><br style=3D"font-family:courier new,monospace"><spa=
n style=3D"font-family:courier new,monospace">and do</span><br style=3D"fon=
t-family:courier new,monospace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">=A0=A0 (tag-test-run :results) ; use :summary for a =
brief summary of all runs</span><br style=3D"font-family:courier new,monosp=
ace">

<span style=3D"font-family:courier new,monospace">=A0=A0 (tag-test-other-te=
sts)=A0 ; miscellaneous other tests, including scanning</span><br style=3D"=
font-family:courier new,monospace">
<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">or name individual suites. They are at the moment:</=
span><br style=3D"font-family:courier new,monospace"><br style=3D"font-fami=
ly:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0 (tag-test-run :res=
ults &#39;org-comparison-1)=A0 ; or use :summary</span><br style=3D"font-fa=
mily:courier new,monospace"><span style=3D"font-family:courier new,monospac=
e">=A0=A0 (tag-test-run :results &#39;org-comparison-2)</span><br style=3D"=
font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0 (tag-test-run :res=
ults &#39;match-results-1)</span><br style=3D"font-family:courier new,monos=
pace"><span style=3D"font-family:courier new,monospace">=A0=A0 (tag-test-ru=
n :results &#39;match-results-2)</span><br style=3D"font-family:courier new=
,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0 (tag-test-run :res=
ults &#39;should-error-1)</span><br style=3D"font-family:courier new,monosp=
ace"><br style=3D"font-family:courier new,monospace"><span style=3D"font-fa=
mily:courier new,monospace">If you have other ideas for tests or find any b=
ugs, please let me</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">know. Sorry for the homeg=
rown framework; it just sort of grew and</span><br style=3D"font-family:cou=
rier new,monospace"><span style=3D"font-family:courier new,monospace">then =
I was too tired to rewrite the tests. One complication here</span><br style=
=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">is that the original and =
new algorithms produce different term</span><br style=3D"font-family:courie=
r new,monospace"><span style=3D"font-family:courier new,monospace">orders a=
nd use a few different functions. The function</span><br style=3D"font-fami=
ly:courier new,monospace">


<span style=3D"font-family:courier new,monospace">tag-test-transform transf=
orms original results to the new</span><br style=3D"font-family:courier new=
,monospace"><span style=3D"font-family:courier new,monospace">algorithms co=
nventions, but it does not handle PRIORITY or</span><br style=3D"font-famil=
y:courier new,monospace">


<span style=3D"font-family:courier new,monospace">HEADING matches at the mo=
ment. Use the tree form of the tess (see</span><br style=3D"font-family:cou=
rier new,monospace"><span style=3D"font-family:courier new,monospace">match=
-results-1 for example) on these. Btw, I&#39;ve run the tests on</span><br =
style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">GNU Emacs 23.2 and 24.1 (=
running on OS X lion). </span><br style=3D"font-family:courier new,monospac=
e"><br style=3D"font-family:courier new,monospace"><span style=3D"font-fami=
ly:courier new,monospace">Notes:</span><br style=3D"font-family:courier new=
,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0 a. There is no nee=
d to introduce a new character such as ! for</span><br style=3D"font-family=
:courier new,monospace"><span style=3D"font-family:courier new,monospace">=
=A0=A0=A0=A0=A0 negation because the semantics of the - are clear and are</=
span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 consisten=
t with its use for tags. A - binds more tightly</span><br style=3D"font-fam=
ily:courier new,monospace"><span style=3D"font-family:courier new,monospace=
">=A0=A0=A0=A0=A0 than &amp; which in turn binds more tightly than |. A +</=
span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 selector =
can also be used for positive selection of a</span><br style=3D"font-family=
:courier new,monospace"><span style=3D"font-family:courier new,monospace">=
=A0=A0=A0=A0=A0 parenthesized term but it is equivalent to using no</span><=
br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 selector,=
 just as for tags.</span><br style=3D"font-family:courier new,monospace"><s=
pan style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 </span><br =
style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0 b. Because \&#39;s=
 are so heavily used in regex&#39;s and because they</span><br style=3D"fon=
t-family:courier new,monospace"><span style=3D"font-family:courier new,mono=
space">=A0=A0=A0=A0=A0 have to be doubled in strings, using \&#39;s for an =
additional</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 escape la=
yer would be messy, ambiguous, and hard to read.</span><br style=3D"font-fa=
mily:courier new,monospace"><span style=3D"font-family:courier new,monospac=
e">=A0=A0=A0=A0=A0 Only the {}&#39;s need to be escaped and the doubling es=
capes</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 {{ -&gt; =
{ and }} -&gt; } are simple, readable, and fast to</span><br style=3D"font-=
family:courier new,monospace"><span style=3D"font-family:courier new,monosp=
ace">=A0=A0=A0=A0=A0 parse. For example: &quot;+{abc\\{{3,7\\}}}&quot; give=
s the regex</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 &quot;abc=
\\{3,7\\}&quot;. Parity makes correctness clear at a glance.</span><br styl=
e=3D"font-family:courier new,monospace"><span style=3D"font-family:courier =
new,monospace">=A0=A0=A0=A0=A0 </span><br style=3D"font-family:courier new,=
monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0 c. Because headlin=
e (and priority) searches can be useful and</span><br style=3D"font-family:=
courier new,monospace"><span style=3D"font-family:courier new,monospace">=
=A0=A0=A0=A0=A0 powerful, and because the information on those fields is</s=
pan><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 *already =
processed* in `org-scan-tags&#39;, we get those</span><br style=3D"font-fam=
ily:courier new,monospace"><span style=3D"font-family:courier new,monospace=
">=A0=A0=A0=A0=A0 special searches *essentially for free*, requiring only t=
wo</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 minor cha=
nges to `org-scan-tags&#39;. See the unified diff in</span><br style=3D"fon=
t-family:courier new,monospace"><span style=3D"font-family:courier new,mono=
space">=A0=A0=A0=A0=A0 comments. The special PRIORITY property already exis=
ts; I</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 added the=
 special HEADING property for these purposes. I&#39;m</span><br style=3D"fo=
nt-family:courier new,monospace"><span style=3D"font-family:courier new,mon=
ospace">=A0=A0=A0=A0=A0 open to changing the name of course, but I do think=
 the</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 feature i=
s both useful and elegant. (I&#39;m using it in my</span><br style=3D"font-=
family:courier new,monospace"><span style=3D"font-family:courier new,monosp=
ace">=A0=A0=A0=A0=A0 application, for instance.)</span><br style=3D"font-fa=
mily:courier new,monospace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">=A0=A0 d. I did not see it in the manual, but I thin=
k that property names</span><br style=3D"font-family:courier new,monospace"=
><span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 with hyp=
hens should have these \-escaped -&#39;s in the query</span><br style=3D"fo=
nt-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 string, w=
ith the escaping slashes removed in the produced</span><br style=3D"font-fa=
mily:courier new,monospace"><span style=3D"font-family:courier new,monospac=
e">=A0=A0=A0=A0=A0 matcher. This is not currently done, but the new version=
 does.</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 See Note =
h for details.</span><br style=3D"font-family:courier new,monospace"><br st=
yle=3D"font-family:courier new,monospace"><span style=3D"font-family:courie=
r new,monospace">=A0=A0 e. It seems desirable to support both =3D and =3D=
=3D as equality operators</span><br style=3D"font-family:courier new,monosp=
ace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 since the=
 latter is so common by habit. The new version allows</span><br style=3D"fo=
nt-family:courier new,monospace"><span style=3D"font-family:courier new,mon=
ospace">=A0=A0=A0=A0=A0 this explicitly. The original version does as well,=
 but the</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 regex for=
 the comparison operator also allows other operators</span><br style=3D"fon=
t-family:courier new,monospace"><span style=3D"font-family:courier new,mono=
space">=A0=A0=A0=A0=A0 &lt;&lt;, &gt;&lt;, &gt;&gt;, =3D&gt;, and &gt;=3D a=
s well, which can produce bad matchers.</span><br style=3D"font-family:cour=
ier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 See Note =
h for details.</span><br style=3D"font-family:courier new,monospace"><br st=
yle=3D"font-family:courier new,monospace"><span style=3D"font-family:courie=
r new,monospace">=A0=A0 f. Currently, spaces are ignored around &amp;, |, t=
he implicit &amp; between</span><br style=3D"font-family:courier new,monosp=
ace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 terms, ar=
ound the comparison operators in property searches,</span><br style=3D"font=
-family:courier new,monospace"><span style=3D"font-family:courier new,monos=
pace">=A0=A0=A0=A0=A0 and around +/- selectors. Spaces are not ignored insi=
de {}&#39;s</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 for a reg=
exp match. </span><br style=3D"font-family:courier new,monospace"><br style=
=3D"font-family:courier new,monospace"><span style=3D"font-family:courier n=
ew,monospace">=A0=A0 g. The current code also allows +/- selectors before p=
roperty</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 compariso=
ns. I don&#39;t really like this because</span><br style=3D"font-family:cou=
rier new,monospace"><span style=3D"font-family:courier new,monospace">=A0=
=A0=A0=A0=A0 +PROP&lt;&gt;&quot;something&quot; and -PROP=3D&quot;something=
&quot; have the same</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 meaning b=
ut look very different. But the new code does</span><br style=3D"font-famil=
y:courier new,monospace"><span style=3D"font-family:courier new,monospace">=
=A0=A0=A0=A0=A0 support this. As a side note, there&#39;s really no need fo=
r</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 the &amp;=
 characters as +/- serve the and/and-not function</span><br style=3D"font-f=
amily:courier new,monospace"><span style=3D"font-family:courier new,monospa=
ce">=A0=A0=A0=A0=A0 completely. But again, no prob.</span><br style=3D"font=
-family:courier new,monospace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">=A0=A0 h. A few bugs detected in the 7.8.11 code:</s=
pan><br style=3D"font-family:courier new,monospace"><br style=3D"font-famil=
y:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 + Faulty =
test for todo matcher in org-make-tags-matcher</span><br style=3D"font-fami=
ly:courier new,monospace"><span style=3D"font-family:courier new,monospace"=
>=A0=A0=A0=A0=A0=A0=A0 (string-match &quot;/+&quot; match)</span><br style=
=3D"font-family:courier new,monospace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 Ex: (org-make-tags-matcher &qu=
ot;PROP=3D{^\\s-*// .*$}&quot;) produces </span><br style=3D"font-family:co=
urier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 an =
erroneous matcher:</span><br style=3D"font-family:courier new,monospace"><b=
r style=3D"font-family:courier new,monospace"><span style=3D"font-family:co=
urier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 (&quot;PROP=3D{^\\s-=
*// .*$}&quot; progn</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0=A0=A0 (setq org-cached-props nil)</span><br style=3D"font-family:cou=
rier new,monospace"><span style=3D"font-family:courier new,monospace">=A0=
=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 (member &quot;PROP&quot; tags-list))</spa=
n><br style=3D"font-family:courier new,monospace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 For all practical purposes it =
will be enough to do:</span><br style=3D"font-family:courier new,monospace"=
><span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 </=
span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0 =
(string-match &quot;\\(/\\(!\\)?\\s-*\\)[^{}\&quot;]*$&quot; match)</span><=
br style=3D"font-family:courier new,monospace"><span style=3D"font-family:c=
ourier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 </span><br st=
yle=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 ins=
tead of the current test in org-make-tags-matcher.</span><br style=3D"font-=
family:courier new,monospace"><span style=3D"font-family:courier new,monosp=
ace">=A0=A0=A0=A0=A0=A0=A0 This works as long as the TODO keywords do not c=
ontain a</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 rig=
ht brace or quotation marks. (In most other cases, the</span><br style=3D"f=
ont-family:courier new,monospace"><span style=3D"font-family:courier new,mo=
nospace">=A0=A0=A0=A0=A0=A0=A0 new parser should give an error message at p=
arse time.)</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 </s=
pan><br style=3D"font-family:courier new,monospace"><span style=3D"font-fam=
ily:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 A technicality: this is /n=
ot/ a complete solution because</span><br style=3D"font-family:courier new,=
monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 arb=
itrary strings can be TODO keywords. For instance,</span><br style=3D"font-=
family:courier new,monospace"><span style=3D"font-family:courier new,monosp=
ace">=A0=A0=A0=A0=A0=A0=A0 both PROP=3D{/!} and PROP=3D&quot;/!{/!}&quot; a=
re valid TODO keywords</span><br style=3D"font-family:courier new,monospace=
">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 (it=
 works!) *and* valid property comparisons. So, a pattern</span><br style=3D=
"font-family:courier new,monospace"><span style=3D"font-family:courier new,=
monospace">=A0=A0=A0=A0=A0=A0=A0 alone is insufficient. We want to find the=
 first slash</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 tha=
t is not enclosed in {}&#39;s or &quot;&quot;&#39;s; if found, a todo</span=
><br style=3D"font-family:courier new,monospace"><span style=3D"font-family=
:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 match is needed. The function=
 `org-find-todo-query&#39; does</span><br style=3D"font-family:courier new,=
monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 thi=
s and (org-find-todo-query match) can be plugged in</span><br style=3D"font=
-family:courier new,monospace"><span style=3D"font-family:courier new,monos=
pace">=A0=A0=A0=A0=A0=A0=A0 directly replacing the above (string-match ...)=
 in then</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 new=
 `org-make-tags-matcher&#39;.</span><br style=3D"font-family:courier new,mo=
nospace"><span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=
=A0=A0 </span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 But=
 because the todo parsing uses {}&#39;s for regex matches,</span><br style=
=3D"font-family:courier new,monospace"><span style=3D"font-family:courier n=
ew,monospace">=A0=A0=A0=A0=A0=A0=A0 TODO keywords with {}&#39;s are ignored=
 anyway. So there&#39;s</span><br style=3D"font-family:courier new,monospac=
e">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 no =
need to go beyond the fixed string-match above.</span><br style=3D"font-fam=
ily:courier new,monospace"><span style=3D"font-family:courier new,monospace=
">=A0=A0=A0=A0=A0=A0=A0 The function `org-todo-query-parse&#39;, which hand=
les todo</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 par=
sing in the new version, makes this explicit.</span><br style=3D"font-famil=
y:courier new,monospace"><span style=3D"font-family:courier new,monospace">=
=A0=A0=A0=A0=A0=A0=A0 </span><br style=3D"font-family:courier new,monospace=
">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0 + Propert=
y names with -&#39;s are not handled properly (cf. Note d)</span><br style=
=3D"font-family:courier new,monospace"><span style=3D"font-family:courier n=
ew,monospace">=A0=A0=A0=A0=A0=A0=A0 </span><br style=3D"font-family:courier=
 new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 Spe=
cifically, the escapes are not removed. Example:</span><br style=3D"font-fa=
mily:courier new,monospace"><span style=3D"font-family:courier new,monospac=
e">=A0=A0=A0=A0=A0=A0=A0 (org-make-tags-matcher &quot;PROP\\-WITH\\-HYPHENS=
=3D2&quot;)</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 pro=
duces</span><br style=3D"font-family:courier new,monospace"><span style=3D"=
font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 </span><br style=
=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 (&q=
uot;PROP\\-WITH\\-HYPHENS=3D2&quot; and</span><br style=3D"font-family:cour=
ier new,monospace"><span style=3D"font-family:courier new,monospace">=A0=A0=
=A0=A0=A0=A0=A0=A0 (progn</span><br style=3D"font-family:courier new,monosp=
ace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0 =
(setq org-cached-props nil)</span><br style=3D"font-family:courier new,mono=
space"><span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=
=A0=A0 (=3D</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0=
=A0 (string-to-number</span><br style=3D"font-family:courier new,monospace"=
><span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0 (or (org-cached-entry-get nil &quot;PROP\\-WITH\\-HYPHENS&quot;)</sp=
an><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0 &quot;&quot;))</span><br style=3D"font-family:courier new,monospace"=
><span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0=
=A0 2))</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0 =
t)</span><br style=3D"font-family:courier new,monospace"><span style=3D"fon=
t-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 </span><br style=3D"f=
ont-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 The=
 original code /does/ instead remove -&#39;s from tag</span><br style=3D"fo=
nt-family:courier new,monospace"><span style=3D"font-family:courier new,mon=
ospace">=A0=A0=A0=A0=A0=A0=A0 names, which shouldn&#39;t have them anyway. =
I suspect that</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 thi=
s was intended for property names rather than tag</span><br style=3D"font-f=
amily:courier new,monospace"><span style=3D"font-family:courier new,monospa=
ce">=A0=A0=A0=A0=A0=A0=A0 names. The new version fixes up property names bu=
t does</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 not=
 allow -&#39;s in tags.</span><br style=3D"font-family:courier new,monospac=
e"><br style=3D"font-family:courier new,monospace"><span style=3D"font-fami=
ly:courier new,monospace">=A0=A0=A0=A0=A0 + Incorrect comparison operators =
allowed (cf. Note e)</span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 </s=
pan><br style=3D"font-family:courier new,monospace"><span style=3D"font-fam=
ily:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 The regular expression use=
d is &quot;[&lt;=3D&gt;]\\{1,2\\}&quot; is used to</span><br style=3D"font-=
family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 det=
ect the comparison operators. But this can produce bad</span><br style=3D"f=
ont-family:courier new,monospace"><span style=3D"font-family:courier new,mo=
nospace">=A0=A0=A0=A0=A0=A0=A0 matchers that fail opaquely at match time ra=
ther than </span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 giv=
ing an appropriate error message at parse time.</span><br style=3D"font-fam=
ily:courier new,monospace"><br style=3D"font-family:courier new,monospace">=
<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 Ex:=
 (org-make-tags-matcher &quot;P&lt;&lt;2&quot;) produces</span><br style=3D=
"font-family:courier new,monospace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0 (&quot;P&lt;&lt;2&quot; and=
</span><br style=3D"font-family:courier new,monospace"><span style=3D"font-=
family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0=A0 (progn</span><br =
style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0=A0 (setq org-cached-props nil)</span><br style=3D"font-family:courie=
r new,monospace"><span style=3D"font-family:courier new,monospace">=A0=A0=
=A0=A0=A0=A0=A0=A0=A0=A0=A0 (nil</span><br style=3D"font-family:courier new=
,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0=A0=A0 (string-to-number (or (org-cached-entry-get nil &quot;P&quot;)=
 &quot;&quot;)) 2))</span><br style=3D"font-family:courier new,monospace"><=
span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0=A0=
=A0 t)</span><br style=3D"font-family:courier new,monospace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 This is fixed in the new versi=
on and delivers an error </span><br style=3D"font-family:courier new,monosp=
ace"><span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=
=A0 message at parse time.</span><br style=3D"font-family:courier new,monos=
pace">


<br style=3D"font-family:courier new,monospace"><span style=3D"font-family:=
courier new,monospace">=A0=A0=A0=A0=A0 + missing org-re (line 7179 in org.e=
l) with posix classes</span><br style=3D"font-family:courier new,monospace"=
><span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 </=
span><br style=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0=A0=A0=A0=A0=A0 Min=
or consistency issue.=A0 This line does not occur in the new</span><br styl=
e=3D"font-family:courier new,monospace"><span style=3D"font-family:courier =
new,monospace">=A0=A0=A0=A0=A0=A0=A0 code.</span><br style=3D"font-family:c=
ourier new,monospace">


<br style=3D"font-family:courier new,monospace"><br style=3D"font-family:co=
urier new,monospace"><span style=3D"font-family:courier new,monospace">Than=
ks and regards,</span><br style=3D"font-family:courier new,monospace"><br s=
tyle=3D"font-family:courier new,monospace">


<span style=3D"font-family:courier new,monospace">=A0=A0 Christopher Genove=
se</span><br style=3D"font-family:courier new,monospace"><br style=3D"font-=
family:courier new,monospace"><br style=3D"font-family:courier new,monospac=
e">
</blockquote></div><br>

--047d7b10d1f789ea4204c66dd08e--