Hi All,
In looking at the Java Docs, our Lucene team noticed that the `@` symbol is
a reserved character in the Lucene regular expression syntax.
In re-visiting the page in curiosity, I found that the symbol was
[Optional] for "any string." This came at a surprise because there's a very
common way to achieve "any string" in `.*`. Is there any compelling reason
to preserve this tiny vector of complexity? I suspect there may be some
differences in the constructions of the finite automata produced by `.*`
and `@` but I am not sure.
If insignificant or non-existent, I suggest we remove `@` from the regular
expression syntax.
--
Marcus Eagan
In looking at the Java Docs, our Lucene team noticed that the `@` symbol is
a reserved character in the Lucene regular expression syntax.
In re-visiting the page in curiosity, I found that the symbol was
[Optional] for "any string." This came at a surprise because there's a very
common way to achieve "any string" in `.*`. Is there any compelling reason
to preserve this tiny vector of complexity? I suspect there may be some
differences in the constructions of the finite automata produced by `.*`
and `@` but I am not sure.
If insignificant or non-existent, I suggest we remove `@` from the regular
expression syntax.
--
Marcus Eagan