Mailing List Archive

DO NOT REPLY [Bug 9015] New: - e-mail token in StandardTokenizer.jj does not match valid e-mail addresses
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=9015>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=9015

e-mail token in StandardTokenizer.jj does not match valid e-mail addresses

Summary: e-mail token in StandardTokenizer.jj does not match
valid e-mail addresses
Product: Lucene
Version: CVS Nightly - Specify date in submission
Platform: PC
OS/Version: Linux
Status: NEW
Severity: Normal
Priority: Other
Component: Analysis
AssignedTo: lucene-dev@jakarta.apache.org
ReportedBy: danson@germane-software.com


E-mail token in StandardTokenizer.jj does not match many valid e-mail
addresses. See line 106:

<EMAIL: <ALPHANUM> "@" <ALPHANUM> ("." <ALPHANUM>)+ >

For example, neither danson@germane-software.com (because of the dash) nor
dale.anson@germane-software.com (because of the first dot and the dash) match.
the following is slightly better, but does not come close to meeting the
specifications of RFC 822:

<EMAIL: <ALPHANUM> ("."|"-" <ALPHANUM>)+ "@" <ALPHANUM> ("."|"-" <ALPHANUM>)+
>

This is being reported against the May 11 nightly build (I compiled from
source using the supplied Ant build file on RedHat Linux 7.2, jikes, javacc
2.0, and Sun Linux JDK 1.4), however, I originally ran across this problem in
Lucene 1.2 rc4.

--
To unsubscribe, e-mail: <mailto:lucene-dev-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-dev-help@jakarta.apache.org>