Mailing List Archive

nonalphanumeric characters
My search works for At&t with the ampersand in the middle. However it
doesn't work for e-commerce with the dash in the middle. Anything I have to
do with the analyzers/filters to fix this? Thanks.

-scott

--
To unsubscribe, e-mail: <mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>
Re: nonalphanumeric characters
Yes... you should extend one of the standard analyzers and provide a
mechanism to include the dash (-) within words. Use this analyzer to index
and search.
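
The reason for using one analyzer on both sides can be sketched without Lucene at all. The Java below is only a toy stand-in: the class HyphenAnalyzerSketch, its two character rules, and the tiny in-memory index are assumptions made up for illustration, not Lucene classes. It indexes text with a rule that keeps the dash inside words, then runs the same query through the matching rule and through a mismatched one.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.function.IntPredicate;

// Toy stand-in for an analyzer plus an index; none of these names are Lucene classes.
final class HyphenAnalyzerSketch {
    // Splits on anything that is not a letter or digit, so '-' breaks words.
    static final IntPredicate LETTERS_AND_DIGITS = Character::isLetterOrDigit;
    // Same rule, but a dash also counts as part of a word.
    static final IntPredicate WITH_HYPHEN = c -> Character.isLetterOrDigit(c) || c == '-';

    // Breaks text into lower-cased runs of accepted characters.
    static List<String> analyze(String text, IntPredicate isTokenChar) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (isTokenChar.test(c)) {
                current.append(Character.toLowerCase(c));
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    // Maps each token to the ids (list positions) of the documents containing it.
    static Map<String, Set<Integer>> index(List<String> docs, IntPredicate isTokenChar) {
        Map<String, Set<Integer>> postings = new HashMap<>();
        for (int id = 0; id < docs.size(); id++) {
            for (String token : analyze(docs.get(id), isTokenChar)) {
                postings.computeIfAbsent(token, k -> new HashSet<>()).add(id);
            }
        }
        return postings;
    }

    // Returns ids of documents that contain every token of the analyzed query.
    static Set<Integer> search(Map<String, Set<Integer>> postings, String query,
                               IntPredicate isTokenChar) {
        Set<Integer> hits = null;
        for (String token : analyze(query, isTokenChar)) {
            Set<Integer> docs = postings.getOrDefault(token, Collections.<Integer>emptySet());
            if (hits == null) {
                hits = new HashSet<>(docs);
            } else {
                hits.retainAll(docs);
            }
        }
        return hits == null ? new HashSet<Integer>() : hits;
    }

    public static void main(String[] args) {
        List<String> docs = Arrays.asList("Our e-commerce platform", "Commerce news");
        Map<String, Set<Integer>> postings = index(docs, WITH_HYPHEN);
        // Same rule at index and search time: the term "e-commerce" is found.
        System.out.println(search(postings, "e-commerce", WITH_HYPHEN));        // [0]
        // Mismatched rule at search time: the query becomes "e" + "commerce", no hit.
        System.out.println(search(postings, "e-commerce", LETTERS_AND_DIGITS)); // []
    }
}
```

In this sketch the mismatched search fails because the index only holds the single term "e-commerce", while the query is broken into "e" and "commerce". A real analyzer has more moving parts, so treat this as an illustration of the principle only.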


Rob
----- Original Message -----
From: <soshima@business.com>
To: <lucene-user@jakarta.apache.org>
Sent: Thursday, November 08, 2001 5:32 PM
Subject: nonalphanumeric characters


> My search works for At&t with the ampersand in the middle. However it
> doesn't work for e-commerce with the dash in the middle. Anything I have to
> do with the analyzers/filters to fix this? Thanks.
>
> -scott


RE: nonalphanumeric characters
Thanks. Do you know if nonalphanumeric characters are excluded by default?
Since & works but - doesn't, I am assuming they are not. I am not sure how
to explicitly include nonalphanumeric characters. If you have any
examples of this, that would be great. Many thanks again.

-----Original Message-----
From: Robert J. Lebowitz [mailto:lebowitz@finaltouch.com]
Sent: Thursday, November 08, 2001 4:24 PM
To: Lucene Users List
Subject: Re: nonalphanumeric characters


Yes... you should extend one of the standard analyzers and provide a
mechanism to include the dash (-) within words. Use this analyzer to index
and search.


Rob


Re: nonalphanumeric characters
It all depends on the analyzer and filters that you are using to index your
text. Search the archive for this list for my last name (lebowitz). I
posted the source code for a slightly modified analyzer and filter that
should give you an example you can adapt for your own purposes. It
was for an AlphaNumericAnalyzer, I believe. You'd have to add tests for
the extra characters you have in mind, not just the standard letters and
digits.
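
As a rough idea of what such an additional character test could look like, here is a minimal Java sketch. It is not the AlphaNumericAnalyzer source referred to above, which is not reproduced in this thread. The class InnerCharTokenizerSketch and its innerChars parameter are hypothetical names, and the rule of accepting an extra character only between two letters or digits is an assumption chosen so that a leading dash or a bare "--" does not become part of a token.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; not the AlphaNumericAnalyzer mentioned in the thread.
final class InnerCharTokenizerSketch {
    // Extra characters allowed inside a word, e.g. "-&".
    private final String innerChars;

    InnerCharTokenizerSketch(String innerChars) {
        this.innerChars = innerChars;
    }

    // Letters and digits always count. An extra character counts only when
    // it sits directly between two letters or digits.
    boolean isTokenChar(String text, int i) {
        char c = text.charAt(i);
        if (Character.isLetterOrDigit(c)) {
            return true;
        }
        if (innerChars.indexOf(c) < 0) {
            return false;
        }
        return i > 0 && i + 1 < text.length()
                && Character.isLetterOrDigit(text.charAt(i - 1))
                && Character.isLetterOrDigit(text.charAt(i + 1));
    }

    // Breaks text into lower-cased tokens using the test above.
    List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            if (isTokenChar(text, i)) {
                current.append(Character.toLowerCase(text.charAt(i)));
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        InnerCharTokenizerSketch tokenizer = new InnerCharTokenizerSketch("-&");
        // Prints [at&t, sells, e-commerce, tools, scott]
        System.out.println(tokenizer.tokenize("At&t sells e-commerce tools -scott"));
    }
}
```

Passing an empty string for innerChars falls back to plain letter-and-digit splitting, which makes it easy to compare the two behaviours on the same input.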

Rob

