Mailing List Archive

Search results/criteria validation
Hello there,

We are looking to check what part of criteria matched for the document to be included in the results. So, for example our criteria is "(A or B or C) and (D or E)" and documents 1,2,3 came back in results. Can we check for each of the documents, which parts of criteria matched? So, for example, it might be that document 1 was matched because A and B and D were found and for document 2 C and E were found. Is there a way to check that?

------
Regards
-Siraj Haider
(212) 306-0154


________________________________

This electronic mail message and any attachments may contain information which is privileged, sensitive and/or otherwise exempt from disclosure under applicable law. The information is intended only for the use of the individual or entity named as the addressee above. If you are not the intended recipient, you are hereby notified that any disclosure, copying, distribution (electronic or otherwise) or forwarding of, or the taking of any action in reliance on, the contents of this transmission is strictly prohibited. If you have received this electronic transmission in error, please notify us by telephone, facsimile, or e-mail as noted above to arrange for the return of any electronic mail or attachments. Thank You.
Re:Search results/criteria validation [ In reply to ]
Maybe using explain?

https://chrisperks.co/2017/06/06/explaining-lucene-explain/


It might slow down the performance..

Cheers,
Diego


From: java-user@lucene.apache.org At: 03/17/21 17:26:14To: java-user@lucene.apache.org
Cc: shuo.pan@jobdiva.com
Subject: Search results/criteria validation

Hello there,

We are looking to check what part of criteria matched for the document to be
included in the results. So, for example our criteria is "(A or B or C) and (D
or E)" and documents 1,2,3 came back in results. Can we check for each of the
documents, which parts of criteria matched? So, for example, it might be that
document 1 was matched because A and B and D were found and for document 2 C
and E were found. Is there a way to check that?

------
Regards
-Siraj Haider
(212) 306-0154


________________________________

This electronic mail message and any attachments may contain information which
is privileged, sensitive and/or otherwise exempt from disclosure under
applicable law. The information is intended only for the use of the individual
or entity named as the addressee above. If you are not the intended recipient,
you are hereby notified that any disclosure, copying, distribution (electronic
or otherwise) or forwarding of, or the taking of any action in reliance on, the
contents of this transmission is strictly prohibited. If you have received this
electronic transmission in error, please notify us by telephone, facsimile, or
e-mail as noted above to arrange for the return of any electronic mail or
attachments. Thank You.
Re: Search results/criteria validation [ In reply to ]
Explain is a heavyweight thing. Maybe it helps you, maybe you need
something high-performance.

I was asking a similar question ~10 years ago and got a very interesting
answer on this list. If you want I can try to dig this to find it. At
the end, and with some limitation in the number of queries and in the
score’s fineness, it was indicating thing sub-query was used. This was
used to attempt highlighting matching of the parts of a formula.

Paul

On 17 Mar 2021, at 20:24, Diego Ceccarelli (BLOOMBERG/ LONDON) wrote:

> Maybe using explain?
>
> https://chrisperks.co/2017/06/06/explaining-lucene-explain/
>
>
> It might slow down the performance..
>
> Cheers,
> Diego
>
>
> From: java-user@lucene.apache.org At: 03/17/21 17:26:14To:
> java-user@lucene.apache.org
> Cc: shuo.pan@jobdiva.com
> Subject: Search results/criteria validation
>
> Hello there,
>
> We are looking to check what part of criteria matched for the document
> to be
> included in the results. So, for example our criteria is "(A or B or
> C) and (D
> or E)" and documents 1,2,3 came back in results. Can we check for each
> of the
> documents, which parts of criteria matched? So, for example, it might
> be that
> document 1 was matched because A and B and D were found and for
> document 2 C
> and E were found. Is there a way to check that?
>
> ------
> Regards
> -Siraj Haider
> (212) 306-0154
>
>
> ________________________________
>
> This electronic mail message and any attachments may contain
> information which
> is privileged, sensitive and/or otherwise exempt from disclosure under
> applicable law. The information is intended only for the use of the
> individual
> or entity named as the addressee above. If you are not the intended
> recipient,
> you are hereby notified that any disclosure, copying, distribution
> (electronic
> or otherwise) or forwarding of, or the taking of any action in
> reliance on, the
> contents of this transmission is strictly prohibited. If you have
> received this
> electronic transmission in error, please notify us by telephone,
> facsimile, or
> e-mail as noted above to arrange for the return of any electronic mail
> or
> attachments. Thank You.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: Search results/criteria validation [ In reply to ]
See https://issues.apache.org/jira/browse/LUCENE-9640

On Wed, Mar 17, 2021 at 4:02 PM Paul Libbrecht
<paul@hoplahup.net.invalid> wrote:
>
> Explain is a heavyweight thing. Maybe it helps you, maybe you need
> something high-performance.
>
> I was asking a similar question ~10 years ago and got a very interesting
> answer on this list. If you want I can try to dig this to find it. At
> the end, and with some limitation in the number of queries and in the
> score’s fineness, it was indicating thing sub-query was used. This was
> used to attempt highlighting matching of the parts of a formula.
>
> Paul
>
> On 17 Mar 2021, at 20:24, Diego Ceccarelli (BLOOMBERG/ LONDON) wrote:
>
> > Maybe using explain?
> >
> > https://chrisperks.co/2017/06/06/explaining-lucene-explain/
> >
> >
> > It might slow down the performance..
> >
> > Cheers,
> > Diego
> >
> >
> > From: java-user@lucene.apache.org At: 03/17/21 17:26:14To:
> > java-user@lucene.apache.org
> > Cc: shuo.pan@jobdiva.com
> > Subject: Search results/criteria validation
> >
> > Hello there,
> >
> > We are looking to check what part of criteria matched for the document
> > to be
> > included in the results. So, for example our criteria is "(A or B or
> > C) and (D
> > or E)" and documents 1,2,3 came back in results. Can we check for each
> > of the
> > documents, which parts of criteria matched? So, for example, it might
> > be that
> > document 1 was matched because A and B and D were found and for
> > document 2 C
> > and E were found. Is there a way to check that?
> >
> > ------
> > Regards
> > -Siraj Haider
> > (212) 306-0154
> >
> >
> > ________________________________
> >
> > This electronic mail message and any attachments may contain
> > information which
> > is privileged, sensitive and/or otherwise exempt from disclosure under
> > applicable law. The information is intended only for the use of the
> > individual
> > or entity named as the addressee above. If you are not the intended
> > recipient,
> > you are hereby notified that any disclosure, copying, distribution
> > (electronic
> > or otherwise) or forwarding of, or the taking of any action in
> > reliance on, the
> > contents of this transmission is strictly prohibited. If you have
> > received this
> > electronic transmission in error, please notify us by telephone,
> > facsimile, or
> > e-mail as noted above to arrange for the return of any electronic mail
> > or
> > attachments. Thank You.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
RE: Search results/criteria validation [ In reply to ]
Thanks for the response Paul, it would be great if you can point me to that discussion.

------
Regards
-Siraj Haider
(212) 306-0154

-----Original Message-----
From: Paul Libbrecht <paul@hoplahup.net.INVALID>
Sent: Wednesday, March 17, 2021 4:02 PM
To: java-user@lucene.apache.org; Diego Ceccarelli <dceccarelli4@bloomberg.net>
Subject: Re: Search results/criteria validation

Explain is a heavyweight thing. Maybe it helps you, maybe you need something high-performance.

I was asking a similar question ~10 years ago and got a very interesting answer on this list. If you want I can try to dig this to find it. At the end, and with some limitation in the number of queries and in the score’s fineness, it was indicating thing sub-query was used. This was used to attempt highlighting matching of the parts of a formula.

Paul

On 17 Mar 2021, at 20:24, Diego Ceccarelli (BLOOMBERG/ LONDON) wrote:

> Maybe using explain?
>
> https://chrisperks.co/2017/06/06/explaining-lucene-explain/
>
>
> It might slow down the performance..
>
> Cheers,
> Diego
>
>
> From: java-user@lucene.apache.org At: 03/17/21 17:26:14To:
> java-user@lucene.apache.org
> Cc: shuo.pan@jobdiva.com
> Subject: Search results/criteria validation
>
> Hello there,
>
> We are looking to check what part of criteria matched for the document
> to be included in the results. So, for example our criteria is "(A or
> B or
> C) and (D
> or E)" and documents 1,2,3 came back in results. Can we check for each
> of the documents, which parts of criteria matched? So, for example, it
> might be that document 1 was matched because A and B and D were found
> and for document 2 C and E were found. Is there a way to check that?
>
> ------
> Regards
> -Siraj Haider
> (212) 306-0154
>
>
> ________________________________
>
> This electronic mail message and any attachments may contain
> information which is privileged, sensitive and/or otherwise exempt
> from disclosure under applicable law. The information is intended only
> for the use of the individual or entity named as the addressee above.
> If you are not the intended recipient, you are hereby notified that
> any disclosure, copying, distribution (electronic or otherwise) or
> forwarding of, or the taking of any action in reliance on, the
> contents of this transmission is strictly prohibited. If you have
> received this electronic transmission in error, please notify us by
> telephone, facsimile, or e-mail as noted above to arrange for the
> return of any electronic mail or attachments. Thank You.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


________________________________

This electronic mail message and any attachments may contain information which is privileged, sensitive and/or otherwise exempt from disclosure under applicable law. The information is intended only for the use of the individual or entity named as the addressee above. If you are not the intended recipient, you are hereby notified that any disclosure, copying, distribution (electronic or otherwise) or forwarding of, or the taking of any action in reliance on, the contents of this transmission is strictly prohibited. If you have received this electronic transmission in error, please notify us by telephone, facsimile, or e-mail as noted above to arrange for the return of any electronic mail or attachments. Thank You.
B?KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKCB??[??X???X?KK[XZ[??]?K]\?\?][??X???X?PX?[?K?\X?K???B???Y][?[??[X[??K[XZ[??]?K]\?\?Z[X?[?K?\X?K???B?B
RE: Search results/criteria validation [ In reply to ]
Does that mean that we will need to wrap each clause of our Boolean query in order to check it for all clauses?

------
Regards
-Siraj Haider
(212) 306-0154

-----Original Message-----
From: Michael Sokolov <msokolov@gmail.com>
Sent: Wednesday, March 17, 2021 4:13 PM
To: java-user@lucene.apache.org
Cc: Diego Ceccarelli <dceccarelli4@bloomberg.net>
Subject: Re: Search results/criteria validation

See https://issues.apache.org/jira/browse/LUCENE-9640

On Wed, Mar 17, 2021 at 4:02 PM Paul Libbrecht <paul@hoplahup.net.invalid> wrote:
>
> Explain is a heavyweight thing. Maybe it helps you, maybe you need
> something high-performance.
>
> I was asking a similar question ~10 years ago and got a very
> interesting answer on this list. If you want I can try to dig this to
> find it. At the end, and with some limitation in the number of queries
> and in the score’s fineness, it was indicating thing sub-query was
> used. This was used to attempt highlighting matching of the parts of a formula.
>
> Paul
>
> On 17 Mar 2021, at 20:24, Diego Ceccarelli (BLOOMBERG/ LONDON) wrote:
>
> > Maybe using explain?
> >
> > https://chrisperks.co/2017/06/06/explaining-lucene-explain/
> >
> >
> > It might slow down the performance..
> >
> > Cheers,
> > Diego
> >
> >
> > From: java-user@lucene.apache.org At: 03/17/21 17:26:14To:
> > java-user@lucene.apache.org
> > Cc: shuo.pan@jobdiva.com
> > Subject: Search results/criteria validation
> >
> > Hello there,
> >
> > We are looking to check what part of criteria matched for the
> > document to be included in the results. So, for example our criteria
> > is "(A or B or
> > C) and (D
> > or E)" and documents 1,2,3 came back in results. Can we check for
> > each of the documents, which parts of criteria matched? So, for
> > example, it might be that document 1 was matched because A and B and
> > D were found and for document 2 C and E were found. Is there a way
> > to check that?
> >
> > ------
> > Regards
> > -Siraj Haider
> > (212) 306-0154
> >
> >
> > ________________________________
> >
> > This electronic mail message and any attachments may contain
> > information which is privileged, sensitive and/or otherwise exempt
> > from disclosure under applicable law. The information is intended
> > only for the use of the individual or entity named as the addressee
> > above. If you are not the intended recipient, you are hereby
> > notified that any disclosure, copying, distribution (electronic or
> > otherwise) or forwarding of, or the taking of any action in reliance
> > on, the contents of this transmission is strictly prohibited. If you
> > have received this electronic transmission in error, please notify
> > us by telephone, facsimile, or e-mail as noted above to arrange for
> > the return of any electronic mail or attachments. Thank You.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


________________________________

This electronic mail message and any attachments may contain information which is privileged, sensitive and/or otherwise exempt from disclosure under applicable law. The information is intended only for the use of the individual or entity named as the addressee above. If you are not the intended recipient, you are hereby notified that any disclosure, copying, distribution (electronic or otherwise) or forwarding of, or the taking of any action in reliance on, the contents of this transmission is strictly prohibited. If you have received this electronic transmission in error, please notify us by telephone, facsimile, or e-mail as noted above to arrange for the return of any electronic mail or attachments. Thank You.
B?KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKCB??[??X???X?KK[XZ[??]?K]\?\?][??X???X?PX?[?K?\X?K???B???Y][?[??[X[??K[XZ[??]?K]\?\?Z[X?[?K?\X?K???B?B