Hello,
After getting useful results and fast with KinoSearch .20 I began looking at
ways to narrow results further using field specific refinements. EG having
CPAN metadata indexed and being able to slice into it by a license field.
Might it be possible for a Scorer (I think it's a scorer) to compute from
within the set of matched results, the total frequency of tokens from a
given field. To use the CPAN example again, rather than choosing to search
for "date parser" and license:artistic , might the initial search for
"date parser" return the matching results AND a structure describing that of
100 matched documents, the field 'license' breaks down to perl=50,
artistic=30, gpl=10, bsd=5, apache=5.
One could then repeat the original search , adding 'license:perl' to
narrow the search to only the 50 matching documents.
Since this would required reading/examining each matched record I would
guess this belongs in the XS/C rather than perl.
Is it wishful thinking ? or might this be possible with subclassable
scorers/hit collectors.
++KinoSearch
Andrew
After getting useful results and fast with KinoSearch .20 I began looking at
ways to narrow results further using field specific refinements. EG having
CPAN metadata indexed and being able to slice into it by a license field.
Might it be possible for a Scorer (I think it's a scorer) to compute from
within the set of matched results, the total frequency of tokens from a
given field. To use the CPAN example again, rather than choosing to search
for "date parser" and license:artistic , might the initial search for
"date parser" return the matching results AND a structure describing that of
100 matched documents, the field 'license' breaks down to perl=50,
artistic=30, gpl=10, bsd=5, apache=5.
One could then repeat the original search , adding 'license:perl' to
narrow the search to only the 50 matching documents.
Since this would required reading/examining each matched record I would
guess this belongs in the XS/C rather than perl.
Is it wishful thinking ? or might this be possible with subclassable
scorers/hit collectors.
++KinoSearch
Andrew