Mailing List Archive

Virtual Index
We have a relatively large (300,000+ documents) set of XML files to
index. The files themselves are articles broken up by journal and decade
so that users can restrict their search to specific journals and year
ranges. Under our old search engine this was done by creating a seperate
index for each journal/decade and then creating a "virtual index" which
would search the smaller indexes and put the results together (with
scoring preserved).

In Lucene it looks like I would have to build one large index and do
something like this:

title:test && (journal:myjournal && (year:1990 || year:1991 || year:1992
|| year:1993 || year:1994 || year:1995 || year:1996 || year:1997 ||
year:1998 || year:1999))

Is there a better way to do this?

--
To unsubscribe, e-mail: <mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>
Re: Virtual Index [ In reply to ]
If you prefer the old way (multiple indices) you can do that with
Lucene, too. Look at MultiSearcher class.
Lucene also supports range queries which may be helpful. I haven't
used them, but it sounds like the thing to look at.

Otis

--- Paul Dlug <paul@aps.org> wrote:
> We have a relatively large (300,000+ documents) set of XML files to
> index. The files themselves are articles broken up by journal and
> decade
> so that users can restrict their search to specific journals and year
> ranges. Under our old search engine this was done by creating a
> seperate
> index for each journal/decade and then creating a "virtual index"
> which
> would search the smaller indexes and put the results together (with
> scoring preserved).
>
> In Lucene it looks like I would have to build one large index and do
> something like this:
>
> title:test && (journal:myjournal && (year:1990 || year:1991 ||
> year:1992
> || year:1993 || year:1994 || year:1995 || year:1996 || year:1997 ||
> year:1998 || year:1999))
>
> Is there a better way to do this?
>
> --
> To unsubscribe, e-mail:
> <mailto:lucene-user-unsubscribe@jakarta.apache.org>
> For additional commands, e-mail:
> <mailto:lucene-user-help@jakarta.apache.org>
>


__________________________________________________
Do You Yahoo!?
Try FREE Yahoo! Mail - the world's greatest free email!
http://mail.yahoo.com/

--
To unsubscribe, e-mail: <mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>