Mailing List Archive

Book on crawlers (and almost all other features of current search engines)
Soumen Chakrabarti will have a chapter on crawlers in his new book "Mining
the Web: Discovering Knowledge from Hypertext Data" (Morgan Kaufmann), which
will be the first one I know of on this topic.
http://www.cse.iitb.ac.in/soumen/main/book-toc.ps

http://www.amazon.com/exec/obidos/ASIN/1558607544/qid%3D1023770182/sr%3D1-3/ref%3Dsr%5F1%5F3/002-7943686-4436045


--Clemens




--------------------------------------
http://www.cmarschner.net



--
To unsubscribe, e-mail: <mailto:lucene-dev-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-dev-help@jakarta.apache.org>
Re: Book on crawlers (and almost all other features of current search engines)
There is another book out there called "Programming Spiders, Bots, and
Aggregators in Java" by Jeff Heaton.

It's published by Sybex.

It includes a CD with complete crawler code that handles many issues,
although not all the ones the LARM project covers.

--Peter

On 7/28/02 4:16 AM, "Clemens Marschner" <cmad@lanlab.de> wrote:

> Soumen Chakrabarti will have a chapter on crawlers in his new book "Mining
> the Web: Discovering Knowledge from Hypertext Data" (Morgan Kaufmann), which
> will be the first one I know of on this topic.
> http://www.cse.iitb.ac.in/soumen/main/book-toc.ps
>
> http://www.amazon.com/exec/obidos/ASIN/1558607544/qid%3D1023770182/sr%3D1-3/ref%3Dsr%5F1%5F3/002-7943686-4436045
>
>
> --Clemens

