I am working with lucene and i am new
I want to index documents HTML for this I do
java org.w3c.tidy.Tidy - m * html
java org.apache.lucene.demo.IndexHTML - create - index index .\
all this generates index to me and when doing my search in the Web if it
shows to the documents and the summary to me.
despues I index pdf
org.pdfbox.searchengine.lucene.IndexFiles - create - index pdf \
this also generates index to me
but the index PDF replace index HTML
how I can make him to have single index and when doing my search in the WEB
showme as HTML and PDF documents?
thanks
--
View this message in context: http://www.nabble.com/a-single-index-tf4401665.html#a12556579
Sent from the Lucene - General mailing list archive at Nabble.com.
I want to index documents HTML for this I do
java org.w3c.tidy.Tidy - m * html
java org.apache.lucene.demo.IndexHTML - create - index index .\
all this generates index to me and when doing my search in the Web if it
shows to the documents and the summary to me.
despues I index pdf
org.pdfbox.searchengine.lucene.IndexFiles - create - index pdf \
this also generates index to me
but the index PDF replace index HTML
how I can make him to have single index and when doing my search in the WEB
showme as HTML and PDF documents?
thanks
--
View this message in context: http://www.nabble.com/a-single-index-tf4401665.html#a12556579
Sent from the Lucene - General mailing list archive at Nabble.com.