Hi all,
Hi Marvin, this is really great work and truly appreciated.
I'm using KS 0.162. When using the following code, the error below is produced:
My Definitions
my $stemmer = KinoSearch::Analysis::Stemmer->new( language => 'en' );
my $stopalizer = KinoSearch::Analysis::Stopalizer->new(language => 'en');
my $analyzer = KinoSearch::Analysis::PolyAnalyzer->new(analyzers => [$stemmer, $stopalizer]);
The Error
Maximum token length is 65535; got 107462 at /usr/lib/perl5/site_perl/5.8.8/i386-linux-thread-multi/KinoSearch/Index/SegWriter.pm line 82
KinoSearch::Index::SegWriter::add_doc('KinoSearch::Index::SegWriter=HASH(0x852d47c)', 'KinoSearch::Document::Doc=HASH(0x852cf90)') called at /usr/lib/perl5/site_perl/5.8.8/i386-linux-thread-multi/KinoSearch/InvIndexer.pm line 224
KinoSearch::InvIndexer::add_doc('KinoSearch::InvIndexer=HASH(0x8546d7c)', 'KinoSearch::Document::Doc=HASH(0x852cf90)')
If I comment out the $stemmer and $stopalizer definitions and use the code below instead, indexing works perfectly, but this clearly doesn't let me set up the stemmer and stopalizer myself.
my $analyzer = KinoSearch::Analysis::PolyAnalyzer->new(language => 'en');
Could anyone suggest a possible workaround? Your assistance is greatly appreciated.
Regards,
Riyaad
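
A possible explanation, offered as an assumption rather than a confirmed diagnosis: the custom analyzer list above contains no Tokenizer, so each field's whole text may reach the indexer as a single token, which would account for a "token" of 107462 characters. The language-based PolyAnalyzer constructor works because it builds its own chain, including a tokenizer. A minimal sketch of a custom chain that tokenizes first, assuming the KinoSearch 0.16x analysis classes LCNormalizer and Tokenizer are available with default constructors:

```perl
use strict;
use warnings;

use KinoSearch::Analysis::LCNormalizer;
use KinoSearch::Analysis::Tokenizer;
use KinoSearch::Analysis::Stopalizer;
use KinoSearch::Analysis::Stemmer;
use KinoSearch::Analysis::PolyAnalyzer;

# Lowercase the text, then split it into word tokens.
my $lc_normalizer = KinoSearch::Analysis::LCNormalizer->new;
my $tokenizer     = KinoSearch::Analysis::Tokenizer->new;

# Same definitions as in the original post.
my $stopalizer = KinoSearch::Analysis::Stopalizer->new( language => 'en' );
my $stemmer    = KinoSearch::Analysis::Stemmer->new( language => 'en' );

# Order matters: tokenize before filtering, and remove stop words
# before stemming so the stoplist is matched against unstemmed words.
my $analyzer = KinoSearch::Analysis::PolyAnalyzer->new(
    analyzers => [ $lc_normalizer, $tokenizer, $stopalizer, $stemmer ],
);
```

If the error persists with this chain, the cause is probably elsewhere, for example a field holding one very long unbroken string.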