Mailing List Archive

features of Apache Solr
Hi,

My company is aming to handle all its knowledge management assets on an
unified platform. For this purpose we are looking for a software that can
search in realtime among sets of information of different type, structure
and/or location.



We have found that *Apache Solr* might be suitable for this task.

*We would be interested to know about the capabilities of Apapche Solr*:

- Is it capable to performing searches with short response time
(”real time”) among our knowledge management assets?
This includes assets by the following applications:

o Confluence: a local application with various type of attachments (png,
.pdf, .doc, .html, etc.)

o HP Service Manager: handling incoming tasks and their solutions,
including attachments (picture, text, e-mail, etc.)

o Sharepoint sites within our local network, with various type of
attachments (png, .pdf, .doc, .html, etc.)

o Microsoft OneNote: information without a structure, typically user
specific, Notepad-like files.

o JIRA: handling incoming tasks and their solutions, including
attachments (picture, text, e-mail, etc.)

o DokuWiki

o information stored in Microsoft Outlook

- In what way can it connect to the aforementioned applications?
The applications must be operatable independantly.



Regarding price:



? Is there any costs

? during installation? (e.g. cost of software, cost of consultation,
etc.)

? during operation?



Thank for anybody’s help in advance!



Sincerely,



Katalin Ecs?di

ecsodi.katalin@gmail.com
Re: features of Apache Solr [ In reply to ]
Hi,

Yes, Solr can search all the content you mention, but to do so you'll
have to extract the content and push it into Solr. This is done using
'connectors' and there are various open source and commercial
alternatives: too many to list here but I'd take a look at Apache
ManifoldCF as a starting point. Solr itself doesn't come with any of
these built in.

Solr doesn't cost anything to download - it's freely available under an
open source license. However there are certainly other costs to
consider: if you don't have the skills in-house you'll need to train
your team or hire someone (which might be hard as those with these
skills are rare and well-paid) or engage a consultancy (disclaimer: I
work for OpenSource Connections which is one of these consultancies). As
to operation, you'll need to host it somewhere but that's the only cost
you'll have apart from the people who'll run it.

As it seems you're at the start of your journey I can recommend you a
few resources:
- the free Search Insights reports (2018, 2019, 2020) linked from
https://opensourceconnections.com/blog/2020/02/27/search-insights-2020-a-new-report-from-the-search-network/
have a wealth of information on planning and running search projects
including directories
- https://manifoldcf.apache.org/

Cheers

Charlie

On 14/04/2020 17:05, Ecs?di Katalin wrote:
> Hi,
>
> My company is aming to handle all its knowledge management assets on an
> unified platform. For this purpose we are looking for a software that can
> search in realtime among sets of information of different type, structure
> and/or location.
>
>
>
> We have found that *Apache Solr* might be suitable for this task.
>
> *We would be interested to know about the capabilities of Apapche Solr*:
>
> - Is it capable to performing searches with short response time
> (”real time”) among our knowledge management assets?
> This includes assets by the following applications:
>
> o Confluence: a local application with various type of attachments (png,
> .pdf, .doc, .html, etc.)
>
> o HP Service Manager: handling incoming tasks and their solutions,
> including attachments (picture, text, e-mail, etc.)
>
> o Sharepoint sites within our local network, with various type of
> attachments (png, .pdf, .doc, .html, etc.)
>
> o Microsoft OneNote: information without a structure, typically user
> specific, Notepad-like files.
>
> o JIRA: handling incoming tasks and their solutions, including
> attachments (picture, text, e-mail, etc.)
>
> o DokuWiki
>
> o information stored in Microsoft Outlook
>
> - In what way can it connect to the aforementioned applications?
> The applications must be operatable independantly.
>
>
>
> Regarding price:
>
>
>
> ? Is there any costs
>
> ? during installation? (e.g. cost of software, cost of consultation,
> etc.)
>
> ? during operation?
>
>
>
> Thank for anybody’s help in advance!
>
>
>
> Sincerely,
>
>
>
> Katalin Ecs?di
>
> ecsodi.katalin@gmail.com
>

--
Charlie Hull
OpenSource Connections, previously Flax

tel/fax: +44 (0)8700 118334
mobile: +44 (0)7767 825828
web: www.o19s.com
Re: features of Apache Solr [ In reply to ]
Hi,
Thank you for your answers.

I have some more question about Apache Solr.
I speak english very bad, sorry for that.

- Is it possible to use together both Apapche Solr and MaifoldCF?
- In that case have they an own surface wich is user friendly, costumizable
by users? Or it is a tipically searching surface?
- Have they built-in authorization? I mean user groups with entitlement to
see or find a group of documents are managed in the application, or it
managed by AD groups? Or it needs an other software?
- Does an article have a life cycle status? I mean, is it possible to set
if a document is too old, not relevant, or its new, or just changed, or to
be aproved? Maybe these are storeed like metadata?
- Are they able to manage versions of the article?
- Could they send messages if an article has changed?
- Could an article be RO, or RW, or RWD with metadata?
- Are they able to manage competition processes ? for example 2 user want
to change a document.
- In that case we can make a real time connection with the applications
I've mentioned (Confluence, OneNote, HP SM (other name is 7IM), etc.),
could we use them onward, in their original enviroment?
- HP SM (7IM) has its data's in DB, are Apache Solr or Mainfold CF able to
connect with it (not by API)?
- Do they manage only links, references, or it stores the documents of the
other applications I mentioned?
- What statistics can make about searching?

- Is it possible to use - after planning and deploying Apache Solr and
Mainfold CF - by ourself?

Thank you very much in advance!

Best regards, Kate

Charlie Hull <charlie@flax.co.uk> ezt írta (id?pont: 2020. ápr. 14., K,
18:20):

> Hi,
>
> Yes, Solr can search all the content you mention, but to do so you'll
> have to extract the content and push it into Solr. This is done using
> 'connectors' and there are various open source and commercial
> alternatives: too many to list here but I'd take a look at Apache
> ManifoldCF as a starting point. Solr itself doesn't come with any of
> these built in.
>
> Solr doesn't cost anything to download - it's freely available under an
> open source license. However there are certainly other costs to
> consider: if you don't have the skills in-house you'll need to train
> your team or hire someone (which might be hard as those with these
> skills are rare and well-paid) or engage a consultancy (disclaimer: I
> work for OpenSource Connections which is one of these consultancies). As
> to operation, you'll need to host it somewhere but that's the only cost
> you'll have apart from the people who'll run it.
>
> As it seems you're at the start of your journey I can recommend you a
> few resources:
> - the free Search Insights reports (2018, 2019, 2020) linked from
>
> https://opensourceconnections.com/blog/2020/02/27/search-insights-2020-a-new-report-from-the-search-network/
> have a wealth of information on planning and running search projects
> including directories
> - https://manifoldcf.apache.org/
>
> Cheers
>
> Charlie
>
> On 14/04/2020 17:05, Ecs?di Katalin wrote:
> > Hi,
> >
> > My company is aming to handle all its knowledge management assets on an
> > unified platform. For this purpose we are looking for a software that can
> > search in realtime among sets of information of different type, structure
> > and/or location.
> >
> >
> >
> > We have found that *Apache Solr* might be suitable for this task.
> >
> > *We would be interested to know about the capabilities of Apapche Solr*:
> >
> > - Is it capable to performing searches with short response time
> > (”real time”) among our knowledge management assets?
> > This includes assets by the following applications:
> >
> > o Confluence: a local application with various type of attachments
> (png,
> > .pdf, .doc, .html, etc.)
> >
> > o HP Service Manager: handling incoming tasks and their solutions,
> > including attachments (picture, text, e-mail, etc.)
> >
> > o Sharepoint sites within our local network, with various type of
> > attachments (png, .pdf, .doc, .html, etc.)
> >
> > o Microsoft OneNote: information without a structure, typically user
> > specific, Notepad-like files.
> >
> > o JIRA: handling incoming tasks and their solutions, including
> > attachments (picture, text, e-mail, etc.)
> >
> > o DokuWiki
> >
> > o information stored in Microsoft Outlook
> >
> > - In what way can it connect to the aforementioned applications?
> > The applications must be operatable independantly.
> >
> >
> >
> > Regarding price:
> >
> >
> >
> > ? Is there any costs
> >
> > ? during installation? (e.g. cost of software, cost of
> consultation,
> > etc.)
> >
> > ? during operation?
> >
> >
> >
> > Thank for anybody’s help in advance!
> >
> >
> >
> > Sincerely,
> >
> >
> >
> > Katalin Ecs?di
> >
> > ecsodi.katalin@gmail.com
> >
>
> --
> Charlie Hull
> OpenSource Connections, previously Flax
>
> tel/fax: +44 (0)8700 118334
> mobile: +44 (0)7767 825828
> web: www.o19s.com
>
>