The search industry is an ecosystem with a number of different companies and related technologies that together provide complete solutions for intranet and customer-facing content search.
While their researchers will differ, we view most search engines as very similar technologies. Some seem to gravitate to one or two specific vertical markets, but nearly all of them have features that support customers across a broad range of requirements.
There are many ways to organize the companies that make up the search ecosystem, and there are so many that any list will invariably omit a number of players. Nonetheless, here are the companies that we feel are strongest in the field.
Established Players
There are a handful of companies that are consistently on every enterprise search vendor list, from our annual overview of search technology to the Gartner Magic Quadrant and the Forrester Wave. These are the established players who have been at the top of the ecosystem for many years and who, in many ways, continue to solidify their position.
Autonomy
Autonomy has solid search technology, an impressive customer base, and acquired rival Verity in late 2005. With that and other acquisitions since, they are moving beyond "just search" into an even bigger market - risk and compliance. Read more about Autonomy.
Endeca
The only privately held company among the established players, Endeca has built a name for itself with their Information Access platform by innovating search to support relevance based on business rules with Guided Navigation® and Content Spotlighting™. Read more about Endeca.
FAST Search and Transfer
Since January 2008 a wholly owned subsidiary of Microsoft, FAST is another of the well-established leaders on everyone's list. FAST's flagship ESP platform offers predictable scalability and Enterprise Search 2.0 capabilities out of the box. Read more about FAST Search.
Google
With the best name recognition in search, Google has put together a strong solution for the enterprise search market with their Google Search Appliance. While it performs well for many companies, it may not be the best solution for every environment - yet. Read more about Google.
Challengers
Attivio
A newcomer in search, Attivio combines enterprise search with traditional relational structure to facilitate queries with "SQL-like" results. Attivio uses some Lucene-based technology and has a number of former FAST employees on staff.
Dieselpoint
A solid pure-Java-based search technology, Dieselpoint Search is solid, has full capabilities.
Exalead
This French-based company has re-launched its US operation with a new CEO and its CloudView enterprise product line. You can get a good view of the technology in action at their internet site, www.exalead.com/search.
IBM
This well-known giant has a number of search technologies, from the free IBM/Yahoo! search discussed below to mainframe engines. We think the technology they acquired from iPhrase in 2007 and marketed as IBM OmniFind Discovery Edition is among the most interesting technologies out there. One of the features we like best is the ability to integrate business rules into the query processing, and as the scalability and feature set grows it could move IBM into our Established Player category.
ISYS
ISYS is a successful Australian company with a new executive team in the United States and the ISYS:web product line. The technology is straightforward and capable, and has been quite successful marketing to companies, government agencies, and law enforcement. We interviewed a new ISYS customer who is quite happy with his selection in our newsletter.
Recommind
A solid search technology for enterprise search, Recommind's MindServer is based on technology that recognizes context and concept search to deliver meaningful, high quality results. While the technology is applicable to enterprise search requirements, Recommind has enjoyed the most success among legal and eDiscovery applications.
Vivisimo
Originally a clustering technology for federated search still available at http://www.clusty.com, Vivisimo has grown into a solid full-capability search platform with a number of key public customer sits including http://usasearch.gov and the National Library of Medicine http://www.nlm.nih.gov/. Vivisimo is moving into social search with Enterprise Search 2.0 methodology.
Free and low-cost solutions
Lucene
One of two related Apache Software Foundation search projects, Lucene is perhaps the best known of open source toolkits. While the original Lucene was written in Java, it has been ported to virtually all major languages and platforms, and is probably on more web sites than any other technology. Large companies like IBM have invested heavily in Lucene, and I expect we'll see increasingly capable versions over the coming years.
Solr
An additional project under Lucene, Solr provides a higher level API and more functionality than its parent including term highlighting, faceted search, and a simple administration console. The Solr toolkit, which runs in a Java servlet container such as Tomcat, lets you submit XML-structured records, a vast improvement over the low level calls in Lucene.
IBM/Yahoo
IBM and Yahoo! have teamed up to deliver an excellent free search platform for organizations with no more than 500,000 documents to be indexed. Packaged as the IBM Omnifind Yahoo! edition, it features a rich browser-based administrative console comparable to those in the best commercial products to control indexing, thesaurus/spelling suggestions and rudimentary search analytics. http://omnifind.ibm.yahoo.net/
Google Site Search
Google has introduced a low-cost hosted solution for public-web-based web sites called Google Site Search. Like hosted search companies such as SearchButton.com, FreeFind and Atomz, Google Site Search lets you add a free search to your public web content and display results with no ads and with full control over what users see. Pricing starts at $100 per year for up to 5000 web pages, and is, of course, powered by Google. Visit Google Site Search
Supporting Technologies
We'll have more to say about these in a bit.
- Grokker
- InQuira
- Nutch
- Stratify
- Teragram
- WAND