Internet vs. Enterprise Search - Part 1

« NIE Newsletter

20+ Differences Between Internet vs. Enterprise Search - And Why You Should Care^{(Part 1)}

Read Part 2 and Part 3 of this series.

Introduction

The perennial question of what separates Enterprise Search from the more familiar search engines that power the public Internet recently came up again. Dr. Search was planning to do a blog entry but the list mushroomed, and we now present the first in a three part series on the dozens of things that make Enterprise Search surprisingly difficult, and that sometimes flummox the engines that were created to power the public web.

As we hinted above, the public Internet was the inspiration and proving ground for a majority of the commercial and open source search engines out there. Solving that technical problem, indexing the Internet, has influenced both the architecture and implementation, as engineers have made hundreds of assumptions about data and usage patterns – assumptions that do not always apply behind the firewalls of corporations and agencies.

When vendors talk about their products, features and patents, they are usually talking about technology that was not specifically designed for the enterprise. This isn't just academic theory - as you'll see, these assumptions can actually break enterprise search, if not adjusted properly.

[back to top]

A Few Logistics

We've divided our list into "technical issues: user facing", "technical issues: back end data and indexing", and then "business and strategic" differences; we're doing the "easier" technical stuff in the first two parts, with the strategic and biz stuff as the finale. There's a bit of overlap, as some issues can be viewed from both a business and technical perspective, and data/indexing issues can affect what the user sees. Of course not every item applies to every project and vendor, "your mileage may vary". And heck, you may already know some of these, but we're trying to be quite comprehensive in scope, though perhaps a bit brief on some items. If anything catches your eye, that you'd like more details on, please drop us a note. And we've decided to let you do your own "numbering", this isn't late night TV after all.

[back to top]

Defining "Enterprise" for this article

To be clear, when we say "enterprise" search, we are referring to both the search engines that power private Intranets and Extranets, and to a lesser extent, the engines that companies have purchased to power their commerce and customer facing web sites. Broadly, "enterprise" search could be thought of as "all search engines EXCEPT the public Yahoo, Google and MSN", since you DO own and control the search engine that powers your public web site or online store. And again, your usage patterns and priorities are likely different from those of the Internet portals.

With all that said, let's get started!

20+ Differences Between Internet vs. Enterprise Search - And Why You Should Care(Part 1)

Introduction

High Level Internet / Intranet Mismatches

User Experience: High Level

Federated Search

Flexible rules for combining results from all of the engines searched

Maintaining User Security Credentials

Mapping User Security Credentials to other security domains

Advanced Duplicate Detection and Removal

Combining results list Navigators, such as Faceted Search links and Taxonomy Nodes.

Handling other results list links such as "next page" and sort order.

Translating user searches into the different search syntaxes used by the disparate engines

Extracting hits from HTML results, AKA "scraping"

User Experience: Lower Level Differences between Internet and Enterprise Search

In Our Next Issue…

20+ Differences Between Internet vs. Enterprise Search - And Why You Should Care^{(Part 1)}