

TrendWatch Blog

IBM, Lucene, and the future of search

11-Nov-2009

I've been covering IBM's search technology (for our Search and Information Access Research) for two years now, and I confess that I've never quite understood the strategy (if there is one) behind IBM OmniFind Yahoo! Edition (OYE).

OYE is the free, Apache Lucene-based search application that IBM has offered since 2006. IBM does have customers who pay for commercial support for OYE, and according to Big Blue there have been over 50,000 downloads of OYE to date. But OYE isn't something IBM pushes heavily, and Google's search appliance business hasn't suffered appreciably from OYE's competition.

One wonders, then: Why bother offering something like OYE at all? What's the point in putting the "IBM OmniFind" moniker on a technology that is really mostly Lucene on the back end and Yahoo on the front end? It seems (on the surface) like a rather quick-and-easy way to try to get some of the "cool factor" from Lucene to rub off on OYE -- a kind of coolness by association.

It now seems likely that OYE was (among other things) an IBM testbed project for Lucene development, ahead of the eventual, inevitable Lucenization of the entire OmniFind family of products. And in fact an IBM rep told me that Big Blue will indeed be moving OmniFind Enterprise Edition to a Lucene-based core architecture eventually. This is big news from a number of standpoints. It's a huge endorsement (if Lucene needed any, at this point) of the open-source search engine's maturity and soundness; and it can only solidify Lucene's position of dominance in the open-source search firmament. It also brings Lucene and UIMA (Unstructured Information Management Architecture) closer together, hinting at the emergence (though not right away) of an industry-standard text analytics architecture.

A lot is at stake for IBM, too: The key pieces of IBM's information-access strategy -- including InfoSphere Content Assessment (ICA), InfoSphere Content Collector (ICC), and InfoSphere Classification Module (ICM) -- all employ the OmniFind Enterprise Edition search infrastructure in various ways. With Lucene and UIMA occupying center stage, IBM is betting a lot on this technology. 

What does it mean to you, the technology buyer? First, expect to see further significant investment in Lucene by the IT world -- and further blossoming of the technology ecosphere around Lucene -- as it becomes the key enabling technology underneath a variety of content-analytics applications. A year from now, Lucene won't simply mean "search"; it could underpin content-analytics apps of various kinds (including some that haven't even been envisioned yet).

Secondly, IBM's move may prompt the much-prophesied (but never realized) advent of a broad secondary ecosystem around UIMA: an ecosystem of parsers, annotators, and pluggable business rules.

Thirdly, we may see the emergence of a new wave of prospective standards around things like index formats, relevance, and tokenization.

And finally? Expect to see interesting arguments from the likes of Microsoft and Autonomy as to why their proprietary search solutions are better for you in the long run than more open architectures. It should make for an interesting discussion. Subscribers, stay tuned.

- Submitted by: Kas Thomas, Analyst - Twitter: kasthomas
