• Home
  • Research
  • What We Offer
  • Who We Are
  • Blog
  • Your cart is empty.
  • Log in
  • Subscribe
  • Contact Us
  • Recent Entries
  • Get Custom Feeds
Team Blog
Free Research Sample
Bloem

Lucene can read almost anything: Lucid and ISYS team up

Added By Adriaan Bloem at 30-Jun-2009 | Twitter: @adriaanbloem |

A few months ago, I blogged about ISYS offering their document converter filters as a separate component. My thought was these would come in handy to add on to Lucene (which, by itself, can't actually read Microsoft Office files, let alone more exotic document types.) That would still leave you with a bit of DIY work, though: integrating the filters in your Lucene implementation.

As it turns out, Lucid Imagination had exactly that idea. The company, which offers commercial support for Lucene and Solr, is now offering it's own "LucidWorks" versions with the ISYS filters integrated. This means one of the gaps between open source and commercial search products has been bridged: with the filters, Lucene, too, can read over 200 file types.

According to Lucid, this has been one of the favorite doubts commercial vendors would cast over the open source search engine, and the move should level the playing field. However, as a customer, you should be aware that there's a couple of other things you may take for granted that are missing. Connectors to various content repositories, for instance, don't come with Lucene, not even a simple web crawler.

Still, the filters are a welcome addition, and they're certainly an improvement over what's currently available as open source. It's not just in the numbers: ask yourself how you think a converter will read a three-column Word document. You may be surprised to know that some will just go across all the first lines from left to right, then the second lines, etcetera. As always in Search & Information Access, the devil is in the details -- and knowing about these details will pay off.

The added filters aren't for free, but not exactly expensive, either. There's a 14-day trial, and you can get a subset (e.g., Microsoft Office) of the filters for as little as $3.250 for 2 years, or pay $10.000 for all of them (including those pesky legacy formats you'll discover in a distant corner of your fileserver when you least expect it.) That's still a long way off from the hundreds of thousands even a Google Appliance implementation may cost you in licensing. (Though there's no such thing as a free lunch or free beer with open source, either.)

So this is interesting news if you're considering Lucene, but what about ISYS? Aren't they selling the family silver? Well, let me wrap up this post by meandering off into history. As the (perhaps apocryphal) story has it, when the Dutch were at war with the Spanish in the 16th century, they were still selling cannons to their opponents. They figured they might as well make a profit out of it: the outcome would be determined by strategy, anyway.

Open source projects and commercial vendors, on the other hand, don't even have to be at war. And as with a Spanish Rioja or a Dutch Heineken, it's all about picking the right one for the occasion.

Categories: Adriaan Bloem, Search and Information Access, Open Source, Selecting Technology, Google Search Appliance, ISYS Search Suite, Lucene

  • Tweet This Entry

Online Education

Check out our classes and Register Today.

Evaluation Research

Get the real story about vendors and products.

My Research

Remember MeForgot password?

Not a subscriber? Learn about our subscriptions

Categories

Channel

  • Collaboration & Community Software (123)
  • Web Analytics (148)
  • Web Content Management (797)

Analyst

  • Adriaan Bloem (44)
  • Tony Byrne (660)
  • Apoorv Durga (7)
  • Jarrod Gingras (30)
  • Alan Pelz-Sharpe (59)
  • Theresa Regli (36)
  • Kas Thomas (77)

Topics

  • Asia-Pacific Marketplace (3)
  • Building Business Case (139)
  • Cloud Computing (5)
  • E-Discovery (1)
  • European Marketplace (15)
  • Governance (10)
  • Implementation (211)
  • Industry Events (1)
  • Industry Standards (110)
  • Information Architecture (84)
  • Intranets (6)
  • Marketplace at Large (503)
  • Open Source (93)
  • Selecting Technology (543)
  • Services Oriented Architecture (4)
  • Software-as-a-Service (17)
  • Usability (3)
  • Vendor Viability & Financials (128)
  • XML (28)

Industries

  • Finance (1)
  • Government (17)
  • Health Care (1)
  • Higher Ed (7)
  • Manufacturing (2)
  • Publishing-Media (4)
  • Retail (4)

Dates

  • 2010 (56)
  • 2009 (200)
  • 2008 (223)
  • 2007 (166)
  • 2006 (99)
  • 2005 (104)
  • 2004 (58)
  • 2003 (67)
  • 2002 (67)
  • 2001 (28)

Have Questions?

Sales & Customer Support

+1 800 325 6190 (USA)+44 (0) 20 3318 1911 (UK)+1 617 340 6464 (Int'l)sales@realstorygroup.com support@realstorygroup.com

All other inquiries: info@realstorygroup.com

Copyright, 2001 - 2010, Real Story Group. All rights reserved.

  • Contact Us
  • Copyright Policy
  • Privacy Policy
  • Terms of Use

The Real Story Group

  • CMS Watch
  • Enterprise Information
       Watch
  • SharePoint Watch
  • The Real Story Group

Research

  • Vendor Evaluations
  • Webinars & Advisory Papers
  • Online Education
  • Vendor Lists
  • Free Research Sample
  • Purchase Now

What We Offer

  • Research & Advisory
       Services
  • Frequently Asked Questions
  • Consulting Services
  • Customer Support
  • Contact Sales Team

Who We Are

  • We're Different
  • Our Team
  • Media
  • Customer List
  • Events
  • Contact Us

Get the real story via our bi-weekly newsletter.

Follow us on: RSS twitter

Log In

Remember MeForgot password?