<?xml version="1.0" encoding="us-ascii"?>
<rss version="2.0" xml:base="http://www.cmswatch.com" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/">
   <channel>
      <title>CMS Watch Enterprise Search Feed</title>
      <link>http://www.cmswatch.com</link>
      <description>CMS Watch headlines about Enterprise Search</description>
      <language>en-us</language>
      <lastBuildDate>Wed, 23 Jul 2008 23:53:34 -0400</lastBuildDate>
      <dc:creator>editor@cmswatch.com (Tony Byrne)</dc:creator>
      <dc:rights>Copyright 2005, CMS Watch</dc:rights>
      <dc:publisher>CMS Watch</dc:publisher>
      <image>
         <title>CMS Watch</title>
         <url>http://www.cmswatch.com/images/cmswatch_logo.gif</url>
         <link>http://www.cmswatch.com</link>
         <width>82</width>
         <height>36</height>
         <description>CMS Watch logo</description>
      </image>
      <item>
         <title>A new (and wearable) Content Technologies Subway Map</title>
         <description>A new season brings an updated &lt;a href=&quot;http://www.cmswatch.com/vendormap/&quot;&gt;vendor map&lt;/a&gt;:&lt;/p&gt;

&lt;table width=&quot;500&quot; border=&quot;0&quot; align=&quot;center&quot;&gt;
&lt;tr&gt;&lt;td align=&quot;center&quot;&gt;&lt;a href=&quot;http://www.cmswatch.com/vendormap/&quot;&gt;&lt;img src=&quot;/images/CMS-Watch-Subway-2008-small.gif&quot; alt=&quot;CMS Watch Q3 2008 Subway Vendor Map low-rez&quot; width=&quot;500&quot; height=&quot;375&quot; border=&quot;0&quot;&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;
&lt;br /&gt;
&lt;p&gt;We added a Yellow Line -- for &lt;a href=&quot;http://www.cmswatch.com/CCM/Report/&quot;&gt;XML &amp;amp; Component Content Management vendors&lt;/a&gt;, 
  and reflected some other  station changes.&lt;/p&gt;
&lt;p&gt;And now, if you like what you see, you and your wall can wear it. &lt;a href=&quot;http://www.cafepress.com/cmswatch&quot;&gt;Our new store 
  at Cafe Press&lt;/a&gt; offers t-shirt and posters of various sizes, along 
  with other CMS Watch tchotchkes. &lt;/p&gt;
&lt;p&gt;Regarding the latter, perhaps you already own your fill of mugs and mousepads, but can you ever 
  have enough &lt;a href=&quot;http://www.cafepress.com/cmswatch.285260609&quot;&gt;beer steins&lt;/a&gt;? Bring it to the next &lt;a href=&quot;http://www.cmswatch.com/About/Events/&quot;&gt;event where we're speaking&lt;/a&gt; and 
  we'll fill it up with the closest available brew. ;-)</description>
         <link>http://www.cmswatch.com/Trends/1320-A-new-(and-wearable)-Content-Technologies-Subway-Map?source=RSS</link>
         <category>Enterprise Portals</category>
         <author>tbyrne@cmswatch.com(Tony Byrne)</author>
         <pubDate>Wed, 23 Jul 2008 14:55:00 -0400</pubDate>
      </item>
      <item>
         <title>Infrastructure Updates for SharePoint</title>
         <description>Through the &lt;a href=&quot;http://blogs.msdn.com/sharepoint/&quot;&gt;SharePoint product 
team's MSDN blog&lt;/a&gt;, Microsoft announced that it had released &lt;a href=&quot;http://blogs.msdn.com/sharepoint/archive/2008/07/15/announcing-availability-of-infrastructure-updates.aspx&quot;&gt;a 
significant infrastructure update for SharePoint&lt;/a&gt; (and related technologies 
like Project Server that leverages SharePoint components). The update seems to 
primarily address three areas:&lt;/p&gt; 
&lt;ul&gt;
  &lt;p&gt;Search functionality and search-related performance (like index performance). 
  &lt;/p&gt;
  &lt;p&gt;Content Deployment bug fixes (which hopefully will correct a series of irritating 
    bugs related to deploying content from one SharePoint environment to another 
    in web content management scenarios). These are include the hotfix packs Microsoft 
    released for content deployment back in May of this year. &lt;/p&gt;
  &lt;p&gt;General interface and performance improvements. In reading the three or four 
    pages in Microsoft's site that aimed to describe what was actually included, 
    it was difficult to pinpoint what these &amp;quot;improvements&amp;quot; actual mean 
    to SharePoint administrators. However, Microsoft describes them as &amp;quot;...fixes 
    and product performance updates driven by customer feedback which have resulted 
    in significant platform performance improvements...&amp;quot; Again, I was unable 
    to nail what precisely has changed or how significant the improvements were. 
  &lt;/p&gt;
&lt;/ul&gt;
&lt;p&gt;What's interesting, at least with regard to search, is that it seems the &amp;quot;ancillary&amp;quot; 
  search products like Search Server 2008 (and it's &amp;quot;free&amp;quot; sibling Search 
  Server Express 2008) are driving updates to SharePoint's search technology. 
  As mentioned in the &lt;a title=&quot;CMS Watch SharePoint Report 2008&quot; href=&quot;http://www.cmswatch.com/SharePoint/Report/&quot;&gt;&lt;em&gt;SharePoint 
  Report 2008&lt;/em&gt;&lt;/a&gt;, Microsoft has invested heavily in improving SharePoint 
  search. In fact, historically, it seemed as if SharePoint Search was the the 
  parent of these independent search tools, but it now appears as if &amp;quot;the 
  student [has become] the master&amp;quot; as Darth Vader said to Obi Wan. &lt;/p&gt;
&lt;p&gt;In particular, SharePoint is getting Search Server's federated search capabilities 
  and &amp;quot;a unified search dashboard.&amp;quot; From what I saw at the last SharePoint 
  conference, both of these search products borrowed very heavily from the SharePoint 
  interface construct, but improved the visibility of certain configuration settings. 
  In particular, I liked the ease with which you could configure the federated 
  search. &lt;/p&gt;
&lt;p&gt;However, these changes call into question how this will all play out within 
  the Shared Services provider and whether administrators who are struggling to 
  figure out where to go to change search settings -- at the site, site collection, 
  Central Administration (in the Application or Operation tab) or in Shared Services. 
  While most key search settings reside in Shared Services, SharePoint has search-relate 
  configuration in spread over virtually every administrative interface. My hope 
  is that this &amp;quot;unified search dashboard&amp;quot; brings some order to search 
  within SharePoint.&lt;/p&gt;

&lt;p&gt;In the end, these changes (along with the FAST search integration) also add 
  more evidence to the theory that Microsoft is going to decouple search from 
  SharePoint entirely (and potentially the Office team) -- making SharePoint a 
  client technology. As I blogged about in &lt;a href=&quot;http://www.cmswatch.com/Trends/1219-Thoughts-on-SharePoint-and-FAST-Search&quot;&gt;a post on the completion of the FAST 
  acquisition&lt;/a&gt;, Microsoft seems to be leaning very heavily towards and independent 
  search product team. And just to add fuel to the conspiratorial fire, this type 
  of organizational structure might make sense if, say, Microsoft were to acquire 
  a large Internet-centric search company (although it begs the question what 
  they'd do with all of this overlapping technology). </description>
         <link>http://www.cmswatch.com/Trends/1313-Infrastructure-Updates-for-SharePoint?source=RSS</link>
         <category>Enterprise Search</category>
         <author>shawn_shell@consejoinc.com(Shawn Shell)</author>
         <pubDate>Fri, 18 Jul 2008 00:20:00 -0400</pubDate>
      </item>
      <item>
         <title>Narrowcasting to your feed aggregator</title>
         <description>We're pleased that CMS Watch now covers &lt;a href=&quot;http://www.cmswatch.com/Reports/&quot;&gt;ten different technologies&lt;/a&gt;, but I suspect 
  that many of you take an interest in only one or two families of tools. If that's 
  you, here's a list of technology-specific RSS feeds that will just send relevant 
  postings to your reader or aggregator.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Digital Asset Management&lt;/strong&gt;: &lt;a href=&quot;http://www.cmswatch.com/RSS/cmswatch.channel.xml/DAM&quot;&gt;http://www.cmswatch.com/RSS/cmswatch.channel.xml/DAM&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;ECM Suites&lt;/strong&gt;: &lt;a href=&quot;http://www.cmswatch.com/RSS/cmswatch.channel.xml/ECM&quot;&gt;http://www.cmswatch.com/RSS/cmswatch.channel.xml/ECM&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;E-mail Archiving &amp;amp; Management&lt;/strong&gt;: &lt;a href=&quot;http://www.cmswatch.com/RSS/cmswatch.channel.xml/E-mail&quot;&gt;http://www.cmswatch.com/RSS/cmswatch.channel.xml/E-mail&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Enterprise Portals&lt;/strong&gt;: &lt;a href=&quot;http://www.cmswatch.com/RSS/cmswatch.channel.xml/Portal&quot;&gt;http://www.cmswatch.com/RSS/cmswatch.channel.xml/Portal&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Enterprise Search&lt;/strong&gt;: &lt;a href=&quot;http://www.cmswatch.com/RSS/cmswatch.channel.xml/Search&quot;&gt;http://www.cmswatch.com/RSS/cmswatch.channel.xml/Search&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;SharePoint&lt;/strong&gt;: &lt;a href=&quot;http://www.cmswatch.com/RSS/cmswatch.channel.xml/SharePoint&quot;&gt;http://www.cmswatch.com/RSS/cmswatch.channel.xml/SharePoint&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Social Software&lt;/strong&gt;: &lt;a href=&quot;http://www.cmswatch.com/RSS/cmswatch.channel.xml/Social&quot;&gt;http://www.cmswatch.com/RSS/cmswatch.channel.xml/Social&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Web Analytics&lt;/strong&gt;: &lt;a href=&quot;http://www.cmswatch.com/RSS/cmswatch.channel.xml/Analytics&quot;&gt;http://www.cmswatch.com/RSS/cmswatch.channel.xml/Analytics&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Web CMS / WCM&lt;/strong&gt;: &lt;a href=&quot;http://www.cmswatch.com/RSS/cmswatch.channel.xml/CMS&quot;&gt;http://www.cmswatch.com/RSS/cmswatch.channel.xml/CMS&lt;/a&gt; 
&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;XML &amp;amp; Component Content Management&lt;/strong&gt;: &lt;a href=&quot;http://www.cmswatch.com/RSS/cmswatch.channel.xml/CCM&quot;&gt;http://www.cmswatch.com/RSS/cmswatch.channel.xml/CCM&lt;/a&gt;</description>
         <link>http://www.cmswatch.com/Trends/1303-Narrowcasting-to-your-feed-aggregator?source=RSS</link>
         <category>Enterprise Portals</category>
         <author>tbyrne@cmswatch.com(Tony Byrne)</author>
         <pubDate>Fri, 11 Jul 2008 15:39:00 -0400</pubDate>
      </item>
      <item>
         <title>Enterprise Search Scalability: A Big Issue</title>
         <description>I was talking to a search vendor the other day who said something that really got my attention. He remarked that a customer recently came to him and asked what it would take, in terms of software, hardware, and time, to index &lt;em&gt;30 billion documents&lt;/em&gt;. Mind you, this was not some hypothetical exercise. The question came from somebody whose company actually has 30 billion documents under management.&lt;/p&gt;

&lt;p&gt;Consider the dimensions of the problem. Assuming (for purposes of argument) you could index a thousand documents per second on one machine, it would take a full year just to build the index for 30 billion docs. If the solution scales linearly, building (or rebuilding) the index would keep a 100-machine server farm busy for the better part of a week.&lt;/p&gt;

&lt;p&gt;That's considering the scenario in a static context only. In the real world, of course, documents are revised (some frequently, others never, most somewhere in-between). New docs enter the system. Old ones are dropped. Otherwise-unchanged docs are moved to new locations. Unless you can update your index(es) incrementally, in real time, as docs are added, deleted, modified, or moved, you have an index shelf-life problem. &lt;/p&gt;

&lt;p&gt;The traditional answer to the shelf-life problem is to rebuild the index every few days (or every night, if resources allow). At the level of ten or twenty thousand docs, a total rebuild of the index every few days isn't a huge issue. But when you get beyond something like a few million docs, performing total-rebuilds on a frequent basis quickly becomes a worst practice (if indeed it's practicable at all). At some point, you &lt;em&gt;need&lt;/em&gt; the ability to do incremental indexing.&lt;/p&gt;

&lt;p&gt;But, someone will ask, can't you just throw more machine resources (threads, memory, cycles) at the problem? Yes and no. If you're spidering files over the wire, bandwidth exhaustion becomes an issue. If you're indexing files locally, there's an OS-imposed limit on how many files you can have open at once. There's also the question of how much file data you can hold in memory. The reason this is important is that some search systems (quite a few, actually) need to load an entire document into memory before the doc can be indexed. If you're indexing 10-megabyte PDFs, it might not matter how many threads you have available. (Note, incidentally, that most docs occupy a &lt;em&gt;lot &lt;/em&gt;more space in memory than on disk.) And anyway, the CPU can execute only so many instructions per second, no matter how many docs you can load at once. &lt;/p&gt;

&lt;p&gt;I bring all this up for a couple of reasons. First, if you're shopping for a search solution, you need to regard the various vendors' performance claims with more than a modicum of caution. No two search scenarios are the same, obviously. But more than that, the parameters that affect scalability and performance are numerous and non-obvious (and their interactions subtle), tending to moot most performance claims straight out of the gate.&lt;/p&gt;

&lt;p&gt;Takeaway No. 1: If you care about performance (and you should), &lt;em&gt;do your own testing.&lt;/em&gt; Insist on it as part of any product evaluation.&lt;/p&gt;

&lt;p&gt;Takeaway No. 2: Get your programmers involved in the evaluation process early. Some of these issues require computer-science expertise to evaluate properly.&lt;/p&gt;

&lt;p&gt;Also (very important), when shopping for a search solution, don't buy for your present needs. Shop for your &lt;em&gt;future&lt;/em&gt; needs. Your company probably has ten times more content under management today than it had just five years ago. &lt;em&gt;Five years from now, it could have ten times more than today.&lt;/em&gt; Will your search solution scale appropriately? More particularly, &lt;em&gt;how&lt;/em&gt; will it scale? Will it scale linearly? Will it hit a brick wall?&lt;/p&gt;

If I were searching for a search solution, I'd ask every vendor a few simple questions: &lt;ul&gt;&lt;li&gt;How big is your biggest customer installation and what did it take to build it? &lt;/li&gt;&lt;li&gt;Can your system do incremental indexing? How often is a full rebuild required?&lt;br /&gt; &lt;/li&gt;&lt;li&gt;Does your indexer need to read a document into memory (whole) before indexing it, or can files be stream-processed?&lt;/li&gt;&lt;li&gt;What's the largest document your system can index without either choking or stopping after a particular number of characters?
&lt;/li&gt;&lt;li&gt;How does indexing performance change as the index gets bigger? (Not just &amp;quot;does it slow down?&amp;quot; but &lt;em&gt;how&lt;/em&gt; does it slow? Linearly? Exponentially? If it's the latter, you're going to hit a brick wall.)&lt;/li&gt;&lt;li&gt;And: &lt;em&gt;Do you support 64-bit architectures?&lt;/em&gt; &lt;/li&gt;&lt;/ul&gt;

&lt;p&gt;Those are just a few conversation-starters. For more (lots more), be sure to consult our &lt;em&gt;&lt;a href=&quot;http://www.cmswatch.com/Search/Report/&quot;&gt;Enterprise Search Report 2008&lt;/a&gt;&lt;/em&gt;. (You can get a free sample of it &lt;a href=&quot;http://www.cmswatch.com/Reports/Try&quot;&gt;online here&lt;/a&gt;.) And if you end up evaluating one or more search offerings in depth, please drop us a line and let us know what you learned. We're always interested in your feedback.</description>
         <link>http://www.cmswatch.com/Trends/1299-Enterprise-Search-Scalability:-A-Big-Issue?source=RSS</link>
         <category>Enterprise Search</category>
         <author>kthomas@cmswatch.com(Kas Thomas)</author>
         <pubDate>Wed,  9 Jul 2008 13:02:00 -0400</pubDate>
      </item>
      <item>
         <title>How Fast is Attivio?</title>
         <description>The &lt;a href=&quot;http://www.cmswatch.com/Search/Vendors/Fast%20Search%20&amp;%20Transfer&quot;&gt;Fast Search &amp;amp; Transfer&lt;/a&gt; news continues to keep me on my toes. The Norwegian business weekly &lt;a href=&quot;http://www.dn.no/forsiden/borsMarked/article1434702.ece&quot;&gt;Dagens N&amp;aelig;ringsliv&lt;/a&gt; seems to have come up with &lt;a href=&quot;http://www.dn.no/forsiden/borsMarked/article1434702.ece&quot;&gt;some decent evidence&lt;/a&gt; (&lt;a href=&quot;http://www.scribd.com/doc/3809691/Fasts-Stock-Market-Bluff&quot;&gt;English translation&lt;/a&gt;) of many things everybody already suspected -- and a couple of new ones.&lt;/p&gt;

&lt;p&gt;It covers the &lt;a href=&quot;http://www.cmswatch.com/Trends/878-FAST-buys-Convera%27s-RetrievalWare&quot;&gt;Convera acquisition&lt;/a&gt; and the &amp;quot;separate&amp;quot; but surprisingly coincidental deal where Convera bought several million's worth of Fast software it didn't need. Or as rival &lt;a href=&quot;http://www.cmswatch.com/Search/Vendors/Autonomy&quot;&gt;Autonomy&lt;/a&gt; (which was also in the running for buying Convera) has pointed out, Fast pumped up its revenues for that quarter with part of the money it paid for Convera, then got back for licenses. The DN article also covers a few other very suspicious deals, and some outright fraud. It's now even getting to the point where calling Fast &amp;quot;the Enron of Norway&amp;quot; &lt;a href=&quot;http://www.google.com/search?as_epq=enron+of+norway&quot;&gt;is getting long in the tooth&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;While that train wreck was unfolding before my eyes in slow motion, my fellow analyst &lt;a href=&quot;http://www.cmswatch.com/Analyst/15-Regli&quot;&gt;Theresa Regli&lt;/a&gt; pinged me last February about a new enterprise search company called &lt;a href=&quot;http://www.attivio.com&quot;&gt;Attivio&lt;/a&gt;. Information Today &lt;a href=&quot;http://newsbreaks.infotoday.com/nbReader.asp?ArticleId=40763&quot;&gt;raved about their new product AIE&lt;/a&gt;, with analysts quoted as saying things like &amp;quot;they are moving rapidly to develop tools that will eliminate many of the practical barriers to easily and efficiently deploy robust enterprise search solutions,&amp;quot; with the unique selling point of &amp;quot;data integration plus search and content processing,&amp;quot; a &amp;quot;hot niche for the next few years.&amp;quot;&lt;/p&gt;

&lt;p&gt;Since I'm always interested to find out more about robust enterprise search tools to fill hot niches for the next few years, I scrolled down to read what the Attivio CTO would explain about &lt;i&gt;how&lt;/i&gt; the product would achieve what &amp;quot;should have been solved by the integration of text search and XML into relational database managers such as Oracle.&amp;quot; As it turns out, it is based on a &amp;quot;mash-up&amp;quot; of open source &lt;a href=&quot;http://www.cmswatch.com/Portal/Vendors/Apache&quot;&gt;Apache Lucene&lt;/a&gt; and &amp;quot;licensed commercial software.&amp;quot;&lt;/p&gt;

&lt;p&gt;As described in our &lt;a href=&quot;http://www.cmswatch.com/Search/Report/&quot;&gt;Enterprise Search Report&lt;/a&gt; and &lt;a href=&quot;http://www.cmswatch.com/Trends/1247&quot;&gt;on this blog&lt;/a&gt;, Lucene itself is just a Java text search API. To be able to actually gather, convert, and query content you need many more components. It is perfectly feasible to put together a working enterprise search product around the core Lucene JAR (as demonstrated by &lt;a href=&quot;http://www.cmswatch.com/Trends/1247&quot;&gt;IBM's Omnifind Yahoo! Edition&lt;/a&gt;). But in order to get there, and to have Lucene index, for instance, Office documents and PDFs, you will have to first convert those documents to text. The filters to perform that conversion can be &lt;a href=&quot;http://www.cmswatch.com/Trends/1185&quot;&gt;bought from other vendors&lt;/a&gt;, based on open source such as &lt;a href=&quot;http://sourceforge.net/projects/pdftohtml/&quot;&gt;pdftohtml&lt;/a&gt;, or you'll have to build them yourself, which is a lot of work. There aren't too many vendors building their own filters, or even just modifying open source to do so. So if you &lt;i&gt;do&lt;/i&gt; build the filters needed to use Lucene yourself, you'd probably like to mention this as an advantage, and as Attivio states, &amp;quot;we developed our own Microsoft Office, WordPerfect, and PDF connectors to improve performance and reach deeper into the files than the conventional converters.&amp;quot;&lt;/p&gt;

&lt;p&gt;Since, like most enterprise search products, Lucene isn't based on a database and couldn't even connect to such content without help, it isn't surprising Attivio had to develop a &amp;quot;unique RDBMS data loader&amp;quot; which &amp;quot;indexes the tables individually.&amp;quot; This, again, is presented as a major advantage -- remember, converting documents and integrating structured and unstructured data are &amp;quot;a hot niche.&amp;quot;&lt;/p&gt;

&lt;p&gt;I remember seeing a vendor at a conference a few years back, with banners jokingly stating its product was &amp;quot;buzzword compliant!&amp;quot; Attivio certainly seems to have that skill down. The engineering effort is marketed as a &amp;quot;technology mashup,&amp;quot; &amp;quot;breaking down silos&amp;quot; between &amp;quot;open source and commercial software.&amp;quot; &amp;quot;We have lived with the challenge of having to choose between the precision of databases and the richness of search for a long time, but no longer&amp;quot; sounds great, but I don't see &lt;a href=&quot;http://www.cmswatch.com/Search/Vendors/Oracle&quot;&gt;Oracle&lt;/a&gt; and &lt;a href=&quot;http://www.cmswatch.com/Search/Vendors/Thunderstone&quot;&gt;Thunderstone&lt;/a&gt;'s RDBMS-based solutions breaking out in a sweat just yet.&lt;/p&gt;

&lt;p&gt;Maybe my over-exposure to marketing materials and flashy demos has turned me into a cynic, and Attivio's &lt;a href=&quot;http://attivio.com/ourproducts_ektid134.aspx&quot;&gt;downloadable trial version&lt;/a&gt; will have to do at least a decent job to convince me of the product's added value. Fortunately, that free download is &amp;quot;coming soon!&amp;quot; Yes, I'm sorry, I'm finding it increasingly hard to turn off that cynicism, especially when I turn back to the DN article about Fast Search &amp;amp; Transfer. Attivio was founded by &lt;a href=&quot;http://attivio.com/aboutus_ektid90.aspx&quot;&gt;former Fast employees&lt;/a&gt; and the Attivio CTO is Sid Probstein, formerly vice president of technology at FAST. More importantly, Attivio's CEO is Ali Riaz, who was COO at Fast but unexpectedly left the company in late 2006. Well, in hindsight, perhaps not so unexpectedly, though DN quotes him as saying &amp;quot;I had nothing to gain from manipulation of the accounts. I had no shares in the company. I wanted shares and quit because I didn't get any. If you want to find out what's wrong with the accounts, you need to look at those who could gain from it. And it wasn't me.&amp;quot;&lt;/p&gt;

&lt;p&gt;Dagens N&amp;aelig;ringsliv doesn't appear to agree with Riaz, however; if you want the full analysis of the what and why, I suggest you read the article. I myself find it surprising that the CEO of a technology startup backed by $6.2 million in venture capital would drive &lt;a href=&quot;http://multimedia.dn.no/archive/00144/LB_Ali_Riaz_Fast_144367m.jpg&quot;&gt;an Audi R8&lt;/a&gt;, but that doesn't mean anything (other than that I'm envious of his car). I also find it surprising a former Fast COO would be co-owner of a company reselling Fast licenses, but walking like an Enron duck and quacking like an Enron duck doesn't necessarily mean that it's really anything like Enron. And Attivio's clouding the core technology in marketing hyperboles and buzzword compliance is slightly disconcerting, but &lt;a href=&quot;http://www.cmswatch.com/Trends/1122&quot;&gt;many renowned companies engage in the same practice&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;DN quotes Riaz as saying &amp;quot;you should be much better at praising the people who have success, instead of pushing them down.&amp;quot; And I would certainly love to be proven wrong by Attivio's software; as soon as I get my hands on the trial download I requested, I will let you know if it lives up to the high expectations. As one of my teachers in school once told me, &amp;quot;I'm known for being cynical, or even sarcastic -- myself, I prefer to call it healthy skepticism and mild irony.&amp;quot;&lt;/p&gt;

&lt;p&gt;Being a cynic isn't a lot of fun -- but for now, I would advise you to be at least healthily skeptical of what Attivio has to offer.</description>
         <link>http://www.cmswatch.com/Trends/1294-How-Fast-is-Attivio?source=RSS</link>
         <category>Enterprise Search</category>
         <author>bloem@radagio.com(Adriaan Bloem)</author>
         <pubDate>Sat,  5 Jul 2008 09:55:00 -0400</pubDate>
      </item>
      <item>
         <title>Best bets: a worst practice?</title>
         <description>&lt;p&gt;As one of the authors of our &lt;i&gt;&lt;a href=&quot;http://www.cmswatch.com/Search/Report/&quot;&gt;Enterprise Search Report 2008&lt;/a&gt;&lt;/i&gt;, I've been spending a lot of time lately looking at search technology and talking to folks who care deeply about the subject (i.e., vendors and their customers).&lt;/p&gt;&lt;p&gt;One thing everyone seems to agree on is that providing relevant results to the user is a very hard problem indeed. People don't want to enter a few keywords and then get 10,000 hits on documents that contain those keywords, 9,999 of which might be irrelevant. They want pointers to the one or two documents that &lt;em&gt;are &lt;/em&gt;relevant.&lt;/p&gt;&lt;p&gt;The signal-to-noise problem is so thorny that many enterprise search products include an optional feature known as &amp;quot;best bets.&amp;quot; The idea is that certain very common searches should point to particular documents (or intranet pages) that are known, or presumed, to apply. Imagine that a lawyer working for a large legal firm logs into the company portal and searches on &amp;quot;poison pill.&amp;quot; A thousand hits might come back, of which 990 are related to medications, allergic reactions, toxicity, malpractice, and so on, even though all the person was really looking for was a link to the company's &amp;quot;merger and acquisitions&amp;quot; resource page. (&amp;quot;Poison pill&amp;quot; is a term for tactics a company can use to fend off hostile takeover attempts.) The idea of &amp;quot;best bets&amp;quot; is that you rig the system to promote the company's &amp;quot;M&amp;amp;A resources&amp;quot; link to the top of the hit list whenever someone does a search on &amp;quot;poison pill.&amp;quot;&lt;/p&gt;&lt;p&gt;Sometimes &amp;quot;best bets&amp;quot; refers to presenting the user with a recommendation when, say, several repositories exist, one or more of which could be better-suited to a given search than the others. (&amp;quot;Would you like to search the Parts Catalog for this?&amp;quot;) This is more of a navigational scenario. That's not really what I'm talking about here. I'm talking about&amp;#xa0; the practice of biasing search results by hard-coding certain answers to certain common queries. &lt;/p&gt;&lt;p&gt;Setting up &amp;quot;best bets&amp;quot; is typically a manual process. A person in IT will use search analytics to determine the most common search queries and the most-followed links associated with them. Then those associations will be captured in a database and wired into the search software in such a way that when a user issues a query for which a best bet already exists, the best-bet link(s) will automatically be shown at the top of the results page (either as a regular hit or under a separate heading of &amp;quot;Best Bets&amp;quot;). &lt;/p&gt;&lt;p&gt;&lt;a href=&quot;http://dennisdeacon.wordpress.com/2008/06/25/search-engine-best-bets/&quot;&gt;Not everyone&lt;/a&gt; thinks the &amp;quot;best bets&amp;quot; mechanism is a good idea. The problem is that, fundamentally, it's a hack. It's arguably the worst kind of hack in that it involves serious amounts of human intervention. Someone has to create the best-bet database. (Typically there will be hundreds, if not thousands, of best-bet links.) Then the database has to be updated and kept fresh as user needs change and documents are added to or dropped from the system. &lt;/p&gt;&lt;p&gt;In point of fact, the search software should do all this for you. After all, that's its job: to return relevant results (automatically) in response to queries. Why would you sink tens (or hundreds) of thousands of dollars into an enterprise search system only to override it with a manually assembled collection of point-hacks?&lt;/p&gt;&lt;p&gt;Sure, search is a hard problem. But if your search system is so poor at delivering relevant results that it can't figure out what your users need without someone in IT explicitly &lt;em&gt;telling &lt;/em&gt;it the answer, maybe you should search for a new search vendor. (And for help with that, see &lt;a href=&quot;http://www.cmswatch.com/Search/Report/&quot;&gt;Enterprise Search Report 2008&lt;/a&gt;, a free sample of which is available &lt;a href=&quot;http://www.cmswatch.com/Reports/Try/&quot;&gt;right here&lt;/a&gt;.)&lt;/p&gt;</description>
         <link>http://www.cmswatch.com/Trends/1286-Best-bets:-a-worst-practice?source=RSS</link>
         <category>Enterprise Search</category>
         <author>kthomas@cmswatch.com(Kas Thomas)</author>
         <pubDate>Thu, 26 Jun 2008 21:50:00 -0400</pubDate>
      </item>
      <item>
         <title>The value of archiving and the limitations of e-discovery</title>
         <description>Today we read about yet &lt;a href=&quot;http://www.bloomberg.com/apps/news?pid=20601087&amp;sid=aUWyzWYWGuFY&amp;refer=home&quot;&gt;another 
  major financial scandal&lt;/a&gt; allegedly exposed through the discovery of an e-mail 
  message from a fund principal that apparently stated that their fund was going 
  to be&amp;quot;'&lt;em&gt;toast&lt;/em&gt;.&amp;quot;&lt;/p&gt;
&lt;p&gt;The first thing I thought about this was that (&lt;em&gt;if true&lt;/em&gt;) it was a fantastically 
  stupid communication to put in an e-mail exchange. Secondly, I wondered why 
  it took so long to find this mail -- surely such high-profile financial managers 
  would have their mail exchanges monitored automatically and an exchange like 
  this should have rung every major alarm bell in the firm within seconds. Of 
  course they could have been using an external system to get around that; we 
  don't know at present. But this case once more highlights the limitations of 
  e-mail monitoring (&lt;a href=&quot;http://www.cmswatch.com/Trends/1274-EAM-focused-on-the-wrong-elements?&quot;&gt;discussed 
  here the other day&lt;/a&gt;) and e-discovery, and conversely the value of content 
  archiving.&lt;/p&gt;
&lt;p&gt;E-discovery is in many regards simply glorified &lt;a href=&quot;http://www.cmswatch.com/Search/Report/&quot;&gt;Enterprise 
  Search technology&lt;/a&gt;, but with the added ability to apply legal holds to data. 
  Just as Enterprise Search is limited by the quality and location of the content 
  it indexes, so too are e-discovery tools. Though in the case of e-discovery 
  the limitations are often more severe: evidence may or may not be conveniently 
  located in an e-mail message, as seems to be the case at Bear Stearns. More 
  commonly evidence has to be culled from not only e-mail stores, but also from 
  instant messaging systems, document systems, ERP systems, financial and business 
  applications, external drives, and so on. The idea that e-discovery is limited 
  to mail -- as many vendors (&lt;em&gt;and worryingly many buyers&lt;/em&gt;) seem to think 
  -- is naive in the extreme. Yet this misplaced belief is based on the reality 
  that the bulk of the data you will have to search will indeed be mail. Mail 
  represents the largest form of data in any organization, typically by an order 
  of magnitude (10x) or more. &lt;/p&gt;
&lt;p&gt;But here's the rub. Most of that e-mail mountain consists of redundant data 
  or as the technical terms goes, &amp;quot;&lt;em&gt;crap&lt;/em&gt;.&amp;quot; As we discuss at 
  length in our &lt;a href=&quot;http://www.cmswatch.com/E-mail/Report/&quot;&gt;&lt;em&gt;E-mail Archiving 
  &amp;amp; Management Report&lt;/em&gt;&lt;/a&gt;, typically 80% of mail data consists of a duplication. 
  Yet any search tool has to treat each piece of data equally, slowing the process 
  down massively and shooting discovery costs through the roof. How much more 
  sensible to use an archiving method to capture, filter, and reduce that volume 
  -- and ease the burden and cost of discovery?&lt;/p&gt;
&lt;p&gt;What did we learn today from the Bear Stearns scandal? Not much really, other 
  than mail (&lt;em&gt;and messages&lt;/em&gt;) continue to the be the key &amp;quot;gotcha&amp;quot; 
  elements of the data mountain, and that we need to monitor and manage them ever 
  more closely. Though the monitoring elements are far from mature, &lt;a href=&quot;http://www.cmswatch.com/E-mail/Report/&quot;&gt;EAM&lt;/a&gt; 
  tools today archive and filter very efficiently indeed. The need to take mail 
  and mail content seriously is now an imperative, and building a strategy, agreeing 
  methods and policies, and selecting the right tools -- however complex -- is 
  a must.</description>
         <link>http://www.cmswatch.com/Trends/1279-The-value-of-archiving-and-the-limitations-of-e-discovery?source=RSS</link>
         <category>Enterprise Search</category>
         <author>aps@cmswatch.com(Alan Pelz-Sharpe)</author>
         <pubDate>Thu, 19 Jun 2008 13:49:00 -0400</pubDate>
      </item>
      <item>
         <title>FAST clarification</title>
         <description>Apparently, when I &lt;a href=&quot;http://www.cmswatch.com/Trends/1257&quot;&gt;previously wrote&lt;/a&gt; about Carrie's &lt;a href=&quot;http://faculty.washington.edu/kgb/horror/handfromgrave.jpg&quot;&gt;hand reaching out from the grave&lt;/a&gt;, the metaphor was too subtle. So let me make it absolutely clear: I'm not hoping the FAST turmoil will &amp;quot;&lt;a href=&quot;http://www.ki4u.com/guide1.jpg&quot;&gt;just go away&lt;/a&gt;&amp;quot; so I won't be bothered by it anymore. I was pointing out the fact that the consequences could be further reaching than it being &amp;quot;just another investor soap&amp;quot;, and the acquisition of &lt;a href=&quot;http://www.cmswatch.com/Search/Vendors/Fast%20Search%20&amp;%20Transfer&quot;&gt;FAST&lt;/a&gt; by &lt;a href=&quot;http://www.cmswatch.com/Search/Vendors/Microsoft&quot;&gt;Microsoft&lt;/a&gt; won't be enough to bury it.&lt;/p&gt;
&lt;p&gt;Now that I've ruined the joke by explaining it anyway, let me add that this is no laughing matter. Of course, while it is much more fun to &lt;a href=&quot;http://larsumlaut.wordpress.com/&quot;&gt;ridicule the situation without having to be able to back it up&lt;/a&gt;, I'll have to stick to the known facts: FAST is under investigation by the police. By stating just that, I estimate the readers of this blog to understand this as a serious warning.&lt;/p&gt;
&lt;p&gt;When I say I'd rather discuss the technology, again, I imply rather then explain (to do a good job of that &lt;a href=&quot;http://www.cmswatch.com/Search/Report/&quot;&gt;takes quite a few pages&lt;/a&gt;). So let me explicitly state some of the questions arising.&lt;/p&gt;
&lt;p&gt;Now that FAST is turning out to be more of a &lt;a href=&quot;http://en.wikipedia.org/wiki/1948_Tucker_Sedan&quot;&gt;Tucker Torpedo&lt;/a&gt; than a &lt;a href=&quot;http://en.wikipedia.org/wiki/Cadillac&quot;&gt;Cadillac&lt;/a&gt; (and to quote Wikipedia: &amp;quot;[Tucker's] Accessories Program raised funds by selling accessories before the car was even in production&amp;quot;): what is it Microsoft actually bought?&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt; Customers (who were, it now seems, fraudulently overvalued, and many of which are on the Linux platform), whom Microsoft probably won't support in the long run?&lt;/li&gt;
&lt;li&gt; Street-cred in enterprise search, which is now seriously marred by the consistently negative reports about FAST, compounded by suspicions that MS may have once again been &lt;a href=&quot;http://www.portfolio.com/news-markets/top-5/2008/05/19/Microsofts-Deal-Plans&quot;&gt;blinded by competition with Google&lt;/a&gt;?&lt;/li&gt;
&lt;li&gt; A vision on enterprise search, in the form of a &lt;a href=&quot;http://www.microsoft.com/presspass/press/2008/apr08/04-25LervikPR.mspx&quot;&gt;vice president of Enterprise Search&lt;/a&gt; possibly implicated in a police investigation?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;These have been widely reported, and there doesn't seem to be a whole lot more of value that came for Microsoft's $1.2 billion than technology. Which is what prompted my widely misunderstood remark that I'd rather discuss just that piece of the puzzle. Surely, it would have been possible to acquire the technology at a friendlier price point, say &lt;a href=&quot;http://www.cmswatch.com/Trends/878&quot;&gt;$23 million&lt;/a&gt; (oh wait! they got that as a free bonus), but it raises enough questions on its own. &lt;a href=&quot;http://www.cmswatch.com/Trends/1185&quot;&gt;Which parts&lt;/a&gt; of ESP aren't third party, bought, or open source? &lt;a href=&quot;http://www.cmswatch.com/Trends/1219&quot;&gt;How would these be integrated&lt;/a&gt;, or will a separate product line emerge? What are Microsoft's long-term views on enterprise search?&lt;/p&gt;
&lt;p&gt;I've criticized Microsoft's unclear strategy and roadmap for &lt;a href=&quot;http://www.cmswatch.com/SharePoint/Report/&quot;&gt;MOSS&lt;/a&gt; in the past, and once again, Redmond seems to be covered in fog. Since search solutions are written off in years, not months, knowing what future development holds for the software is very relevant to existing and prospective customers. I'd think twice before investing in an implementation until the clouds subside.&lt;/p&gt;
&lt;p&gt;On a side note, kudos to &lt;a href=&quot;http://marklogic.blogspot.com/2008/06/blind-eyes-industry-analysts-and.html&quot;&gt;Dave Kellogg&lt;/a&gt; for being the first to actually respond to my previous post under his own name. I received a fair bit of anonymous mail on this subject, and it seems paradoxical to be criticized for not speaking out loud enough by those who choose to remain incognito themselves. Though this post is not about Dave, as much as his post is not about me, I would like to point out that my name contains three A's (not two), and I'm not a 20-something English major. Yes, CMS Watch does have &lt;a href=&quot;http://www.cmswatch.com/vendormap/&quot;&gt;maps&lt;/a&gt; and &lt;a href=&quot;http://www.cmswatch.com/Feature/175-Search-2008&quot;&gt;charts&lt;/a&gt;, and no, we don't dismiss business problems -- we just prefer not to place too much emphasis on the intricate details of &lt;a href=&quot;http://marklogic.blogspot.com/search?q=GAAP&quot;&gt;GAAP accounting procedures&lt;/a&gt; on this blog...;-)&lt;/p&gt;</description>
         <link>http://www.cmswatch.com/Trends/1262-FAST-clarification?source=RSS</link>
         <category>Enterprise Search</category>
         <author>bloem@radagio.com(Adriaan Bloem)</author>
         <pubDate>Tue,  3 Jun 2008 09:43:00 -0400</pubDate>
      </item>
      <item>
         <title>CMS Watch Competition Winner</title>
         <description>You may remember a while back we launched our &lt;a href=&quot;http://www.cmswatch.com/Trends/1201-Readers'-challenge---name-our-new-chart!&quot;&gt;little competition&lt;/a&gt; to come up with a new name for our vendor positioning chart. We had some great (&lt;em&gt;and varied&lt;/em&gt;) responses from all over the world. And it took quite an internal debate to decide on the eventual winner, but decide we did. And the winner is...&lt;a href=&quot;http://wordofpie.wordpress.com/&quot;&gt;Laurence Hart&lt;/a&gt; who offered us the name &amp;quot;Cross Check.&amp;quot; Laurence, a bottle of champagne is yours. &lt;/p&gt;
&lt;p&gt;Over the next month we will continue working with our designer to revamp the chart, and of course to rename it -- so look out for the Cross Check in all our report updates this year. </description>
         <link>http://www.cmswatch.com/Trends/1260-CMS-Watch-Competition-Winner?source=RSS</link>
         <category>Enterprise Portals</category>
         <author>aps@cmswatch.com(Alan Pelz-Sharpe)</author>
         <pubDate>Mon,  2 Jun 2008 09:45:00 -0400</pubDate>
      </item>
      <item>
         <title>FAST and the Furious</title>
         <description>I vividly remember being terrified  as a child by the movie &lt;a href=&quot;http://www.imdb.com/title/tt0074285/&quot;&gt;Carrie&lt;/a&gt;.  The closing scene -- where Carrie's hand &lt;a href=&quot;http://faculty.washington.edu/kgb/horror/handfromgrave.jpg&quot;&gt;reaches out from the grave&lt;/a&gt; -- left a lasting impression.&lt;/p&gt;
&lt;p&gt;Now I'm a technology analyst, not a financial analyst, and certainly no Stephen 
  King. But sometimes it's hard to ignore what goes on in our industry beyond the realm of ones and zeros, and when &lt;a href=&quot;http://www.wired.com/techbiz/it/news/2008/05/portfolio_0529&quot;&gt;Wired&lt;/a&gt; 
  reports it (or, if you prefer to read about it in Norwegian, &lt;a href=&quot;http://e24.no/selskap/FAST/article2442276.ece&quot;&gt;Aftenposten&lt;/a&gt;), 
  I should at least casually mention it. The central character here, &lt;a href=&quot;http://www.cmswatch.com/Search/Vendors/Fast%20Search%20&amp;%20Transfer&quot;&gt;FAST 
  Search &amp;amp; Transfer&lt;/a&gt;, is not exactly dead -- just acquired, by Microsoft.&lt;/p&gt;
&lt;p&gt;After &lt;a href=&quot;http://www.cmswatch.com/Trends/995&quot;&gt;enduring many difficulties last year&lt;/a&gt;, 
  FAST's problems turned out to be more than just &lt;a href=&quot;http://marklogic.blogspot.com/2007/08/fast-search-train-wreck-whos.html&quot;&gt;accounting 
  headaches&lt;/a&gt;. So much so, in fact, that the Kredittilsynet (the Norwegian financial 
  authority) found the irregularities important enough to refer the case to &amp;Oslash;kokrim.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;http://www.okokrim.no/&quot;&gt;&amp;Oslash;kokrim&lt;/a&gt; describes itself as &amp;quot;The 
  Norwegian National Authority for Investigation and Prosecution of Economic and 
  Environmental Crime.&amp;quot; A bit of a mouthful which I can abbreviate to &amp;quot;the 
  police.&amp;quot; In other words, it is now a criminal investigation.&lt;/p&gt;
&lt;p&gt;Of course, it would be thrilling to continue this story by recounting past rumors about FAST, and what &lt;a href=&quot;http://www.cmswatch.com/Search/Vendors/Microsoft&quot;&gt;Microsoft&lt;/a&gt; did or did not know about it before they started talking about an &lt;a href=&quot;http://www.cmswatch.com/Trends/1112&quot;&gt;acquisition&lt;/a&gt;. Followed by an interpretation of the meaningful silences in the conference call where the two companies jointly announced the deal. And to round it off, a discourse on how this will affect the future of enterprise search for Microsoft.&lt;/p&gt;
&lt;p&gt;But in fact, I hope we've pretty much heard the last of it and can return to simply discussing the merits and demerits of the technology. So even though I have this lingering image of the last scene of &amp;quot;Carrie&amp;quot; in the back of my mind, after this short break, we'll return to our &lt;a href=&quot;http://www.cmswatch.com/Search/Report/&quot;&gt;regularly scheduled programming&lt;/a&gt;.</description>
         <link>http://www.cmswatch.com/Trends/1257-FAST-and-the-Furious?source=RSS</link>
         <category>Enterprise Search</category>
         <author>bloem@radagio.com(Adriaan Bloem)</author>
         <pubDate>Fri, 30 May 2008 01:16:00 -0400</pubDate>
      </item>
      <item>
         <title>Dining at the intersection of Search and Retention</title>
         <description>Lawyers were well represented (you might say) at this year's &lt;a href=&quot;http://www.enterprisesearchsummit.com/&quot;&gt;Enterprise Search Summit&lt;/a&gt; in New York. At times, ESS felt more like an e-discovery conference with analytics and social-computing side-tracks rather than a search conference featuring a few e-discovery sessions. &lt;/p&gt;
&lt;p&gt;Based on what I saw at the Search Summit, there seems to be a renewed awareness, at ever-higher levels in the corporate responsibility chain, that in a litigious business environment &amp;quot;enterprise search&amp;quot; is not just a knowledge-management tactic or a productivity aid, but a survival imperative. You will be sued some day. (It's not a matter of &amp;quot;if,&amp;quot; but when.) During the discovery phase of the suit, you're going to provide (and also receive from the other side) bewilderingly immense amounts of data. Without good search technology, sifting through the data isn't just tedious but nightmarishly expensive.&lt;/p&gt;
&lt;p&gt;I didn't get a chance to attend any e-discovery sessions at the Search Summit. It didn't matter. At lunch, I happened to sit down next to litigation technology consultant (and ESS presenter) &lt;a href=&quot;http://www.linkedin.com/pub/3/9A9/224&quot;&gt;Jeff Flax&lt;/a&gt;. We had an illuminating chat about search and discovery in the context of records retention. &lt;/p&gt;
&lt;p&gt;Flax noted that many companies that have records retention policies aren't following them. He sees a &amp;quot;pack rat&amp;quot; syndrome: a tendency to let expired records remain in the morgue past the &amp;quot;save-till&amp;quot; date. The problem with this is that files that have been declared obsolete or marked for disposition, but have not yet been physically destroyed, are still subject to subpoena. &amp;quot;A good lawyer will ask for expired documents during discovery,&amp;quot; Flax notes.&lt;/p&gt;
&lt;p&gt;Lawyers are also demanding data in its &amp;quot;native state&amp;quot;: Not text dumps or PDFs or other derivative forms of the data, but the data as it actually exists. &amp;quot;If I'm a lawyer and I'm requesting someone's e-mails on a certain subject,&amp;quot; says Flax, &amp;quot;I don't want the e-mails as text files, I want the original e-mail archive in binary form so I can pick apart the bits and get at all the header and footer and other information in context.&amp;quot;&lt;/p&gt;
&lt;p&gt;Sometimes physical media must be handed over in discovery so that deleted files can be detected and recovered. &amp;quot;I've seen cases where browser search queries from many years back, supposedly no longer on disk, have been recovered forensically,&amp;quot; Flax told me. &amp;quot;And then certain keyword clumps are detected, and those query patterns can become admissible in court.&amp;quot;&lt;/p&gt;
&lt;p&gt;It turns out that the data in a search index (the index built by a search engine) can often be used to reconstruct a document even after the document itself has been irretrievably lost. Takeaway: A document can't be considered fully destroyed until you've destroyed its search-index data as well. (I wonder how many retention policies take this into account? Doubtless very few.) &lt;/p&gt;
&lt;p&gt;If you're concerned about e-mail retention (and if you're not, you should be), you might want to look into our latest offering: &lt;a href=&quot;http://www.cmswatch.com/E-mail/Report/&quot;&gt;&lt;em&gt;The E-mail 
  Archiving &amp;amp; Management Report 2008.&lt;/em&gt;&lt;/a&gt;You'll find that the report divides vendors (roughly) along three lines: policy-centric, archive-centric, and SaaS-based. (You can &lt;a href=&quot;http://www.cmswatch.com/Reports/Try/&quot;&gt;see a free sample here&lt;/a&gt;.)&lt;/p&gt;&lt;p&gt;My advice? Never pass up a chance to have lunch with a litigation technology expert. You'll be inundated with food for thought. &lt;/p&gt;</description>
         <link>http://www.cmswatch.com/Trends/1251-Dining-at-the-intersection-of-Search-and-Retention?source=RSS</link>
         <category>Records Management</category>
         <author>kthomas@cmswatch.com(Kas Thomas)</author>
         <pubDate>Thu, 22 May 2008 17:41:00 -0400</pubDate>
      </item>
      <item>
         <title>Enterprise search: free as in free beer?</title>
         <description>Searching information -- really, how hard can it be? So, why wouldn't you go 
  out and get a search engine that's for free? Well, to stick to the analogy of 
  &amp;quot;free beer,&amp;quot; you might wake up in the morning with a headache, only 
  to find your wallet gone.&lt;/p&gt;
&lt;p&gt;Of course, I'm paraphrasing the &lt;a href=&quot;http://www.gnu.org/philosophy/free-sw.html&quot;&gt;definition 
  of &amp;quot;free software&amp;quot;&lt;/a&gt;. &lt;a href=&quot;http://en.wikipedia.org/wiki/Richard_stallman&quot;&gt;Richard 
  Stallman&lt;/a&gt;'s example is used to point out the &lt;a href=&quot;http://en.wikipedia.org/wiki/Gratis_versus_Libre&quot;&gt;ambiguity 
  of the term &amp;quot;free&amp;quot;&lt;/a&gt; in the English language. With free software, 
  &amp;quot;you should think of free as in free speech, not as in free beer.&amp;quot; 
  Nevertheless, you should be warned: both open source beer (&lt;a href=&quot;http://freebeer.org/&quot;&gt;now 
  in version 3.3&lt;/a&gt;) and free commercial beer have the potential for leaving 
  you with a bit of a hangover.&lt;/p&gt;
&lt;p&gt;If you really think enterprise search is a simple commodity -- and I will only 
  comment on that with the obligatory statement that readers of our &lt;a href=&quot;http://www.cmswatch.com/Search/Report/&quot;&gt;&lt;em&gt;Enterprise 
  Search Report&lt;/em&gt;&lt;/a&gt; will probably know better than that -- getting a free 
  product would be ideal to get your feet wet (albeit somewhat &lt;a href=&quot;http://www.funnyphotos.net.au/images/beer-spillage-of-the-back-of-a-truck-having-some-t1.jpg&quot;&gt;sticky&lt;/a&gt;). 
  I get invited to &lt;a href=&quot;http://en.wikipedia.org/wiki/BYOB&quot;&gt;BYOB&lt;/a&gt; enterprise 
  search parties a lot, and usually come up with Apache Lucene, IBM Omnifind Yahoo! 
  Edition, and Microsoft Search Server 2008 Express. Let's get a closer taste 
  of each.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;http://lucene.apache.org/&quot;&gt;Apache Lucene&lt;/a&gt;. Lucene is open source, 
  which you are free to use. The problem is, it's not a complete enterprise search 
  product -- it's a &amp;quot;text search engine API.&amp;quot; What you get is a Java 
  JAR with the core functionality of a search engine. In typical hardcore Java 
  developer understatement this is described as &amp;quot;you write the easy stuff, 
  the UI and the process of selecting and parsing your data files to pump them 
  into the search engine, yourself.&amp;quot; To developers that doesn't sound too 
  difficult -- it's a library they'd be able to use to create search functionality 
  for many applications. As they embark on that journey, however, many will find 
  out they'll have to become experts on enterprise search to get their implementation 
  to perform basic tasks any &lt;a href=&quot;http://www.cmswatch.com/Search/Vendors/Google/&quot;&gt;Google&lt;/a&gt; 
  user has come to expect. Index Word documents? You'll have to convert those 
  to text first. Remove stop words or perform spell checking? You'll have to get 
  some more jars to fit that in. And that familiar &lt;a href=&quot;http://www.google.com/search?q=google+ui&quot;&gt;user 
  interface&lt;/a&gt; isn't so easy to replicate, either.&lt;/p&gt;
&lt;p&gt;Of course, there's a couple of more &amp;quot;pre-packaged,&amp;quot; Lucene-based 
  engines (such as &lt;a href=&quot;http://lucene.apache.org/nutch/&quot;&gt;Nutch&lt;/a&gt; and &lt;a href=&quot;http://lucene.apache.org/solr/&quot;&gt;Solr&lt;/a&gt;), 
  but they'll only take you so far on that long and winding road. There's some 
  excellent examples of what you can achieve with Lucene, but many more of how 
  hard it can be to get there.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;http://www.cmswatch.com/Trends/802-IBM-and-Yahoo-to-Offer-Free-Enterprise-Search-Engine&quot;&gt;IBM Omnifind Yahoo! Edition&lt;/a&gt; (or OY!E). The Google appliances have the Google brand behind them, which must have got the IBM people thinking the Yahoo! brand would be excellent marketing for their free-to-use search engine. In fact, it's neither IBM nor Yahoo's technology, but Lucene wrapped in other open source software. A few commercial bits thrown in create a product that's easy to install and run. It will actually do many of the things Lucene will make you work hard to accomplish: it comes with support for several languages and quite a few source content filters. For users, it looks like a regular web search engine; for admins, there's a nicely designed and intelligible interface. In short, it does most of the things a Google Mini appliance will do -- but for free.&lt;/p&gt;
&lt;p&gt;So what's the catch? Well, the license (by the way, &lt;a href=&quot;http://www.google.com/search?q=%22IBM+Omnifind+Yahoo%21+Edition%22+license&quot;&gt;what 
  license&lt;/a&gt;?) limits you to 500,000 documents and 5 collections. After that, 
  you can &amp;quot;upgrade&amp;quot; to other Omnifind products. But since the technology 
  across the Omnifind line-up is completely different, this is the same as starting 
  from scratch, and you'll pay for the privilege. I've been critical of &lt;a href=&quot;http://www.cmswatch.com/Trends/1122-Google-Search-Appliance:-small-step-in-technology,-giant-leap-in-marketing&quot;&gt;the 
  limitations&lt;/a&gt; of Google's appliances in the past, and sure, the 50,000 document 
  limit of the entry-level Google Mini is a lot less than OY!E's half a million. 
  But that comparison isn't really fair, considering the fact the Mini actually 
  comes with the hardware to run the queries on &lt;a href=&quot;http://www.googlestore.com/appliance/product.asp?catid=3&quot;&gt;for 
  a mere $2,990&lt;/a&gt;. And don't think you'll be able to run IBM's software on an 
  old abandoned test server you have available -- OY!E will need more power than 
  the single blade Google Mini or &lt;a href=&quot;http://www.cmswatch.com/Trends/389-David-to-Google's-Mini-Goliath?&quot;&gt;Thunderstone 
  Appliance&lt;/a&gt; to match the performance. Tellingly, I wasn't able to dig up an 
  example of an OY!E implementation to mention while researching the &lt;a href=&quot;http://www.cmswatch.com/Search/Report/&quot;&gt;&lt;em&gt;Enterprise 
  Search Report&lt;/em&gt;&lt;/a&gt; (if you know of one, let me know).&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;http://www.cmswatch.com/Trends/1064-Microsoft's-Free-Lunch&quot;&gt;Microsoft Search Server 2008 Express&lt;/a&gt;. Microsoft's free offering is basically the same software as the non-Express version, but then there's the seemingly innocent limitation: one server only. I wouldn't want to continue the theme of this post by saying this is akin to handing out free samples of beer to get you hooked; suffice it to say that if you start to run the Express version in a production environment, there will, no doubt, come a time when a single server won't be enough anymore. When you've come to rely on the solution, you'll suddenly have to shell out for the licenses. &lt;a href=&quot;http://www.cmswatch.com/Trends/1064-Microsoft's-Free-Lunch&quot;&gt;As I've said before&lt;/a&gt;, having a free lunch isn't necessarily a bad thing; just remember that you'll probably have to pay for the beer the lunch comes with.&lt;/p&gt;
&lt;p&gt;So, this might all start sounding like advice your mother gave you: never take 
  anything from a stranger, and certainly no free alcoholic beverages. Don't forget, 
  however, that I'm Dutch, and I've certainly developed &lt;a href=&quot;http://amstellight.com&quot;&gt;a&lt;/a&gt; 
  &lt;a href=&quot;http://www.heineken.com&quot;&gt;taste&lt;/a&gt; &lt;a href=&quot;http://www.grolsch.com/&quot;&gt;for&lt;/a&gt; 
  enterprise search. Free beer sounds too good to be true, but it could certainly 
  get your party started; just remember to drink in moderation, and never, ever, 
  drink and drive.</description>
         <link>http://www.cmswatch.com/Trends/1247-Enterprise-search:-free-as-in-free-beer?source=RSS</link>
         <category>Enterprise Search</category>
         <author>bloem@radagio.com(Adriaan Bloem)</author>
         <pubDate>Fri, 16 May 2008 13:23:00 -0400</pubDate>
      </item>
      <item>
         <title>Vendor criticism of CMS Watch</title>
         <description>As you know at CMS Watch we write critical product evaluations to help you avoid expensive procurement and deployment mistakes. We write reports that detail both the warts and merits of big vendors like &lt;a href=&quot;http://www.cmswatch.com/ECM/Vendors/Documentum%20(EMC)&quot;&gt;EMC&lt;/a&gt;, &lt;a href=&quot;http://www.cmswatch.com/ECM/Vendors/Oracle&quot;&gt;Oracle&lt;/a&gt;, &lt;a href=&quot;http://www.cmswatch.com/ECM/Vendors/Xerox&quot;&gt;Xerox&lt;/a&gt; and &lt;a href=&quot;http://www.cmswatch.com/ECM/Vendors/IBM&quot;&gt;IBM&lt;/a&gt; -- through to smaller specialist vendors like &lt;a href=&quot;http://www.cmswatch.com/ECM/Vendors/Hyland&quot;&gt;Hyland&lt;/a&gt;, &lt;a href=&quot;http://www.cmswatch.com/Search/Vendors/Autonomy&quot;&gt;Autonomy&lt;/a&gt; and &lt;a href=&quot;http://www.cmswatch.com/ECM/Vendors/Nuxeo&quot;&gt;Nuxeo&lt;/a&gt;. Readers of our reports often ask me &amp;quot;&lt;em&gt;what did vendor x say when they read &lt;u&gt;that&lt;/u&gt;!&lt;/em&gt;&amp;quot;   The assumption, sometimes correct, is that vendors freak out on reading such criticism. &lt;/p&gt;
&lt;p&gt;In an industry whereby most of the &amp;quot;&lt;em&gt;independent analysts&lt;/em&gt;&amp;quot; are heavily dependent on revenues from the very firms they claim to be &amp;quot;&lt;em&gt;independent&lt;/em&gt;&amp;quot; of, it's unusual to see truly critical research get published. So it becomes a surprise to both buyers and sellers when they read such criticism. In our reports we widely distribute the compliments and brickbats -- if something is truly terrible we will tell you.&lt;/p&gt;
&lt;p&gt;But most of the time it is not a case of bad technology versus good technology. Rather it is a case of good fit versus bad fit: a product that could become an outstanding performer in a larger legal firm may make a terrible fit in a mid-sized manufacturing and ERP-centric environment. Hence we urge you the  reader to study all the alternatives and balance them out, rather than look at one preferred vendor in isolation.&lt;/p&gt;
&lt;p&gt;Speaking of isolation, the marketing groups of some vendors seem to operate in in a kind of vacuum. I guess it's part of the job for them to drink their own Kool Aid, but some of them seem to think it's part of their job to attack and stop &lt;em&gt;any&lt;/em&gt; criticism of their product or company. At CMS Watch we're often on the receiving end of that wrath; that stinks sometimes, but so be it. Just as it is the vendor's job to wax lyrical about the joys of their product, so too is it ours to unearth the reality. If you want to get an insight into this particular dynamic, whether you're a curious end user or a vendor AR (Analyst Relations) person, check out &lt;a href=&quot;http://www.cmswatch.com/Feature/178-Analyst-Relations&quot;&gt;the article&lt;/a&gt; I published today. </description>
         <link>http://www.cmswatch.com/Trends/1234-Vendor-criticism-of-CMS-Watch?source=RSS</link>
         <category>Enterprise Portals</category>
         <author>aps@cmswatch.com(Alan Pelz-Sharpe)</author>
         <pubDate>Fri, 16 May 2008 10:43:00 -0400</pubDate>
      </item>
      <item>
         <title>SEO: furthering the case for better content hygiene</title>
         <description>The worlds of SEO and enterprise search are not as far apart as you might think.  Let me explain.  Along &lt;a href=&quot;http://www.cmswatch.com/Trends/1225-Content-Management---UK-vs.-US&quot;&gt;with my colleague Alan&lt;/a&gt;, I had the pleasure of attending the upbeat &lt;a href=&quot;http://www.internetworld.co.uk/visiting-the-show.html&quot;&gt;Internet World UK&lt;/a&gt; conference earlier this month in London. What I enjoy most about attending such large, diverse events is going to sessions about technologies related to, but not directly about, the technologies I cover -- and so I found myself attending numerous sessions on SEO, or Search Engine Optimization. 
&lt;/p&gt;
&lt;p&gt; Though there was an inquisitive crowd present for my talk, the topic I spoke 
  about -- &lt;a href=&quot;http://www.cmswatch.com/Search/&quot;&gt;enterprise search&lt;/a&gt; -- 
  is much lower on the totem pole for e-marketers (the principal attendees at 
  this event) than SEO. A few showed up, in fact, wanting to better understand 
  the nuances of website search vs. enterprise search vs. SEO. Are there universal 
  things we can do to simultaneously improve all things, I was asked? Indeed. 
  Despite the differences in searching the web, searching within your enterprise, 
  and making your own website more findable by search engines, many of the same 
  best practices apply. &lt;/p&gt;
&lt;p&gt; As readers of &lt;i&gt;&lt;a href=&quot;http://www.cmswatch.com/Search/Report/&quot;&gt;The Enterprise 
  Search Report&lt;/a&gt;&lt;/I&gt; know, good content hygiene is essential to good search 
  results. Consistent content structure, metadata, and simple things like clear 
  and meaningful document titles all help search engines work better -- be they 
  the public (e.g., Google) website, or enterprise kind. But in most cases, content 
  managers don't know where to start with their clean-up: it's an often overwhelming 
  task. As such, content clean-up continues to get cast aside (&amp;quot;too much 
  work&amp;quot;), and search technology vendors make it seem that much less important 
  when they promote technology as the panacea to content woes. Don't believe it 
  for a second. &lt;/p&gt;
&lt;p&gt;
A few more tidbits from the event:
&lt;/p&gt;
&lt;p&gt;
&lt;ol&gt;
  &lt;li&gt;These days, 80% of e-commerce transactions start with a web search, says 
    &lt;a href=&quot;http://dancohen.info/&quot;&gt;Dan Cohen&lt;/a&gt;, Head of SEO for MSN UK. 
  &lt;li&gt;The #1 hindrance to SEO is poor content and code (which also highly contributes 
    to poor web and intranet search results) 
  &lt;li&gt;Creating clear, topic-related content &amp;quot;hubs&amp;quot; on your web site is the next 
    most important thing for SEO 
  &lt;li&gt;As readers of our &lt;a href=&quot;http://www.cmswatch.com/Analytics/Report/&quot;&gt;&lt;I&gt;Web 
    Analytics Report&lt;/I&gt;&lt;/a&gt; know, &lt;a href=&quot;http://www.cmswatch.com/Analytics/&quot;&gt;analytics&lt;/a&gt; 
    should be a core part of your e-marketing pie, to get a clear picture of what 
    your customers are looking for and where they're getting stuck. 
&lt;/ol&gt;
&lt;/p&gt;
&lt;p&gt;Bottom line: cleaning up your content can improve your website search results &lt;em&gt;and&lt;/em&gt; your Google ranking.</description>
         <link>http://www.cmswatch.com/Trends/1236-SEO:-furthering-the-case-for-better-content-hygiene?source=RSS</link>
         <category>Enterprise Search</category>
         <author>tregli@cmswatch.com(Theresa Regli)</author>
         <pubDate>Fri,  9 May 2008 16:12:00 -0400</pubDate>
      </item>
      <item>
         <title>Budget time: How much should I set aside for software licenses?</title>
         <description>When budget-building time comes up, many technology customers face the interesting 
  question of how much money to put aside for new software licenses. Even without 
  looking at specific vendors, you might have to tell your manager some ballpark 
  figure for expected license costs.&lt;/p&gt;
&lt;p&gt;As an analyst I'm frequently asked about license prices. A recent interesting 
  discussion among peers challenged my views and provided helpful feedback that 
  might assist you in arriving at the right numbers in today's marketplace:&lt;/p&gt;
&lt;ul&gt;
  &lt;p&gt;List prices aside, buyers can presently obtain significant discounts 
    on enterprise portals and on Web CMS tools. This may be caused by the increased 
    SharePoint infiltration. A commentary in February on &lt;a href=&quot;http://www.cmswatch.com/Trends/1159-Big-software-discounts-ahead&quot;&gt;big 
    software discounts&lt;/a&gt; and &lt;a href=&quot;http://www.cmswatch.com/Trends/1222-Mortgage-crisis:-The-least-of-Vignette's-worries&quot;&gt;recent 
    numbers from Vignette&lt;/a&gt; seems to confirm this trend. &lt;a href=&quot;http://www.cmswatch.com/Trends/1160-More-on-SharePoint-licensing-costs&quot;&gt;SharePoint 
    licensing for websites&lt;/a&gt; is the exception that proves the rule. In general 
    if the Web CMS comes from an ECM vendor, it will be more expensive -- potentially 
    &lt;em&gt;way&lt;/em&gt; more expensive&lt;/p&gt;
  &lt;p&gt;With enterprise search at the high end, the reverse is true. The marketplace 
    is seeing strong demand at the moment. Many enterprise-tier search offerings 
    come only as a bundled offering, so there is little list pricing to benchmark 
    against. Deals quickly run into the millions of Euros in large, global, and 
    complex enterprises.&lt;/p&gt;
  &lt;p&gt;Among the huge array of mid-market vendors across different content technologies 
    -- many them local/regional in footprint -- you can typically find solutions 
    that meet the needs of even organization-wide deployments in most enterprises, 
    but at a factor of five (or more) cheaper than the higher-end solutions&lt;/p&gt;
  &lt;p&gt;If you are willing to serve as a reference client or appear on the customer 
    list -- or better within a press release -- this is very valuable for the 
    vendor and should help you to get significant discounts. (And of course as 
    you look to evaluate vendors and they provide such testimonials, you should 
    also understand how this game is played.)&lt;/p&gt;
&lt;/ul&gt;
&lt;p&gt;Remember that enterprise deals entail complex negotiation and pricing models 
  that ultimately boil down to what the salesperson thinks you can afford. Perhaps 
  needless to say, but still: Implementation costs are higher than licensing costs 
  and open source projects are not necessarily cheaper just because you might 
  save licensing costs.&lt;/p&gt;
&lt;p&gt;Thanks to &lt;a href=&quot;http://www.cmswatch.com/Analyst/10-Pelz-Sharpe&quot;&gt;Alan Pelz-Sharpe&lt;/a&gt;, &lt;a href=&quot;http://www.cmswatch.com/Analyst/17-Durga&quot;&gt;Apoorv Durga&lt;/a&gt;, &lt;a href=&quot;http://www.steptwo.com.au/about/staff/jamesr/index.html&quot;&gt;James Robertson&lt;/a&gt; and &lt;a href=&quot;http://www.intranetfocus.co.uk/&quot;&gt;Martin White&lt;/a&gt;.</description>
         <link>http://www.cmswatch.com/Trends/1227-Budget-time:-How-much-should-I-set-aside-for-software-licenses?source=RSS</link>
         <category>Enterprise Portals</category>
         <author>info@jboye.dk(Janus Boye)</author>
         <pubDate>Sat,  3 May 2008 17:14:00 -0400</pubDate>
      </item>
      <item>
         <title>Thoughts on SharePoint and FAST Search</title>
         <description>In our &lt;a title=&quot;CMS Watch SharePoint Report 2008&quot; href=&quot;http://www.cmswatch.com/SharePoint/Report/&quot; target=&quot;_blank&quot;&gt;&lt;em&gt;SharePoint 
  Report 2008&lt;/em&gt;&lt;/a&gt; we discuss SharePoint's shortcomings and strengths in the 
  search space. While SharePoint 2007's search capabilities have been improved 
  over the 2003 product, it's still not &amp;quot;enterprise class&amp;quot; for a variety 
  of different reasons.&lt;/p&gt;
&lt;p&gt;Clearly Microsoft saw this same shortcoming (both in SharePoint and it's overall 
  search offerings) and &lt;a href=&quot;http://www.microsoft.com/presspass/press/2008/jan08/01-08FastSearchPR.mspx&quot; target=&quot;_blank&quot;&gt;announced&lt;/a&gt; 
  that they were going to acquire enterprise search vendor &lt;a href=&quot;http://www.fastsearch.com/&quot; target=&quot;_blank&quot;&gt;FAST 
  Search and Transfer&lt;/a&gt; (more information on FAST can be found in our &lt;a href=&quot;http://www.cmswatch.com/Search/Report/&quot; target=&quot;_blank&quot;&gt;&lt;em&gt;Enterprise 
  Search Report 2008&lt;/em&gt;&lt;/a&gt;). &lt;/p&gt;
&lt;p&gt;For SharePoint users, this brings up a few opportunities and issues. In a &lt;a href=&quot;http://www.cmswatch.com/Trends/1176-SharePoint-Conference:-Something-Old,-Something-New,-Something-Borrowed-and-Something-Blue&quot; target=&quot;_blank&quot;&gt;previous 
  blog post&lt;/a&gt; about the SharePoint conference, I highlighted the presentation 
  that FAST employees gave. This presentation showed nifty new &lt;a href=&quot;http://www.microsoft.com/silverlight/&quot; target=&quot;_blank&quot;&gt;Silverlight&lt;/a&gt;-enabled 
  search Web Parts. These Web Parts demonstrated several capabilities that FAST 
  brings to the SharePoint world, like: content spotlighting, multimedia search, 
  and taxonomy management. &lt;/p&gt;
&lt;p&gt;The last capability is one that I believe would be particularly interesting 
  for SharePoint users, since taxonomy management represents a challenging area 
  for most SharePoint implementations -- SharePoint taxonomies are very rigidly 
  based on physical structure of the SharePoint sites and leave little flexibility 
  for a more logical taxonomic structure (Redmond folks would argue that the content 
  query Web Part might help, but it's not a solution). That said, what Microsoft 
  hasn't really provided is good guidance on what users can expect to see and 
  any migration path between SharePoint search and FAST. Unfortunately, that really 
  hasn't changed: Microsoft hasn't released any roadmap for FAST's integration, 
  though you could argue it's still early.&lt;/p&gt;
&lt;p&gt;That aside, what could the FAST acquisition mean to SharePoint customers? Here 
  are a few of my thoughts:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The FAST acquisition was &lt;a href=&quot;http://www.microsoft.com/presspass/press/2008/apr08/04-25LervikPR.mspx&quot; target=&quot;_blank&quot;&gt;officially 
    completed today&lt;/a&gt;. This is a good thing, since Microsoft can now get down 
    to the business of integrating the company's technology with the rest of Microsoft's 
    products. However, it's not clear what, if any, impact this acquisition will 
    have in the very short term. Judging by the Groove acquisition, which preceded 
    the SharePoint 2007 release by some months, an acquisition after the release 
    will not yield any updates in the core product until SharePoint vNext (probably 
    around 2010 or so) and beyond. However, I'd be willing to bet that some elements 
    of the demo given at the SharePoint conference make their way to sites like 
    &lt;a href=&quot;http://www.codeplex.com&quot; target=&quot;_blank&quot;&gt;Codeplex&lt;/a&gt; or as free 
    downloads on &lt;a href=&quot;http://www.microsoft.com&quot; target=&quot;_blank&quot;&gt;Microsoft's 
    site&lt;/a&gt;.&lt;/li&gt;
  &lt;li&gt;The former CEO of FAST will assume the role of VP of Enterprise Search. 
    His responsibility will include all search products: &lt;a href=&quot;http://www.microsoft.com/sharepoint/capabilities/es/overview.mspx&quot; target=&quot;_blank&quot;&gt;SharePoint 
    search&lt;/a&gt;, &lt;a href=&quot;http://www.microsoft.com/enterprisesearch/serverproducts/searchserverexpress/default.aspx&quot; target=&quot;_blank&quot;&gt;Search 
    Server Express&lt;/a&gt; and FAST ESP. So I guess the &amp;quot;Enterprise Search&amp;quot; 
    moniker might need to be removed or rewritten on the SharePoint search page; 
    FAST is, by far, the new &amp;quot;enterprise&amp;quot; search product at Microsoft. 
    What's interesting here is that Microsoft has historically brought Office-related 
    technologies under one roof; just look at what happened to SharePoint specifically 
    -- that product was once its own product group. As they integrate FAST, it 
    would appear that this announcement suggests Microsoft might break out search 
    into its own dedicated team and make SharePoint a &amp;quot;customer.&amp;quot; This 
    opens up the possibility of decoupling SharePoint from any particular search 
    technology -- perhaps a pipe dream, but we can always hope.&lt;/li&gt;
  &lt;li&gt;In a blog entry on the &lt;a href=&quot;http://blogs.msdn.com/enterprisesearch/&quot; target=&quot;_blank&quot;&gt;Enterprise 
    Search blog&lt;/a&gt;, Microsoft stated that the FAST offering will &lt;a href=&quot;http://blogs.msdn.com/enterprisesearch/archive/2008/04/25/fast-tender-offer-complete.aspx&quot; target=&quot;_blank&quot;&gt;continue&lt;/a&gt; 
    its Linux and Unix support. The blog entry was quick to reinforce the message 
    that Microsoft does not want to support or wish to invest in Linux or Unix 
    solutions. While they would like to &quot;delight a core part of FAST's customer 
    base,&quot; they are openly hoping those customers will convert to Windows and 
    .NET. This does call into question whether this Linux/Unix support will be 
    long for the world. In the short term, however, Microsoft can boast a better 
    product from which to launch a play to be a real contender in the enterprise 
    search space; see our related blog entries and article on FAST &lt;a href=&quot;http://www.cmswatch.com/Search/Vendors/Fast%20Search%20&amp;amp;%20Transfer&quot; target=&quot;_blank&quot;&gt;here&lt;/a&gt;. 
    In the long term, Microsoft will have to come to grips with the fact that 
    enterprises will continue to leverage non-Windows technology and if Microsoft 
    wants to benefit from that revenue, they should consider continuing FAST's 
    support for those technologies (their recent &lt;a href=&quot;http://biz.yahoo.com/ap/080425/tech_roundup.html?.v=1&quot; target=&quot;_blank&quot;&gt;earnings 
    announcement&lt;/a&gt;, and the subdued guidance for the year, may reinforce that 
    message).&lt;/li&gt;
  &lt;li&gt;Microsoft won't support SharePoint on Linux &amp;lt;gasp!&amp;gt;. This is probably 
    not a surprise to anyone who is even vaguely familiar with Microsoft. However, 
    the fact that FAST (currently) is supported on Linux may introduce greater 
    content aggregation and, certainly, search capabilities within SharePoint. 
    Let's hope that Microsoft sees it that way.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In general, the FAST acquisition, for SharePoint, will likely have little impact 
  short term. Over the longer term, it's clear that the Office 14 version of SharePoint 
  will be substantially improved in the search area (depending on the SharePoint 
  product team's willingness to implement the new technology). I would personally 
  like to see some add-ons in the near term, since that would improve search within 
  organizations that may have both tools.&lt;/p&gt;
&lt;p&gt;Stay tuned for more on this topic as the integration progresses.</description>
         <link>http://www.cmswatch.com/Trends/1219-Thoughts-on-SharePoint-and-FAST-Search?source=RSS</link>
         <category>Enterprise Search</category>
         <author>shawn_shell@consejoinc.com(Shawn Shell)</author>
         <pubDate>Fri, 25 Apr 2008 18:11:00 -0400</pubDate>
      </item>

   </channel>
</rss>

