SharePoint Server 2013 Preview

August 13, 2012

Microsoft posted two documents which we believe merit any SharePoint licensee’s attention. The principal features of the latest SharePoint appear on the Microsoft SharePoint site.

Search will be particularly important because SharePoint 2013 will make it easier to incorporate social content and support mobile access. The new SharePoint will be available later this year or early in 2013. Getting a head start is important if you plan to upgrade.

The SharePoint Server 2013’s enterprise search model provides information we found quite useful. The diagram’s PDF is 560 Kb and available from the Microsoft download center. The PDF covers:

  • Search Components, including the application components and the search databases
  • Example topologies. The illustrated use case is a medium-sized search farm with 40 million items or content objects in the system
  • Scaling out. The diagram includes a proposal model for a search farm which handles 100 million item or content objects.

Of particular value are the details for the hardware required to support the 100 million item farm. A series of tables covers the scaling considerations, detail about the application servers recommended, and a table layout the hardware requirements necessary to handle upticks in the volume of content to be processed.

In the general guidance section, Microsoft points out that one additional crawl database is needed per additional 20 million items. One link database is recommend per additional 60 million items. The schematic’s detail recommends that the system include redundancy.

Bottom line, there is no mistaking the Fast-like functionality described here. Search Technologies has delivered more than 30,000 consultant-days of search implementation services to Fast and SharePoint users since 2005. We believe that this new search functionality will be widely adopted over the next few years, and we look forward to helping our customers to implement it.

Iain Fletcher, August 13, 2012

Sponsored by Augmentext

Open Source Solutions Continue to Gain Popularity

August 13, 2012

The H Open recently reported on some new developments for the open source search, discovery and analytics company Lucid Imagination in the article, “Lucid Imagination Becomes LucidWorks.”

According to the article, after continuously having customers confuse the name of the company with its flagship product, Lucid Imagination decided to go along with the customers perceptions and change its name to LucidWorks as means of avoiding further complicating branding efforts.

In addition its two product lines: LucidWorks Search and LucidWorks Big Data, both of which draw from open source products, the company has some additional plans on the horizon:

“LucidWorks has also announced that, in September, it will be setting up a community site called SearchHub.org, which will be oriented at developers. It is planned that this will include a blog from Lucene/Solr committers; of the 35 committers on the project, nine work for LucidWorks. Other planned features include video tutorials, podcasts, a community forum, up-to-date information on Lucene/Solr, and a calendar of enterprise search related hackathons and meetups.”

LucidWorks is an example of a company that has created an enterprise-grade embedded search development solution built on the power of the Apache Lucene/Solr open source search project. As technology continues to advance, companies that utilize open source technology are going to have an edge over their competition.

Jasmine Ashton, August 13, 2012

Sponsored by ArnoldIT.com, developer of Augmentext

Neofonie Technology Underpins Labdoo

August 10, 2012

Neofonie GmbH, based in Berlin, Germany, is a long-term player in search; the company has been in the market since 1998 and created the early German search engine fireball.de. Their technology is now being used at Labdoo.org, home base for the Labdoo project, a 501(c)(3) organization. The project’s About page explains its goals:

“A laptop is a door to education, providing children free access to open source education tools and electronic books through the Internet.

“In the richer countries, every year more than a hundred million laptops are replaced by new ones. This number continues to increase, yet most of the children in the poor regions of the world still lack access to education.

“The goal of Labdoo is to use grassroots, decentralized, social networking tools to efficiently bring excess laptops to the children in the developing world without wasting additional Earth resources.

“Join Labdoo and use the social network tools to bring a laptop to a child!”

A worthy cause, to be sure. Though the project won’t be officially launched until early next year, its Web site is up and running. The organization encourages visitors to use its tools to build their own “mini-missions and hubs.” Doing so, it emphasizes, will help further the development of their platform.

Neofonie began as an offshoot of the Technical University of Berlin. They make it a point to meet, and to innovate beyond, market demands. The company produces enterprise search as well as portal and vertical search products for both Web solutions and mobile apps.

Cynthia Murrell, August 10, 2012

Sponsored by ArnoldIT.com, developer of Augmentext

Will New CEOs Knock Out Search Revenue Pains?

August 8, 2012

Two search vendors have recently announced shuffling in their top management positions. The first vendor – Perfect Search. The enterprise search technology vendor replaced its President and CEO of five years last May 2012. George Watanabe, co-founder and VP of Business Development and Investor Relations filled in the empty seat.

The second vendor – MarkLogic. Also in May 2012, “Gary Bloom Joins MarkLogic as Chief Executive Officer”:

“MarkLogic Corporation, the company powering mission-critical Big Data Applications around the world, today announced that Gary Bloom… has been named president and chief executive officer.

Gary brings an exceptional background that includes more than two decades of successful leadership in enterprise software. He was the CEO and president at eMeter, which provides smart grid management software for electric, gas, and water utilities… Prior to that, Gary was a consultant of TPG, a leading global private investment firm.”

Both Watanabe and Bloom have the potential to further pave the way for success of their respective companies, given their impressive technical background in enterprise applications. But both companies have joined the bandwagon and just lately added Big Data to their focus. We’re waiting to see if the move to change executives will tip the enterprise search and Big Data scales in their favor.

Lauren Llamanzares, August XX, 2012

Sponsored by ArnoldIT.com, developer of Augmentext

Track the Output of SharePoint Fast Search Crawl Logs

August 7, 2012

Do you need to pull SharePoint Fast Search crawl logs? We do. We read with interest an item on Microsoft’s TechNet Web site. “Get SharePoint Search Crawl Logs” provides an almost ready-to-run script which will accept a search service name and display the associated crawl logs. If there is a crawl log with an error, the script flags that instance. To script can be edited so that it returns different information from the crawly logs. In order to make this tweak, the $crawlLogFilters can be edited.

SharePoint Fast usually does an excellent job of processing content. However, some documents can be malformed or an unexpected network issue can arise. As a result, certain content can be skipped or ignored. A visual inspection of crawl logs is not practical when SharePoint is processing large volumes of content.

If you want to view the crawl logs, TechNet provides a wealth of information. A good place to begin your investigation is in the TechNet Library. If you want to expOrt the SharePoint 2010 search crawl logs, you will find a useful Powershell script in Dave Mc’s Blog in the article “Export the SharePoint 2010 Search Crawl Log.” MSDN also provides information about exporting SharePoint 2010 search crawl logs. To access this information, navigate to the SharePoint Escalation Team’s blog.

Search Technologies’ team of experienced engineers can provide automation tools which eliminate the need to search for solutions to common problems. To learn more about our SharePoint and FFast Search implementation services, navigate to http://www.searchtechnologies.com/microsoft-search.html or contact us at info@searchtechnologies.com.

Iain Fletcher, August 7, 2012

Sponsored by Augmentext

Research and Development Innovation: A New Study from a Search Vendor

August 3, 2012

I received message from LinkedIn about a news item called “What Are the Keys to Innovation in R&D?” I followed the links and learned that the “study” was sponsored by Coveo, a search vendor based in Canada. You can access similar information about the study by navigating to the blog post “New Study: The Keys to Innovation for R&D Organizations – Their Own, Unused Knowledge.” (You will also want to reference the news release about the study as well. It is on the Coveo News and Events page.

Engineers need access to the drawings and those data behind the component or subsystem manufactured by their employer. Text based search systems cannot handle this type of specialized data without some additional work or the use of third party systems. A happy quack to PRLog: http://www.prlog.org/10416296-mechanical-design-drawing-services.jpg

The main of the study, as I interpret it, is marketing Coveo as a tool to facilitate knowledge management. Even though I write a monthly column for the print and online publication KMWorld, I do not have a definition of knowledge management with which I am comfortable. The years I spent at Booz, Allen & Hamilton taught me that management is darned tough to define. Management as a practice is even more difficult to do well. Managing research and development is one of the more difficult tasks a CEO must handle. Not even Google has an answer. Google is now buying companies to have a future, not inventing its future with existing staff.

The unhappy state of many search and content processing companies is evidence that those with technological expertise may not be able to generate consistent and growing revenues. Innovation in search has become a matter of jazzing up interfaces and turning up the marketing volume. The $10 billion paid for Autonomy, the top dog in the search and content processing space, triggered grousing by Hewlett Packard’s top executives. Disappointing revenues may have contributed to the departure of some high profile Autonomy Corporation executives. Not even the HP way can make traditional search technology pay off as expected, hoped, and needed. Search vendors are having a tough time growing fast enough to stay ahead of spiking technical and support costs.

When I studied for a year at the Jesuit-run Duquesne University, I encountered Dr. Frances J. Chivers. The venerable PhD was an expert in epistemology with a deep appreciation for the lively St. Augustine and the comedian Johann Gottlieb Fichte. I was indexing medieval Latin sermons. I had to take “required” courses in “knowledge.” In the mid 1960s, there were not too many computer science departments in the text indexing game, so I assume that Duquesne’s administrators believed that sticking me in the epistemology track would improve the performance of my mainframe indexing software. Well, let me tell you: Knowledge is a tough nut to crack.

Now you can appreciate my consternation when the two words are juxtaposed and used by search vendors to sell indexing. Dr. Chivers did not have a clue about what I was doing and why. I tried to avoid getting involved in discussions that referenced existentialism, hermeneutics, and related subjects. Hey, I liked the indexing thing and the grant money. To this day, I avoid talking about knowledge.

Selected Findings

Back to the study. Coveo reports:

We recently polled R&D teams about how they use and share innovation across offices and departments, and the challenges they face in doing so.  Because R&D is a primary creator and consumer of knowledge, these organizations should be a model for how to utilize and share it. However, as we’ve seen in the demand for our intelligent indexing technology, and as revealed in the study, we found that R&D teams are more apt to duplicate work, lose knowledge and operate in soloed, “tribal” environments where information isn’t shared and experts can’t be found.  This creates a huge opportunity for those who get it right—to out-innovate and out-perform their competition.

The question I raised to myself was, “How were the responses from Twitter verified as coming from qualified respondents?” And, “How many engineers with professional licenses versus individuals who like Yahoo’s former president just arbitrarily awarded themselves a particular certification were in the study?” Also, “What statistical tests were applied to the results to validate the the data met textbook-recommended margins of error?”

I may have the answers to these questions in the source documents. I have written about “number shaping” at some of the firms with which I have worked, and I have addressed the issue more directly in my opt in, personal news service Honk. (Honk, a free weekly newsletter, is a no-holds-barred look at one hot topic in search and content processing. Those with a propensity to high blood pressure should not subscribe.)

Read more

Maximizing the Value in High-Value Engineering Content

July 31, 2012

I read with interest the article “How Inforbix Extends the Value of PDM/PLM Systems.” For companies which create engineering drawings and other types of rich media, the cost and effort required to convert an idea into a digital object can be considerable. In my experience, most non-technical professionals do not understand the effort required to move from an idea into a digital file which, in turn, makes it possible to manufacture a part or an assembly. Not surprisingly, most enterprise information management systems do not handle engineering diagrams, CAD objects, and 3D data files particularly well. In fact, most findability systems ignore these content objects.

In an increasingly competitive world, ignoring high value engineering content can be an expensive mistake. The Inforbix article asserts:

At Inforbix we’ve been working on ways of giving people easy and simple access to data within their PDM/PLM system.  That way, anyone in the company, without any special skills or training can get at the data they need within their PDM/PLM system without using touching the actual system.

Professional systems used to produce a CAD object, for example, require training to use. Unlike a Web page or a mainstream office application like Microsoft Word, the interfaces and methods of performing basic tasks such as opening a project file may not be obvious.

Many non engineers do require access to specialized engineering data, information about a component, and data about suppliers. These types of information may not reside within an enterprise search system. If some of the data are present, those items may not be indexed by the project under which the items are organized.

The article continues:

Inforbix lets anyone in a company with a PDM/PLM system access data therein without touching, moving, or interacting with the actual PDM/PLM system and data. By giving anyone in a company the ability to gather and expose data within PDM/PLM systems.

If your organization wants to maintain and grow the value of its high value content, an easy to use findability system is necessary. In our search work, we know the value of the Inforbix system. You can get more information at www.inforbix.com.

Stephen E Arnold, July 31, 2012

Sponsored by HighGainBlog.com

Microsoft and Yammer: Extending SharePoint Functionality

July 31, 2012

Yammer is what an enterprise social network tool; organizations implement it to spur collaboration between users. On the Yammer homepage we found a new application which permits Microsoft SharePoint Integration. After reading the specs, we found on the Microsoft blog about “Yammer-The Next Step for Social Networking In Schools?”

According to the post, Microsoft recently purchased Yammer. The post explains Yammer’s basic functions, the dashboard mirrors Facebook’s design with hints of Twitter. The post digs into how Yammer would be used in schools, basically the same way it would for any company: staff would use to communicate between departments, share content, etc. It can also be a boon for students too:

“We know that group work is a great way to encourage students to engage with their peers, but this isn’t easy when they all use different social networks, clouds and systems. By joining Yammer, students can create secure groups via which they can communicate their ideas, ask questions and share files, as well as allowing for their competitive side to come out through ‘Leaderboards’, which show data about who has received the most likes, replies and much.”

Students can perform group work, receive studying help, share content, and even praise each other within Yammer. While it can be a tool of food for students, it can also make cheating and plagiarism easier if not monitored. Yammer should install an app that will be able to detect plagiarism.

The surge of interest in social content is growing in government agencies, commercial organizations, and educational institutions. However, indexing and making this content
findable can be a challenging task. The tools an organization uses require tight integration with
a search system. Mindbreeze provides capabilities to make social content easily findable within a SharePoint environment. A Yammer style can enhance productivity. Mindbreeze offers a range of social and collaborative features and has the engineering expertise to resolve almost any search and retrieval issue. Check out the Mindbreeze social collaboration Web page for more information.

Whitney Grace, July 31, 2012

Sponsored by Mindbreeze

Governance Listed as Top Challenge in Enterprise

July 25, 2012

A new Gartner report shows that more SharePoint users are failing in the category of governance than in any other.  SharePoint Joel gives his opinion on governance in, “Governance the #1 Challenge to Managing SharePoint.”

SharePoint Joel begins:

According to Gartner 66% of SharePoint Customers do not believe they have adequate governance. (Based on a recent Gartner survey) Governance can be defined as a set of defined roles, responsibilities, policies, and procedures that will help your company to proactively manage your information technology resources in a way that maximizes business value.

But governance does not have to be an ongoing struggle or an agonizing process.  Third party solutions like Fabasoft Mindbreeze Enterprise can work with an existing SharePoint installation to add efficiency and functionality.  Fabasoft Folio Connector works with Fabasoft Mindbreeze Enterprise to specifically address the issue of governance.

Fabasoft Folio is the standard software product for Enterprise Content Management, Collaboration, Compliance Management, agile Business Processes and Information Governance. The solution provides uniform, reliable and controlled management of digital content in the enterprise. Fabasoft Mindbreeze Enterprise links Fabasoft Folio for uniform enterprise-wide information access.

Do not continue to struggle with governance issues, allow Fabasoft Mindbreeze and their suite of smart solutions to ease your enterprise troubles.

Emily Rae Aldridge, July 25, 2012

Sponsored by Pandia.com

IDC Open Source Search Reports Announced

July 23, 2012

IDC has released the first of a series of analyses of open source search vendors. The subject of the report is LucidWords Platform. Lucid Imagination has become one of the key open source search vendors. Data for the IDC “situation overview, future outlook, and essential guidance” is a result of a painstaking process. The IDC research team interviewed principals of Lucid Imagination, conducted a technical analysis of the Lucid technology platform, and used a range of data analysis methods to pinpoint key information from open source content. In addition to detailed, jargon-free information about the Lucid Lucene/Solr approach, the report provides an unvarnished analysis of the firm’s business model.

masthead

Order the full report at tp://www.idc.com/getdoc.jsp?containerId=236086.

One of the important facts uncovered in the course of the research is the strong uptake of Lucid technology in specific market sectors. Also, Lucid, unlike some proprietary and other open source search vendors, has strong venture backing, revenue growth, and a full-time professional open source search technology team. Each of these issues is explored in the IDC report number 236086. You can get additional information about the for-fee report from IDC’s “Get Doc” online service.

The team working on this project included Sue Feldman, who specializes in research on information access technologies including, including search engines, text analytics, categorization, unified access to structured and unstructured information, Big Data, visualization, and rich media search.  Her research analyzes the trends and dynamics of the search and discovery software market and also quantifies the costs of information work to the organization. Ms. Feldman won IDC’s James Peacock Research award for her work on modeling and forecasting the search and retrieval technology markets, and an Innovation Award from IDC in 2007 for developing a new research program on the digital marketplace. She is a frequent speaker at industry events, and has won several national and international awards for her writing.  She wrote the chapter on search engines for the 1999 volume of the Encyclopedia of Library and Information Science and was the first editor of the IEEE Computer Society’s Digital Library News.  She is currently writing a book, The Answer Machine concerning the future of technology for information access.  Before coming to IDC in 2000, Ms. Feldman was President for twenty years of Datasearch, an independent technology consulting firm, where she consulted on new retrieval technologies such as natural language processing, search engines, usability of online systems, and digital libraries.

Read more

« Previous PageNext Page »

  • Archives

  • Recent Posts

  • Meta