Digital Reasoning Connects with TeraDact
January 4, 2012
Big data analytics specialist Digital Reasoning has been a regular topic of discussion here at Beyond Search, most recently for achieving series B funding for a big data intelligence push.
Now, we would like to share an exciting new development in the quest to solve the big data problem in the news release “Digital Reasoning and TeraDact Partner to Automatically Remove Sensitive Information from Big Data.”
According to the article, TeraDact Solutions, a software tools and data integration solutions provider, has integrated their TeraDactor Information Identification and Presentation capabilities with Synthesys Cloud, a software-as-a-service data analytics solution.
The news story states:
In conjunction with Synthesys, TeraDactor can automatically assist in appropriately classifying information not recognized by the original data provider. TeraDactor allows participants to push and pull information without waiting for the declassification process, assuring that formerly classified documents may be released without unintended leakages.
The innovative technology that TeraDact Solutions brings to Digital Reasoning’s table demonstrates the power of Synthesys as a cloud-based data analytics tool in building the next generation of Big Data analytic solutions. Kudos to the surging Digital Reasoning organization.
Jasmine Ashton, January 4, 2012
Sponsored by Pandia.com
SharePoint Troubleshooting Videos for the Enterprise Weary
January 4, 2012
The GetThePoint Blog is highlighting a series of troubleshooting videos being offered by Microsoft regarding SharePoint and its well-publicized list of common pain points.
As the write-up expresses:
Microsoft’s Technical Readiness team has been building a collection of what they call ‘Break/Fix’ videos that address specific technical issues when using Office 365 . . . The Microsoft SharePoint End-user Content team is investigating creating similar quick videos that address specific pain points for SharePoint, and would apply to both O365 online and on-premises versions.
While we understand the need for online assistance, especially in terms of quick fixes, the real issue lies in the fact that so many of these fixes are needed in the first place. SharePoint has never promised to be a complete solution, an out-of-the-box application that needs no further tweaking. However, the extent to which SharePoint has to be customized and manipulated leads us to believe that a third-party solution might be a less painful enterprise option.
A third party solution worth a second look is Fabasoft Mindbreeze and its suite of solutions. Receiving the KM World Trendsetting Product Award for the fourth consecutive year in 2011, Mindbreeze is often lauded for its ability to be customized, but also extremely efficient out-of-the-box.
“Fabasoft Mindbreeze Appliance as a pre-packaged solution (hardware and software) offers a quick and easy way to enjoy a high-end enterprise search solution out-of-the-box. The product is ready to use within in a very short timeframe. ‘We make it easy for our customers. We deliver the ready-to-run appliance and configure it together with the customer via an online meeting. Fabasoft Mindbreeze Enterprise is then ready for use.’ – Daniel Fallmann describes some of the advantages of the solution.”
Fabasoft Mindbreeze works as a standalone enterprise solution, or can be used to enhance an existing SharePoint infrastructure through the Mindbreeze Connectors offerings.
Emily Rae Aldridge, January 4, 2012
Sponsored by Pandia.com
Creative Tip to Avoid Indexing in SharePoint Fast
January 4, 2012
At his Tech and Me blog, Mikael Svenson provides a unique search tip in “How to Prevent an Item from Being Indexed with FAST for SharePoint.” Keeping an item from being indexed in FAST using the meta data or text of a file has long been considered next to impossible. Svenson, however, has found a way, and that way is through profanity. Yes, you can use the Offensive Content Filter to your advantage. The article explains:
The thing about the offensive content filter is that it will prevent documents from being indexed if they contain a certain about of bad language. If you get embarrassed by such words, then skip reading 🙂 “So now we have a stage which can drop items, the rest is to assign enough bad words to ‘ocfcontribution’ to get above the threshold it triggers on.
See the write up for a detailed description of how to implement this creative approach.
Svenson notes one important caveat: if you have any documents containing profanity that you actually want to have indexed, this solution may backfire. Avoid difficulties by tapping the deep search expertise of Search Technologies.
Iain Fletcher, January 4, 2011
Sponsored by Pandia.com
Common Pain in SharePoint
January 3, 2012
SharePoint brings just as much pain as it does joy in many circumstances. But one does not have to look too hard to realize how common certain SharePoint pain points are in the enterprise realm. The solutions to these SharePoint issues can be just as painful as the issues themselves. Consultants and third-party firms do not always live up to their promises. A SharePoint blogger gives us his take in, “SharePoint – To the Pain!”
As he points out:
One of the pains many of us have experienced is the sting of inexperienced SharePoint ‘experts.’ These people are giving bad advice in their blogs. These people are in ‘expert’ SharePoint firms charging us an arm and a leg and doing bad work. It’s rampant, and I’ve experienced it as a full time employee working for a large corporation paying a HUGE amount for SharePoint ‘experts’ who were breaking things and had NO CLUE what they were doing.
We want to emphasize that not all SharePoint experts and consulting firms are bad. The opinion above is merely that of the blogger. However, a bit of caution is a good rule of thumb when choosing an outside party to assist in SharePoint solutions. SharePoint is not an out-of-the-box enterprise solution. In order to be effective and efficient, third party solutions should be added onto an existing SharePoint framework.
This is where we think an option like Fabasoft Mindbreeze is smart. Offering an entire suite of solutions, Mindbreeze is neither excessive in cost nor disappointing in performance. Their Connectors offerings allow Mindbreeze to work alongside SharePoint, Exchange, and a variety of other applications. The Mindbreeze enterprise solution can stand alone or improve an already existing enterprise infrastructure.
As the optimum search and information access solution, Fabasoft Mindbreeze Enterprise facilitates the comprehensive incorporation of all electronic data repositories. Data sources and storage systems are connected to Fabasoft Mindbreeze Enterprise via Fabasoft Mindbreeze Connectors.
While a bit of caution is necessary, there are good third party SharePoint solutions out there. Find one that works for you and enjoy the ease and efficiency that it can offer.
Emily Rae Aldridge, January 3, 2012
Sponsored by Pandia.com
Inteltrax: Top Stories, December 26 to December 30
January 2, 2012
Inteltrax, the data fusion and business intelligence information service, captured three key stories germane to search this week, specifically, ways in which some are misusing big data analytics in the market today.
One story, our feature this week, “Real Estate Market Missing Out on Analytic Help” detailed the many ways in which the housing market could be aided by analytics, but is not taking advantage of.
One of the most important stories we’ve written was “Consumer Analytics Not a Strong Investment” which helps analytic software buyers avoid limited programs that will be no help to them.
Finally, we focused on how a lot of cloud analytic offerings don’t make security and customer service a priority in our story: “Accountability Should Be Top Priority for Cloud Analytics.”
Usually, we focus on the uplifting, exciting side of this growing market. However, big data analytics also has its downside, which deserves some light. We try and keep our coverage balanced, in order to give our readers the best overview.
Follow the Inteltrax news stream by visiting www.inteltrax.com
Patrick Roland, Editor, Inteltrax.
January 02, 2012
New Book on using SharePoint 2010 and Silverlight
January 2, 2012
While it is now too late for Christmas, a new book out by Bob German and Paul Stubbs gets a positive review from Andrew Connell, the Microsoft Developer with a focus on SharePoint. SharePoint 2010 Development with Silverlight is discussed in Connell’s “Great Resource for SharePoint 2010 & Silverlight.”
Connell’s review:
Are you interested in looking for a good resource on doing SharePoint 2010 or Silverlight development with both tools? This is the book for you. Bob & Paul, the two best people to write this book with their deep experience in both technologies, have done a fantastic job in starting with primers on both technologies and then diving in deep on doing various things you’d want to accomplish with both SharePoint 2010 & Silverlight.
The book includes a discussion on both SharePoint 2010 and Silverlight development as well as guidance for setting up your environment and building custom applications. Connell does provide the forward to the new publication, but his knowledge on the topic points to this being a good resource for interested parties.
However, if you do not have the time to read right now while you’re focused on adding value and efficiency to your business information, rely on experts at Fabasoft Mindbreeze. Their suite of solutions provides intuitive access for the right people with the right information at the right time.
Here you can read about their capabilities:
Fabasoft Mindbreeze Enterprise offers unified information access to enterprise and cloud resources. It combines and links the information. Information can be grouped and classified. A user can scan the results using search tabs and categories and find a relevant document without having to click through a list of links. The information’s semantic context is recognized and depicted, and navigation elements for drill down are provided; one click and the preview of any result is available. Fabasoft Mindbreeze provides a 360 degree view of your business, customers, competitors, and more.
Find what works for you with Fabasoft Mindbreeze.
Philip West, January 2, 2011
Sponsored by Pandia.com
Kapow Releases Katalyst Version 8.2
January 2, 2012
Kapow Software moves in a new direction that is a bit of a surprise to us. EWeek reports, “Kapow Software Punches Out Update of Cloud-Based Analytics Service.” Kapow is positioning its Katalyst version 8.2 as a self-service, subscription model analytics tool with an intuitive user interface. It also boasts 100% data accuracy. According to the write up:
Katalyst 8.2 can organize, integrate and analyze data from streams as diverse as legacy, on-premise, social media, partner, B2B, competitor, e-commerce, blogs and news sites, as well as location-based and mobile data, [founder Stefan] Andreasen said. The Kapow service is one that speaks to both IT and line-of-business people at an enterprise, and thus can bring them together (when they most often work separately) to solve common research needs.
Headquartered in Palo Alto, CA, Kapow Software has offered innovative technology solutions for a decade. The company prides itself on bridging the divide between IT departments and business users. It now has over 500 customers worldwide but its heart remains in Copenhagen. Take your conceptual umbrella we suggest.
Cynthia Murrell, January 2, 2012
Sponsored by Pandia.com
60 Months, Minimal Search Progress
January 1, 2012
When I was writing the Enterprise Search Report, I was younger, less informed, and slightly more optimistic. I wrote in August 2005 “Recent Trends in Enterprise Search”:
The truth is that nothing associated with locating information is cheap, easy or fast.
I omitted one item: accurate. About five years after writing this sentence, I have come to my senses. The volume of information flushing through the “tubes” continues to increase. To explain what petabytes means to the average liberal arts major now working at a services firm, someone coined the phrase “big data.” Simple. Tidy. Inaccurate.
That’s why the notion of accurate information is on my mind. I am tough to motivate in general, and burro like when I have to admit that something I wrote in one of my addled states is incomplete, stupid, or just plain wrong.
Let me start the New Year correctly. Here are four observations which will probably annoy the “real” experts, the self appointed search mavens, and the failed middle school teachers now consulting in the fields of ontology, massive parallelization in virtual environments, and “big data.” I don’t plan to alter my rhetorical approach, so too bad about giving some of these rescued Burger King workers some respite. Won’t happen.
First observation: Even a person as wild-and-wonderful as Jason Calacanis, the much admired innovator who makes a retreating Russian army’s scorched earth policy look green, wants to limit Internet content. “Jason Calacanis: Blogging Is Dead & Why Stupid People Shouldn’t Write” captures his take on accuracy. If one assumes stupid people should not write, then one reason may be that stupid people produce inaccurate information. Sounds okay to me, so let’s go with the stupid angle. In the era of “big data”, trimming out the stupid people should result in higher value information. Keep in mind I am addled. I am not sure where to stand on the “stupid” thing.
.jpg)
Image source: http://www.northernsun.com/Boldly-Going-Nowhere-T-Shirt-(8257).html
Second observation: Disinformation is becoming easier for me to spot. For you? I am not so sure. Let me give you a couple of examples. Navigate to the now out of date list of taxonomy systems prepared by Will Power. The page is available from Willpower Information in Middlesex. Now scan the description of the taxonomy system called MTM. Here’s a snippet:
MTM is the software for multilingual thesauri building and maintenance. It has been designed as a configurable system assisting a user in creating concepts, linking them by means of a set of predefined relations, and controlling the validity of the thesaurus structure…
The main features of the software are inter alia:
- thesaurus maintenance and support system;
- KWOC and full tree representation and navigation tools available on-line;
- KWIC, KWOC and full tree printouts (in an alphabetic and systematic order);
- defining and customization of up to 100 conceptual relationship types;
- management of facets, codes (top classification), sources, regional variants, historical notes, etc.;
- support of the various types of authority files;
- computer assisted merging;
- thesauri comparison by means of windows;
- support of the various alphabets;
- support of linguistic and orthographic variants;
- sorting facilities consistent with national standards;
- variable length data handling;
- flexibility in defining input and output forms;
- versatility in terms of relative ease of configuring the software for the various sets of languages;
- flexibility in defining data structures needed for a given application;
- a possibility to exchange data with other organizations and systems through exporting and importing terms and relations.
Digital Reasoning: A New Generation of Big Data Tools
December 31, 2011
I read “Tool Detects Patterns Hidden in Vast Data Sets.” The Broad Institute’s online Web site reported that a group of researchers in the US and Israel “have developed a tool that can tackle large data sets in a way that no other software program can.”
What seems exciting to me is that the mathematical procedure which involves creating a space and grids into which certain discerned patterns are placed provides a fascinating potential enhancement to companies like ours–Digital Reasoning. Our proprietary methods have performed similar associative analytics in order to reduce the uncertainty associated with processing large flows of data and distilling meaningful relationships from them. Some day computers and associated systems will be able to cope with exabytes of data from the Internet of things. Today, the Broad Institute validates the next-generation numerical methods that its researchers, Digital Reasoning’s engineers, and a handful of other organizations have been exploring.
The technical information about the method, which is called MIC, shorthand for Maximal Information Coefficient, is available to members of the AAAS. To get a copy of the original paper and its mathematical exegesis you will want the full bibliographic information:
“Detecting Novel Associations in Large Data Sets” by David N. Reshef, Yakir A. Reshef, Hilary K. Finucane, Sharon R. Grossman, Gilean McVean, Peter J. Turnbaugh, Eric S. Lander, Michael Mitzenmacher, and Pardis C. Sabeti, Science, 16 December 2011, Volume. 334, Number 6062, pages 1518-1524.
The core of the authors work is:
Identifying interesting relationships between pairs of variables in large data sets is increasingly important. Here, we present a measure of dependence for two-variable relationships: the maximal information coefficient (MIC). MIC captures a wide range of associations both functional and not, and for functional relationships provides a score that roughly equals the coefficient of determination (R2) of the data relative to the regression function. MIC belongs to a larger class of maximal information-based nonparametric exploration (MINE) statistics for identifying and classifying relationships. We apply MIC and MINE to data sets in global health, gene expression, major-league baseball, and the human gut microbiota and identify known and novel relationships.
Digital Reasoning’s application of similar mathematical methods underpins our entity-oriented analytics. You can read more about our methods in our description of Synthesys, a platform for performing automated understanding of the meaning of Big Data in real time.
The significance of this paper is that it shines a spotlight on the increasing importance of research into applications of next-generation numerical methods. Public discussion of methods like MIC will serve to accelerate innovation and the diffusion of knowledge. At Digital Reasoning we see this as further evidence of the potential of algorithmic, unaided approaches like ours to achieve true “automated understanding” of all forms of text regardless of volume, velocity or variety. As we shift to IPv6, the “Internet of things” will dramatically increase the flows of real time data. With automobiles and consumer devices transmitting data continuously or on demand, the digital methods of 10 or five years ago fall short.
Three other consequences of MIC-style innovations will accrue:
First, at Digital Reasoning, we will be able to enhance our existing methods with the new insights, forming partnerships and investing in research to apply demonstrations to real world problems. The confidence SilverLake partners’ investment in Digital Reasoning has provided us with capital to extend our commercial system quickly and in new directions such as financial services, health care, legal, and other verticals.
Second, we see the MIC method fueling additional research into methods making Big Data more accessible and useful; that is, consumerize some applications without solutions. Big Data will eventually be part of a standard information process, not something discussed as “new” and “unusual.”
Third, greater awareness of the contribution of mathematics will, I believe, stimulate young men and women to make mathematics and statistics a career. With more talent entering the workforce, the pace of innovation and integration will accelerate. That’s good for many companies, not just Digital Reasoning.
Kudos to the MIC team. What’s next?
Tim Estes, December 31, 2011
Sponsored by Pandia.com
One Suggestion for a URL Shortening Solution in SharePoint
December 30, 2011
We know that communicating SharePoint links can get pretty painful when you copy and paste a link to a document from a Document Library buried deep in a hierarchal list of sites. URL shortening has eased a lot of this pain across a variety of platforms. Jan Tielens discusses URL shortening and provides a suggested solution in “URL Shortening for SharePoint 2010.” The author describes her solution:
So to make a long SharePoint URL short, you can copy the URL to the clipboard, go to a URL shortener, past the long link over there and copy the short URL you get in return back to the clipboard. Works perfectly, but there are quite some tedious steps to go through. Already a long time ago, when SharePoint 2007 was still the rage, I posted some code that automates all these steps. Finally I found some time to update the code to SharePoint 2010 and nicely package it in a Sandboxed Solution, so it works both for SharePoint 2010 deployed on premises as in the cloud on Office 365.
A handy tip for a pesky problem, no doubt. We’ve seen how short URLs allow for convenient messaging and sharing, like in the case with Twitter or Identi.ca. But URL shortening is a tedious process.
If you prefer to focus your time on tasks of greater importance, check out Fabasoft Mindbreeze. Over there, they really have text processing components in SharePoint down to an art. You can read more here at, “Information Pairing Makes Websites More Intelligent!” to learn about some of the benefits of their information pairing technology. “It smoothly integrates itself into your website so that the user doesn’t even realize that Cloud services are working in the background. Furthermore, InSite always knows what a user is interested in.” Their great deal of digital know-how takes convenience to a new level with mobility and maintenance-free capabilities; check out Fabasoft Mindbreeze.
Philip West, December 30, 2011
Sponsored by Pandia.com

