The Sound of Ontopia Silence

November 25, 2014

Ontopia has been silent since August 1, 2013. Prior to that outdated update, Ontopia used to share news three or four times a year. Ontopia was developed as a community for open source tools for building, maintaining, and deploying topic maps-based applications. Topic maps are knowledge structures that directly connect information to a source. The process is also are also called information mapping or mind mapping, which is a concept that has been played around with by many develops. An old Mashable article has a list: “Twenty Four Essential Mind Mapping And Brainstorming Tools.”

Perusing the Ontopia Web page leaves it in the throws of Web 1.0 and with only some features that could pass as a modern Web site. Even the product’s description, in all its simplicity, is dated:

“Ontopia is a set of tools which contains everything you need to build a full Topic Maps-based application. Using Ontopia you can design your ontology, populate the topic map manually and/or automatically, build the user interface, show graphical visualizations of the topic map, and much more.

The core of Ontopia is the engine, which stores and maintains the topic maps, and has an extensive Java API. On top of it are built a number of additional components, as shown in the diagram below. More information about these components can be found on the right.

Ontopia is 100% Java, and runs on any operating system which has Java 1.5. It is fully open source and can be used without any restrictions beyond those in the Apache 2.0 license.”

The last time Ontopia updated, they wrote a post about how version 5.3.0 was just released and the details were available on the wiki. Has Ontopia been in the sequestered in a closet working on the latest version or has it gained abandoned open source project?

Whitney Grace, November 25, 2014
Sponsored by ArnoldIT.com, developer of Augmentext

Elasticsearch Ups the Pressure on LucidWorks (Really?)

November 23, 2014

I am not too keen on videos. I prefer reading hard copies. I did find the video referenced in “Elasticsearch Uses Power of Community for Open Source Analytics” useful. My team and I are putting the finishing touches on a report that points out how enterprise search vendors have been leapfrogged by vendors rarely considered by mid tier consultants and the self appointed experts in search. The video drives home a simple point: Combining open source technologies delivers information access functions that are more useful to users than laundry lists, odd ball point and click suggested content, and confusing mash ups of information presented without context.

Why the reference to Lucid? One of the firm’s presidents had been involved with Jaspersoft, an open source analytics outfit. Despite this “inside track”, Elasticsearch has powered past Lucid, leaving that open source vendor struggling to reach parity with Elasticsearch. Elasticsearch itself faces challenges, but that’s the name of the game when keyword search is the keystone of a service. For now, Elasticsearch leaves competitors rushing to close the gap. By the way, this subject was the focal point of one of Dave Schubmehl’s IDC reports that surfed on my name. The juicy part about the “gap” was edited from my original write up. Nevertheless, the facts remain valid. Kudos to Elasticsearch.

Stephen E Arnold, November 23, 2014

Microsoft and Open Source: Exciting Marketing

November 15, 2014

I wonder who the wizards were who crafted the “news” that Microsoft was making Dot Net open source. I read what struck me as a reasonable view of Microsoft’s new open sourciness. Navigate to “.NET is NOT “Open Source”, But Microsoft’s Minions Shamelessly Openwash It Right Now.” Dig in. I noted this passage:

Microsoft is just so desperate to lock in developers, who are rapidly moving away to FOSS and saying goodbye to Windows because Android/Linux is on the rise.

This strikes me as a viewpoint that matches my own perception of the Metro-ized Microsoft. When will Fast Search become open source?

Stephen E Arnold, November 15, 2014

ElasticSearch How To: A Useful Case Example

October 21, 2014

If you want to avoid the hassle of some proprietary search engines, you may want to take a look at this case study about ElasticSearch. Navigate to “Building Scalable Search from Scratch with ElasticSearch.” The author works through his process for putting ElasticSearch to work in content space with a variety of information; for example, products, text collections, and user information.

What makes this write up useful is the logical layout of the article and the inclusion of a requirements summary, block diagrams, and code snippets.

This type of solid user support is one reason ElasticSearch is outpacing some open source search competitors like LucidWorks and Nutch.

Highly recommended. (As far as I can tell, no mid tier consulting firms has surfed on this content. Dave Schubmehl, this may be an opportunity.)

Stephen E Arnold, October 21, 2014

Open Source Search and Kicking the Bukkit

October 15, 2014

There is a presentation “Kicking the Bukkit: Anatomy of an Open Source Meltdown” by Ryan Michela, a developer with experience in open source. Over several years, a game open source project rose and fell. I am not too interested in open source games. At the end of the Slideshare document, there are five reasons an open source game project failed.

Let me summarize these and encourage you to work through he full 55 slide deck. How many of these issues may have an impact on open source search systems. Keep in mind that commercial enterprises like Attivio and IBM make use of open source technology.

  1. Inclusion of decompiled code in an open source project
  2. License issues
  3. Ties ups within the community before a project gains momentum
  4. No contributor license agreement
  5. Disgruntled developers in the community.

The presentation includes a quote that I noted:

It only takes one unhappy developer to kill an unprotected project.

Is there an open source search company vulnerable to one or more of these issues? I can name a couple. I wonder if the firm’s funding sources are concerned about their investment “kicking the bucket”?

Stephen E Arnold, October 15, 2014

Open Source Negativism and the Future of Open Source Documentation

October 14, 2014

The article on Linux Insider titled Dan Allen and Sarah White: Documentation Dearth Dooms Open Source Projects discusses the work of entrepreneurs Allen and White. The pair have focused on encouraging and aiding software developers in “superior documentation” for open source software. The article includes an interview with White and Allen explaining the function of their program, called Asciidoctor. Allen states in the interview,

“What we have done with Asciidoctor is make the documentation something of value. We do that by, number one, rewarding the writer. For most software developers of open source software, whatever documentation that is created gets published on the website. So we show the developer how the content looks on a Web page displayed in Asciidoctor. When the software developer sees how minor the content is, that triggers motivation to fill in the gaps.”

According to Sarah White, software developers have had a “stunning” response to the motivation to improve documentation (which includes, White notes, improvements to the homepage and to training materials.) Since their start in November, White claims that there has been a tremendous influx of clients interested in making the sort of improvements that White and Allen offer. In the future, White is particularly interested in ensuring that all documentation is integrated to render well on different types of devices, particularly mobile screens.

Chelsea Kerwin, October 14, 2014

Sponsored by ArnoldIT.com, developer of Augmentext

Open Source CRM Pulls in Open Source Search

October 8, 2014

I read “12 Open Source CRM Options.” I think of CRM as a synonym for customer experience or CRM as an easy way to suck down a top salesperson’s contact list when he or she heads to greener pastures. I know. I am shortsighted.

The write up surprised me because I did not know there were a dozen open source CRM solutions, components, or widgets. I assumed there were the big buck systems from Oracle and Salesforce.com. I was uninformed.

I had heard of SugarCRM because one of the proprietary search vendors supports the system. I had not heard of:

Vtiger (a variant of SugarCRM), SuiteCRM, Fat Free CRM, Odoo, Zurmo, EspoCRM, SplendidCRM, OpenCRX, X2Engfine, Concourse Suite, or CentraView.

Well, there you.

My reaction to this basket of “suites” is that search is going to be part of the offering. When the baked in solution falls short, then the licensees will look for more robust solutions. For me, that means taking a look at the open source search solutions. ElasticSearch and Sphinx Search come to mind, but there are others.

I would not be too keen to license one of the proprietary search systems for three reasons:

  1. Try open source and if it works, the money can be used for other things. Raises or hiring a tastier consultant
  2. There are satisfactory information retrieval solutions that run from the cloud, on premises, or in a hybrid mode
  3. The hassles of integrating an open source and a proprietary system can be sidestepped. Integration is never a walk in the park, but it seems that open source begets open source.

Stephen E Arnold, October 8, 2014

ElasticSearch. Version 1.3.3 Available

October 6, 2014

You can download the most recent version of ElasticSearch via the link in the ElasticSearch blog. Navigate to http://bit.ly/1uxnOfN and click the download button. Changes include a fix for shard recovery and corruption occurring when a licensee upgrades an old index.

Stephen E Arnold, October 1, 2014

Why Good Enough Is the New Norm in Search

September 29, 2014

Navigate to “Postgres Full Text Search Is Good Enough.” I first heard this argument at a German information technology conference a few years ago. The idea is surprisingly easy to understand. As long as a user can bang in a couple of key words, scan a result list, and locate information that the user finds helpful—job done. The search results may consist of flawed or manipulated information. The search results may be off point for the user’s query when evaluated by old fashioned methods such as precision and recall. The user may be dumb and relies on what the user finds accurate.

Whatever.

This write up explains the good enough approach in terms of PostgreSQL, a useful open source Codd type data management system. Please, note. I am not uncomfortable with good enough search. I understand that when the herd stampedes, it is not particularly easy to stop the run. Prudence suggests that one take cover.

Here’s the guts of the write up:

What do I mean by ‘good enough’? I mean a search engine with the following features:

  • Stemming
  • Ranking / Boost
  • Support Multiple languages
  • Fuzzy search for misspelling
  • Accent support

Luckily PostgreSQL supports all these features.

The write up contains some useful code snippets to make use of search features. The discussion of full text search is coherent and addresses a vast swath of content. Note that proprietary vendors have tilled acres of marketing earth and fertilizer to convert search into a mind boggling range of functions.

This article includes code snippets to tackle full text within PostgreSQL.

Querying is included as well. Again, code snippets are included. (My teenage advisors said, “Very useful snippets.” Okay. Good.

The write up concludes:

We have seen how to build a decent multi-language search engine based on a non-trivial document. This article is only an overview but it should give you enough background and examples to get you started with your own….Postgres is not as advanced as ElasticSearch and SOLR but these two are dedicated full-text search tools whereas full-text search is only a feature of PostgreSQL and a pretty good one

Reasonable observation. Worth reading.

If you are a vendor of proprietary search technology, there will be more individuals infused with the sprit of open source, not fewer. How many experts are there for proprietary systems? Fewer than the cadres of open source volk I surmise.

Stephen E Arnold, September 29, 2014

Tibco: Will It Regain Its Momentum?

September 29, 2014

I read “Tibco Sells Out to Private Equity in $4.3bn Deal with Vista Equity Partners.” I found Tibco interesting when I saw the servers used to power Yahoo News a number of years ago. The company is now owned by accountants and MBAs. I learned in the write up:

Tibco was founded in 1997 by its current chairman and CEO Vivek Ranadive. It was a pioneer of message-oriented middleware, particularly for the financial sector, which enables information to be pushed to multiple recipients at precisely the same time. However, Tibco’s expensive high-end proprietary software is under attack from open source in the form of the Advanced Message Queuing Protocol (AMQP), which promises not just lower-cost message queuing software, but also inter-operability between different vendors’ implementations of the open-source standard.

My recollection is that Tibco’s “information bus” made some of the old line outfits uncomfortable. Perhaps IBM? If the write up is accurate, open source is claiming a proprietary vendor.

How long will proprietary enterprise search vendors be able to keep the open source predators away? If the financial market gets the willies, the collapse of over hyped proprietary systems are likely to face high seas. Some swimmers drown in rough water even though the marketers insist the sun is shining.

Stephen E Arnold, September 29, 2014

« Previous PageNext Page »

  • Archives

  • Recent Posts

  • Meta