Dow Jones: Fake News As a Training Error

October 11, 2017

In the dead tree edition of the Wall Street Journal, I read an interesting but all too brief article; to wit: “Dow Jones Publishes Errant Headlines in Systems Snafu.” The main point is that Dow Jones pushed out “nearly 2,000 dummy headlines and articles.” The company, of course, is sorry, very sorry. The “false headlines” were disappeared. The small item on page B 5 at the bottom of the page of newsprint included this statement on October 11, 2017:

I take today’s inadvertent and erroneous publication of testing materials extremely seriously.

Fake news. Nah, just a digital flub from the proud Murdoch outfit. Mistakes happen. Perhaps the Dow Jones engine will factor in this human response when it next excoriates Silicon Valley outfits who stub their toes.

Oh, if you are looking for the story online, you have to search Google News for “fake news” and follow the links to everyone except the Wall Street Journal. Google does point to this item on the dowjones.com Web site. The publicist does not include the mea culpa, which I find interesting.

Stephen E Arnold, October 11, 2017

Written by Stephen E. Arnold · Filed Under News, Publishing | 3 Comments

Medieval Thoughts in a Mobile Smart Bubble

October 6, 2017

I read two articles this morning when the recalcitrant Vodaphone network finally decided that resolving links from Siena, Italy, was okay today. Yesterday the zippy technology did not work as Sillycon Valley wizards and “real” journalists expect.

The first write up is one of those “newspapers should be run by “real” journalists operating from a rock-solid, independent position as gatekeepers of the “truth.” You can draw your own conclusion about this “real” journalistic cartwheel by reading “If Journalists Take Sides, Who Will Speak Truth to Power?”

I noted this passage:

The essential argument was recently laid out by an outlet called 888.hu: “The international media, with a few exceptions, generally write bad things about the government because a small minority with great media influence does everything to tarnish the reputation of Hungary in front of the world – prestige that has been built over hundreds of years by patriots.”

The “real” Guardian newspaper presents opinion and news by blending observations, mixed sources, and “news.” Technology, zeros and ones, facts experts accept in order to win a grant, get tenure, or prove merit.

Navigate to “The Seven Deadly Sins of AI Predictions.”

Your are correct: medievalism meets “real” journalism. The argument in this “real” hard technology write up is that baloney, hoohah, and sci-fi has made “articiiial intelligence” into today’s boogeyman.

Chill out because those touting smart software and those who are afraid that a “real” Terminator will jump out of a police flying patrol car with Robocop are are coming to your city, village, or mud hut.

As readers of Beyond Search will be able to verify, I have poked fun at Technology Review for recycling the Watson confection with little or no critical analysis. I have also had a merry time commenting about the disconnect between the monopolistic systems which define “facts” and the old school journalists who flop between infatuation and odd ball criticism of the services which have captured their attention.

The reality is that artificial intelligence has been taking baby steps for decades. Computing power, data, and well-known numerical recipes can be combined to permit marketers to do what they have been doing for many years: Identify what’s hot and deliver more of that hotness in order to generate money via ads or provide services for which companies and governments will pay.

The notion that technology generates hyperbole is the stuff of entrepreneurs’ dreams. Today’s smart software is little more than making available some of the less crazy ideas from Star Trek.

Let me cite an example from “Seven Deadly”:

machine learning is very brittle, and it requires lots of preparation by human researchers or engineers, special-purpose coding, special-purpose sets of training data, and a custom learning structure for each new problem domain.

I am interested in watcching people struggle to make an app for adding ringtons to an Android mobile phone work. I am interested in watching people struggle with laptops which combine a keyboard and a touchscreen. I am interested in the conflation of news, opinion, facts, “weaponized” information, shaped data to sell ads, and online services providing a user what the user “really wants.”

AI raises some interesting challenges. First, for those “real” newspapers and magazines, I hope that more criticcal thinking is applied to the “real” story. I hope that regulators do more than flop around like a fish dumped on the dock. I hope that smart software can remediate some of the problems humans seem to be manufacturing with more efficiency than Kia implements on its assembly lines.

What’s the “truth” in the Guardian “real” news story, opinion, blog quoting write up. What’s the path forward for a champion of IBM Watson and the richly funded MTI IBM AI lab?

These are big issues. Digital Svanarola’s? Maybe not.

Stephen E Arnold, October 6, 2017

Written by Stephen E. Arnold · Filed Under AI, Business strategy, News, Publishing | Comments Off on Medieval Thoughts in a Mobile Smart Bubble

Google-Publishers Partnership Chases True News

September 22, 2017

It appears as though Google is taking the issue of false information, and perhaps even their role in its perpetuation, seriously; The Drum reveals, “Google Says it Wants to Fund the News, Not Fake It.” Reporters Jessica Goodfellow and Ronan Shields spoke with Google’s Madhav Chinnappa to discuss the Digital News Initiative (DNI), which was established in 2015. The initiative, a project on which Google is working with European news publishers, aims to leverage technology in support of good journalism. As it turns out, Wikipedia’s process suggests an approach; having discussed the “collaborative content” model with Chinnappa, the journalists write:

To this point, he also discusses DNI’s support of Wikitribune, asserting that it and Wikipedia are ‘absolutely incredible and misunderstood,’ pointing out the diligence that goes into its editing and review process, despite its decentralized means of doing so. The Wikitribune project tries to take some of this spirit of Wikipedia and apply this to news, adds Chinnappa. He further explains that [Wikipedia & Wikitribune] founder Jimmy Wales’ opinion is that the mainstream model of professional online publishing, whereby the ‘journalist writes the article and you’ve got a comment section at the bottom and it’s filled with crazy people saying crazy things’, is flawed. He [Wales] believes that’s not a healthy model. What Wikitribune wants to do is actually have a more rounded model where you have the professional journalist and then you have people contributing as well and there’s a more open and even dialogue around that,’ he adds. ‘If it succeeds? I don’t know. But I think it’s about enabling experimentation and I think that’s going to be a really interesting one.’

Yes, experimentation is important to the DNI’s approach. Chinnappa believes technical tools will be key to verifying content accuracy. He also sees a reason to be hopeful about the future of journalism—amid fears that technology will eventually replace reporters, he suggests such tools, instead, will free journalists from the time-consuming task of checking facts. Perhaps; but will they work to stem the tide of false propaganda?

Cynthia Murrell, September 22, 2017

Written by Stephen E. Arnold · Filed Under Content processing, Google, News, Publishing | 1 Comment

Old School Publishing: On the Ropes?

September 18, 2017

If you are interested professional publishing, you will want to read “We’ve Failed: Pirate Black Open Access Is Trumping Green and Gold and We Must Change Our Approach.” The “colorful” metaphors aside, there are some interesting statements in the article, which is available online without a fee.

I noted this passage:

Not for the first time, pirates are delivering where the established players and legal channels are not.

I also highlighted this idea for professional publishers:

What if, like the airline industry, publishers unbundled their product and started to test the value of some of the elements that form the bundle?

Please, read the full article, which is free I wish to reiterate, and think about the business decisions companies dependent on the business model for professional information services.

There’s nothing like an uncomfortable coach class seat.

Stephen E Arnold, September 18, 2017

Written by Stephen E. Arnold · Filed Under Business strategy, News, Publishing | Comments Off on Old School Publishing: On the Ropes?

Bing and Google: The News Battle

September 15, 2017

I read “Bing Battles Google News with Its Own Make-Over.” I noted the alliteration: Bing battle. I immediately thought, “Google Gropes.” Both of these companies are trying to reinvent the newspaper using zeros and ones, not dead trees. Let’s look at some of the points I highlighted:

I noted this statement everyone’s most lovable online ad vendor:

Google redesigned their desktop Google News website. Their [sic] new UI has a clean and uncluttered look.

Microsoft responded. I circled this statement:

Microsoft recently updated their Bing News experience that will help users in finding the most up to date and well-rounded information.

Note that the pivot of both sentences is a subjective assertion: “Clean and uncluttered” for the GOOG, and “most up to date and well rounded.”

Some facts would be useful. I am not sure what “clean” or “uncluttered” means. My recollection is that Einstein’s desk like most “dead tree” newspapers are organized in an eclectic manner. Facts supporting these assertions might be difficult to conjure.

The “most up to date” statement should be easy to back up. What’s the latency of the system? The superlative “most” means that Bing is the top dog in news. Hmmm. I don’t buy this.

My point is that the write up provides a useful idea: Neither Bing nor Google has figured out how to present “news” to each system’s online users. The implicit idea is that “dead tree” methods are of little use. Inspiration comes from each system’s response to what the other system does.

Cold War methods applied to online “news”? That’s what the write signals me.

Let’s step back.

Online users have different reasons for wanting news. Some folks chase sports, which as I recall was the most read section of the “dead tree” newspaper company at which I once worked. Other people have quite different reasons for scanning the news; for example, there are some who read the obituaries, others seek cartoons, and others want the latest on the real housewives.

Bing and Google have to figure out how to meet these diverse needs because the “dead tree” crowd has fallen in the forest.

The write up tells me one thing: Neither Google nor Microsoft has any idea about reinventing what “dead tree” newspapers used to do.

Now what? Shape the news to fit what each company’s filters “decide” is “real news”?

Stephen E Arnold, September 15, 2017

Written by Stephen E. Arnold · Filed Under Google, Microsoft, News, Online (general), Publishing | 2 Comments

A New and Improved Content Delivery System

September 7, 2017

Personalized content and delivery is the name of the game in PRWEB’s, “Flatirons Solutions Launches XML DITA Dynamic Content Delivery Solutions.” Flatirons Solutions is a leading XML-based publishing and content management company and they recently released their Dynamic Content Delivery Solution. The Dynamic Content Delivery Solution uses XML-based technology will allow enterprises to receive more personalized content. It is advertised that it will reduce publishing and support costs. The new solution is built with the Mark Logic Server.

By partnering with Mark Logic and incorporating their industry-leading XML content server, the solution conducts powerful queries, indexing, and personalization against large collections of DITA topics. For our clients, this provides immediate access to relevant information, while producing cost savings in technical support, and in content production, maintenance, review and publishing. So whether they are producing sales, marketing, technical, training or help documentation, clients can step up to a new level of content delivery while simultaneously improving their bottom line.

The Dynamic Content Delivery Solution is designed for government agencies and enterprises that publish XML content to various platforms and formats. Mark Logic is touted as a powerful tool to pool content from different sources, repurpose it, and deliver it to different channels.

MarkLogic finds success in its core use case: slicing and dicing for publishing. It is back to the basics for them.

Whitney Grace, September 7, 2017

—

Written by Stephen E. Arnold · Filed Under Content processing, Marketing, News, Publishing | Comments Off on A New and Improved Content Delivery System

Academic Publication Rights Cause European Dispute

September 4, 2017

Being published is the bread and butter of intellectuals, especially academics. publication, in theory, is a way for information to be shared across the globe, but it also has become big business. In a recent Chemistry World article the standoff between Germany’s Project DEAL (a consortium comprised of German universities) and Dutch publisher, Elsevier, is examined along with possible fall-out from the end result.

At the heart of the dispute is who controls the publications. Currently, Elsevier holds the cards and has wielded their power to make a clear point on the matter. Project DEAL, though, is not going down without a fight and Chemistry World quotes Horst Hippler, a physical chemist and chief negotiator for Project DEAL, as saying,

In the course of digitisation, science communication is undergoing a fundamental transformation process. Comprehensive, free and – above all – sustainable access to scientific publications is of immense importance to our researchers. We therefore will actively pursue the transformation to open access, which is an important building block in the concept of open science. To this end, we want to create a fair and sustainable basis through appropriate licensing agreements with Elsevier and other scientific publishers.

As publications are moving farther from ink and paper and more to digital who owns the rights to the information is becoming murkier. It will be interesting to see how this battle plays out and if any more disgruntled academics jump on board.

Catherine Lamsfuss, September 4, 2017

Written by Stephen E. Arnold · Filed Under Digital Library, Education, News, Publishing | Comments Off on Academic Publication Rights Cause European Dispute

Demonizing the Ever Helpful Alaphabet Google XXVI Things

September 2, 2017

Gentle reader, I am horrified at the indirect vilification of my beloved Alphabet Google XXVI things. You must judge for yourself. Navigate to “A Serf on Google’s Farm.” A serf, as I understand the term is a person who is in thrall to a noble. The noble provides the land, and the serf the labor. As our modern world embraces the precepts of the Great Chain of Being, serfs are below the one percent. Thus, it is. In the Dark Ages, one did not grouse too much about the one percent. Bad things could happen because that was the mechanism for the Great Chain of Being. It was a perception that the top spot was occupied by a deity. The lower levels were ranked by their station in life. In short, it was and is good to be up near the top of the pecking order.

The write up makes clear that publishers find themselves lower in the Great Chain of Digital Being than they were in the pre-Google era. Yep, when the king disowned an annoying son, life was not as good outside the castle as it was inside the castle.

Publisher types are now looking at the castle from the mud and straw vantage points close to the pigs and chickens. Big change. The trip to the castle may have been short in terms of steps but long in terms of the Great Chain of Being.

The article points out that Google has put publishers and related content types in the squalid hovels built near the castle walls. Life can be fun when the wine and mead are available, and the harvest is good. But at other times, those lice and muddy lanes were a bummer.

The write up points out that the Google has assembled an advertising Catch 22. Get with the program and you may be squeezed by the program. Thus it was for serfs and thus it is for those who have little choice but accept Google’s way of life.

I noted three statements which characterize the world as perceived by a digital serf:

as the adage puts it, if you don’t pay for the product, you are the product. Google isn’t doing us any favors. We get these services for free because Google’s empire and the vast amounts of money it brings in every year is built on the unimaginable amounts of data that come from, among other places, DoubleClick for Publishers and Analytics. We’re [the article author’s company] just one of a kabillion [sic] sites allowing Google to harvest our data.
Running TPM [the article author’s company] absent Google’s various services is almost unthinkable. Like I literally would need to give it a lot of thought how we’d do without all of them. Some of them are critical and I wouldn’t know where to start for replacing them. In many cases, alternatives don’t exist because no business can get a footing with a product Google lets people use for free.
And in general Google tends to be a relatively benign overlord….Google’s monopoly control is almost comically great. It’s a monopoly at every conceivable turn and consistently uses that market power to deepen its hold and increase its profits. Just the interplay between DoubleClick and Adexchange is textbook anti-competitive practices.

My view is that the Google has been operating in a consistent manner since it was inspired by the Yahoo, GoTo, Overture pay to play model. That shift from better Web search to the ad thing took place before the Google initial public offering. That works out to 13 years ago.

In that span of time, publishers wanted the world to be like the good old days of print which put the publishers in the role of gatekeepers and power brokers. Nice try, but publishers were unable to adapt to the Googley world. Just like the hapless retail giants, the failure to take advantage of digital opportunities has put Sears, JC Penny, and other “giants” outside the castle walls. Wattle, not Walmart, is the go to operating model.

Forget Google. Had there been no Google, another outfit would have filled the void. Google is a reflection of today’s version of the Middle Ages.

Do I feel sorry for traditional publishers? Nope. These outfits embrace systems and methods like XML, slicing and dicing, and surfing on Google as the skateboard wheels that will carry them to the future.

The wheels spin but don’t win X Games competitions.

Now Google itself is vulnerable. There is Facebook, the Chinese outfits, and the Bezos transformer machine. Perhaps publishers should think about ways to exploit Google’s flaws instead of grousing about Google being Google for 13 years. The Alphabet Google XXVI things are not likely to change their stripes overnight.

Publishers might find life easier if they quit complaining and name calling. Meeting user needs might be a path forward. But Google bashing is so easy and so much fun. Figuring out how to make money is work. Who wants to do that?

Stephen E Arnold, September 2, 2017

Written by Stephen E. Arnold · Filed Under Business strategy, Google, News, Publishing | Comments Off on Demonizing the Ever Helpful Alaphabet Google XXVI Things

Google: A Me Too from Mountain View

August 7, 2017

It is a tough world out there for a seller of online ads. From my point of view, the concentration of online advertising in the hands of Facebook and Google is a natural consequence of digital disintermediation. He who is most like the old Bell Telephone wins.

What does one do when an upstart comes up with a better idea? If one is a giant company’s chief innovator, the answer is obvious: Imitate, then use the power of scale to take lots of money.

I thought about this characteristic of online when I read “Google Reportedly Building Its Own Snapchat Competitor.” I would have used the word “killer,” not competitor, but that’s why I am a 74 year old retired person in rural Kentucky.

The write up (which may be a recycled variant of another real journalism effort) said:

Google is working on its answer to Snapchat. It’s called Stamp — a portmanteau of “stories” and “AMP,” the acronym for Accelerated Mobile Pages …The new platform would be similar to Snapchat’s Discover feature, where publishers create and share made-for-Snapchat (or repurposed-for-Snapchat) content.

Didn’t Google try to buy Snap when it was just Snapchat?

Moral of the story:

The model and wife of Snapchat CEO Evan Spiegel has historically not been too thrilled about other tech companies ripping off her husband’s product. “Do they have to steal all of my partner’s ideas? I’m so appalled by that … When you directly copy someone, that’s not innovation.”

Steal?

Nah, that’s innovation the online way.

Stephen E Arnold, August 7, 2017

Written by Stephen E. Arnold · Filed Under News, Publishing, Social Media | 1 Comment

Big Data as Savior of Newspapers? Tell That to NYT Editors

August 7, 2017

This would be ironic. The SmartDataCollective posits, “Is Big Data the Salvation of the Newspaper Industry?” The write-up tells us that several prominent publications are turning to data analysis to boost their bottom lines and, they hope, save themselves from extinction. Writer Rehan Ijaz cites this post from the US Chamber of Commerce Foundation as he describes ways the New York Times and the Financial Times are leveraging data. He quotes publishing pro, David Soloff:

The Financial Times, one of our global publisher customers, uses big data analytics to optimize pricing on ads by section, audience, targeting parameters, geography, and time of day. Our friends at the FT sell more inventory because the team knows what they have, where it is and how it should be priced to capture the opportunity at hand. To boot, analytics reveal previously undersold areas of the publication, enabling premium pricing and resulting in found margin falling straight to the bottom line.

What about the venerable New York Times? That paper hired a data scientist in 2014, yet now is slashing staff, we learn from Reuters’ piece, “New York Times Offers Buyouts, Scraps Public Editor Position.” It is, in fact, most editors facing unemployment (because clear prose and verified facts are so last century, I suppose.) Reporters Jessica Toonkel and Narottam Medhora reveal:

The newspaper said it would eliminate the in-house watchdog position of public editor as it shifts focus to reader comments. ‘Today, our followers on social media and our readers across the internet have come together to collectively serve as a modern watchdog, more vigilant and forceful than one person could ever be,’ publisher Arthur Sulzberger Jr said in a memo, which was reviewed by Reuters.

“Vigilant and forceful?” Is “correct” not a consideration? Professional editors exist for a reason; crowdsourcing will not always suffice. Also, call me old-fashioned, but I think facts should be confirmed before publication. This is an interesting choice for the Times to be making particularly now, amid the “fake news” commotion.

Cynthia Murrell, August 7, 2017

Written by Stephen E. Arnold · Filed Under Internet, News, Publishing, Social Media | 1 Comment

« Previous Page — Next Page »

Search the site
Subscribe to Beyond Search
Feature archive
News archive

Stephen E. Arnold monitors search, content processing, text mining and related topics from his high-tech nerve center in rural Kentucky. He tries to winnow the goose feathers from the giblets. He works with colleagues worldwide to make this Web log useful to those who want to go "beyond search". Contact him at sa [at] arnoldit.com. His Web site with additional information about search is arnoldit.com.

Categories
- 3D-Printing
- Acquisition
- Advertising
- Aggregation
- AI
- Alexa
- algorithms
- Amazon
- Amazonia
- Analytics
- Appliance
- Applications
- Audio
- Augmented Reality
- Big data
- Bing
- Bitcoin
- Bitext
- Book review
- Business intelligence
- Business process
- Business strategy
- Censorship
- Cloud computing
- Company Profile
- Conferences
- Connectors
- Consulting
- Consumer
- Content processing
- Copyright
- Corporate Concerns
- Cost
- Crawl
- Crowdfunding
- cryptocurrency
- Customer support
- Cyber OSINT
- cybercrime
- cybersecurity
- Dark Web
- DarkCyber
- Data
- Data mining
- Database
- Deepfakes
- Digital Assistant
- Digital Library
- E2EE
- ECommerce
- EDiscovery
- Editorial opinion
- Education
- Emoticons
- Employment
- Enterprise
- Enterprise search
- Entity extraction
- Ethics
- Facebook
- Faceted search
- Factualities
- Feature
- Federated search
- Financial
- Fogint
- Google
- Governance
- Government
- Hackers
- healthcare
- IBM Watson
- Image search
- Indexing
- Infrastructure
- Innovation
- Integration
- intelware
- Interface
- Internet
- Interview
- Investment
- law enforcement
- Legal matters
- Library automation
- Management
- Marketing
- Mathematics
- Metadata
- Microsoft
- Mobile
- Natural language processing
- News
- NGIA
- Online (general)
- Open Access
- Open source
- OSINT
- Osint Radar
- Overflight
- Palantir
- Patents
- Personnel
- Podcast
- Policeware
- Portals
- Predictive coding
- Privacy
- Profile
- Publishing
- Quotation
- Real time search
- Reference tool
- Rich media
- Robot Writer
- Search
- Search enabled applications
- search engine
- Search quality
- Security
- Semantic
- Sentiment analysis
- SEO
- SharePoint
- Short Honks
- Smart Technology
- Social
- Social Media
- software
- Statistics
- Taxonomy
- Technology
- Telegram
- Text analytics
- Text processing
- Tools
- Tor
- Training
- Translation
- Twitter
- Uncategorized
- Unstructured Data
- User experience
- User Interface
- Vertical search
- Video
- visualization
- Voice search
- Voice technology
- Web 3
- Web Services
- Webinar
- Windows
- Work flow
- XML
- Yahoo

Beyond Search

Dow Jones: Fake News As a Training Error

Medieval Thoughts in a Mobile Smart Bubble

Google-Publishers Partnership Chases True News

Old School Publishing: On the Ropes?

Bing and Google: The News Battle

A New and Improved Content Delivery System

Academic Publication Rights Cause European Dispute

Demonizing the Ever Helpful Alaphabet Google XXVI Things

Google: A Me Too from Mountain View

Big Data as Savior of Newspapers? Tell That to NYT Editors

Search the site

Categories

Archives

Recent Posts

Meta

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Search the site

Categories

Archives

Recent Posts

Meta