The Internet Archives: Is It the New Library of Congress?
May 28, 2025
It’s been a challenging time with two US presidential administrations allegedly rewriting history and basic facts to fit their political agendas. Adding and deleting information can make it quite difficult to determine if a fact is an old-fashioned fact or an alternative fact. Powerful people, companies, and software seem to be using the Internet and other digital tools to weaponize what might be “basic” information. Hey, Mr. Orwell did you envision the US in the 21st century. Naked Capitalism works hard to explain the importance of preserving digital information and protecting corresponding archives in “As History Erasure Intensifies, Independent Internet Archives Are Helping Fortify The ‘Digital Preservation Infrastructure.’”
The Freedoms Archives in Berkeley, California are preserving digital information related to activism, social issues, and history. They’ve noticed that one administration is rewriting historical facts more so than any other president and changing how certain aspects of history can be taught in public schools. Their work is integral to the historical record, but they’re not taking into account other administrations and that most modern presidents have dealt with such sweeping reformations with access to this level of technology before.
Other important archives include the Internet Archive now seeking additional funds:
“In April 2025, the San Francisco Standard reported that the Department of Government Efficiency (DOGE) had cut funding for the Internet Archive while the organization “was halfway through an NEH [National Endowment for the Humanities] grant of $345,960.” Jefferson Bailey, the Internet Archive’s director of archiving and data services, said that funding from other sources would help the organization stay afloat, but he worried about the impact of the cuts on smaller nonprofits.”
Nonprofit organizations, libraries, and those intent on preserving the historical record as it stands are important. As a librarian, there may now be more people eager to shape facts than people interested in accessing information that can be verified in an objective way. But this may not be a problem. When facts become fluid, no one will know what happened. The Internet Archive is fighting a good fight, but it may lose the war.
Whitney Grace, May 28, 2025
China Slated To Overtake US In AI Development. How about Bypass?
May 28, 2025
China was scheduled to become the world’s top performing economy by now. This was predicted in the early 2000s, but the Middle Kingdom has experienced some roadblocks. Going through all of them would require an entire class on world history and economics. We don’t have time for that because SCMP says, “China To Harness Nation’s Resources To AI Self-Reliance Ambitions."
Winnie the Pooh a.k.a. President Xi Jinping told the Communist Party’s inner circle that he plans to stimulate AI theory and core technologies. Xi wants to leverage his country’s “new whole national system” to repair bottlenecks like high end chips. The “new whole national system” is how the Community Party describes directing resources towards national strategic goals.
Xi is desperate for China to overtake the US in AI development. This pipe dream was crushed when the US placed tariffs on Chinese goods. While the tariff war is on hiatus for a few months, it doesn’t give China a desperate leg up.
Xi said:
“‘We must acknowledge the technological gap, redouble our efforts to comprehensively push forward technological innovation, industrial development and applications, and the AI regulatory system,’ state news agency Xinhua quoted Xi as saying. ‘[China should] continue to strengthen basic research, and concentrate on conquering core technologies such as high-end chips and basic software, so as to build an independent, controllable, and collaborative AI basic software and hardware system. ‘[We should then] use AI to lead the paradigm shift in scientific research and accelerate scientific and technological innovation breakthroughs in various fields.’”
So said Winnie the Pooh. He’s searching for that irresistible pot of honey while dealing with US and Trump bumblebees. Maybe if he disguises himself as a little black raincloud instead of a “weather balloon” he might advance further in AI? However, some tension in the military may lead to a bit of choppy weather in what is supposed to be a smooth, calm sea of agreement.
Let’s ask Deepseek.
Whitney Grace, May 28, 2025
SEO Dead? Nope, Just Wounded But Will Survive Unfortunately
May 27, 2025
SEO or search engine optimization is one of the forces that killed old fashioned precision and recall. Precision morphed from presenting on point sources to smashing a client’s baloney content into a searcher’s face. Recall went from a metric indicating that a query was passed across available processed content. Now it means, “Buy, believe, and baloney information.”
The write up “The Future of SEO As the Future Google Search Rolls Out” explains:
“Google isn’t going to keep its search engine the way it was for the past two decades. Google knows it has to change, despite them making an absolute fortune from search ads. Google is worried about TikTok, worried about, ChatGPT, worried about searchers going to something new and better.”
These paragraphs make clear that SEO is not going to its grave without causing some harm to the grave diggers:
“There are a lot of concerned people in the search marketing industry right now. The bottom line is while many of us like to complain and we honestly have good reason to be upset, complaining won’t help. We need to adapt and change and experiment. Experiment with these new experiences, keep on top of these changes happening in Google and at other AI and search companies. Then try new things and keep testing. If you do not adapt, you will die. SEO won’t die, but you will become irrelevant. The good news, SEOs are some of the best at adapting, embracing change and testing new strategies out. So you are all ready and equipped for the future of search.”
Let me share some observations about this statement from the cited write up:
First, the SEO professionals are concerned. About relevance and returning precise on point information to the user? Are you kidding me? SEO professionals are worried about their making money. Google, after using SEOs as part of their push to sell ads, the SEO crowd is wracked with uncertainty.
Second, adaptation is important. A failure to adapt means no money. Now the SEO professionals must embrace anxiety. Is stress good for SEO professionals? Probably not.
Third, SEO professionals with 20 years of experience must experiment. Are these individuals equipped to head to the innovation space and crank out new ways to generate money? A few will be able to be the old that that learns to roll over on late night television. Most — well — will struggle to get up or die trying.
What’s my prediction for the future of SEO? Snake oil vendors are part of the data carnival. Ladies and gentlemen, get your cure for no traffic here. Step right up.”
Stephen E Arnold, May xx, 2-25
Coincidence or No Big Deal for the Google: User Data and Suicide
May 27, 2025
Just the dinobaby operating without Copilot or its ilk.
I have ignored most of the carnival noise about smart software. Google continues its bug spray approach to thwarting the equally publicity-crazed Microsoft and OpenAI. (Is Copilot useful? Is Sam Altman the heir to Steve Jobs?)
Two stories caught my attention. The first is almost routine. Armed with the Chrome Hoover, long-lived cookies, and the permission hungry Android play — The Verge published “Google Has a Big AI Advantage: It Already Knows Everything about You.” Sigh. another categorical affirmative: “Everything.” Is that accurate? “Everything” or is it just a scare tactic to draw readers? Old news.
But the sub title is more interesting; to wit:
Google is slowly giving Gemini more and more access to user data to ‘personalize’ your responses.
Slowly. Really? More access? More than what? And “your responses?” Whose?
The write up says:
As an example, Google says if you’re chatting with a friend about road trip advice, Gemini can search through your emails and files, allowing it to find hotel reservations and an itinerary you put together. It can then suggest a response that incorporates relevant information. That, Google CEO Sundar Pichai said during the keynote, may even help you “be a better friend.” It seems Google plans on bringing personal context outside Gemini, too, as its blog post announcing the feature says, “You can imagine how helpful personal context will be across Search, Gemini and more.” Google said in March that it will eventually let users connect their YouTube history and Photos library to Gemini, too.
No kidding. How does one know that Google has not been processing personal data for decades. There’s a patent *with a cute machine generated profile of Michael Jackson. This report generated by Google appeared in the 2007 patent application US2007/0198481:
The machine generated bubble gum card about Michael Jackson, including last known address, nicknames, and other details. See US2007/0198481 A1, “Automatic Object Reference Identification and Linking in a Browsable Fact Repository.”
The inventors Andrew W. Hogue (Ho Ho Kus, NJ) and Jonathan T. Betz (Summit, NJ) appear on the “final” version of their invention. The name of the patent was the same, but there was an important different between the patent application and the actual patent. The machine generated personal profile was replaced with a much less useful informative screen capture; to wit:
From Google Patent 7774328, granted in 2010 as “Browsable Fact Repository.”
Google wasn’t done “inventing” enhancements to its profile engine capable of outputting bubble gum cards for either authorized users or Google systems. Check out Extension US9760570 B2 “Finding and Disambiguating References to Entities on Web Pages.” The idea is that items like “aliases” and similarly opaque factoids can be made concrete for linking to cross correlated content objects.,
Thus, the “everything” assertion while a categorical affirmative reveals a certain innocence on the part of the Verge “real news” story.
Now what about the information in “Google, AI Firm Must Face Lawsuit Filed by a Mother over Suicide of Son, US Court Says.” The write up is from the trusted outfit Thomson Reuters (I know it is trusted because it says so on the Web page). The write up dated May 21, 2025, reports:
The lawsuit is one of the first in the U.S. against an AI company for allegedly failing to protect children from psychological harms. It alleges that the teenager killed himself after becoming obsessed with an AI-powered chatbot. A Character.AI spokesperson said the company will continue to fight the case and employs safety features on its platform to protect minors, including measures to prevent "conversations about self-harm." Google spokesperson Jose Castaneda said the company strongly disagrees with the decision. Castaneda also said that Google and Character.AI are "entirely separate" and that Google "did not create, design, or manage Character.AI’s app or any component part of it."
Absent from the Reuters’ report and the allegedly accurate Google and semi-Google statements, the company takes steps to protect users, especially children. With The profiling and bubble gum card technology Google invented, does it seem prudent for Google to identify a child, cross correlate the child’s queries with the bubble gum card and dynamically [a] flag an issue, [b] alert a parent or guardian, [c] use the “everything” information to present suggestions for mental health support? I want to point out that if one searches for words on a stop list, the Dark Web search engine Ahmia.fi presents a page providing links to Clear Web resources to assist the person with counseling. Imagine: A Dark Web search engine performing a function specifically intended to help users.
Google, is Ahmia,fi more sophisticated that you and your quasi-Googles? Are the statements made about Google’s AI capabilities in line with reality? My hunch is requests like “Google spokesperson Jose Castaneda said the company strongly disagrees with the decision. Castaneda also said that Google and Character.AI are "entirely separate" and that Google "did not create, design, or manage Character.AI’s app or any component part of it." made after the presentation of evidence were not compelling. (Compelling is a popular word in some AI generated content. Yeah, compelling: A kid’s death. Inventions by Googlers specifically designed to profile a user, disambiguate disparate content objects, and make available a bubble gum card. Yeah, compelling.
I am optimistic that Google knowing “everything,” the death of a child, a Dark Web search engine that can intervene, and the semi-Google lawyers add up to comfort and support.
Yeah, compelling. Google’s been chugging along in the profiling vineyard since 2007. Let’s see that works out to longer than the 14 year old had been alive.
Compelling? Nah. Googley.
Stephen E Arnold, May 27, 2025
AI Search: Go Retro
May 27, 2025
CIO’s article, “Invest In AI Search As An Enterprise Business Asset” reads like a blast from the pasta circa early 2000s. Back then it was harder to find decent information, ergo the invention of Google. However, it was also a tad easier to get ranked. With the advent of AI search the entire game has shifted so these tips are questionable.
CIO shares helpful stats about AI: 90% of AI projects never develop beyond proof of concept and 97% of organizations have trouble demonstrating the business value of generative AI. Then this apt paragraph is tossed at readers:
- “A major reason is that many cautious business leaders treat AI as a source of incremental improvements to existing processes rather than a tool to reshape core business functions. Too often, business leaders underestimate the people, behavior, and organizational changes entailed by strategically using AI.”
- Generative AI is still a new technology so it’s rational not everyone understands its implications and potential. The article then transitions into the difficulties employees have finding information. Another apt observation is made:
- “They have become accustomed to instant gratification on the web, but the lack of investment many organizations make in relevance and content curation makes searching inside the corporate firewall maddeningly unproductive.”
Then readers are treated to sales pitch that’s been heard since every new search technology emerged (well before Google):
“AI search not only incrementally improves productivity but can radically reshape core business capabilities. It replaces simple keyword searches with advanced semantic techniques that understand the intent and context behind a query. Semantic search combines technologies including natural language processing, vector data stores, and machine learning to deliver results that more closely match what users need than keywords without requiring major investments in content curation.”
There is something new that Steve Mayzak, the global managing director of Search at Elastic said: “With semantic search, you can search across an entire book instead of relying on the index alone.”
Now that has my attention. Indices are great but are limited. When I’m doing research, I love having a digital copy and physical copy of the book. The physical copy is easier to maneuver and read, while I have the searching feature, copy/paste, and notes tool in the digital version.
Helpful? Sort of.
Whitney Grace, May 27, 2025
Real News Outfit Finds a Study Proving That AI Has No Impact in the Workplace
May 27, 2025
Just the dinobaby operating without Copilot or its ilk.
The “real news” outfit is the wonderful business magazine Fortune, now only $1 a month. Subscribe now!
The title of the write up catching my attention was “Study Looking at AI Chatbots in 7,000 Workplaces Finds ‘No Significant Impact on Earnings or Recorded Hours in Any Occupation.” Out of the blocks this story caused me to say to myself, “This is another you-can’t-fire-human-writers” proxy.”
Was I correct? Here are three snips, and I not only urge you to subscribe to Fortune but read the original article and form your own opinion. Another option is to feed it into an LLM which is able to include Web content and ask it to tell you about the story. If you are reading my essay, you know that a dinobaby plucks the examples, no smart software required, although as I creep toward 81, I probably should let a free AI do the thinking for me.
Here’s the first snip I captured:
Their [smart software or large language models] popularity has created and destroyed entire job descriptions and sent company valuations into the stratosphere—then back down to earth. And yet, one of the first studies to look at AI use in conjunction with employment data finds the technology’s effect on time and money to be negligible.
You thought you could destroy humans, you high technology snake oil peddlers (not the contraband Snake Oil popular in Hong Kong at this time). Think old-time carnival barkers.
Here’s the second snip about the sample:
focusing on occupations believed to be susceptible to disruption by AI
Okay, “believed” is the operative word. Who does the believing a University of Chicago assistant professor of economics (Yay, Adam Smith. Yay, yay, Friedrich Hayak) and a graduate student. Yep, a time honored method: A graduate student.
Now the third snip which presents the rock solid proof:
On average, users of AI at work had a time savings of 3%, the researchers found. Some saved more time, but didn’t see better pay, with just 3%-7% of productivity gains being passed on to paychecks. In other words, while they found no mass displacement of human workers, neither did they see transformed productivity or hefty raises for AI-wielding super workers.
Okay, not much payoff from time savings. Okay, not much of a financial reward for the users. Okay, nobody got fired. I thought it was hard to terminate workers in some European countries.
After reading the article, I like the penultimate paragraph’s reminder that outfits like Duolingo and Shopify have begun rethinking the use of chatbots. Translation: You cannot get rid of human writers and real journalists.
Net net: A temporary reprieve will not stop the push to shift from expensive humans who want health care and vacations. That’s the news.
Stephen E Arnold, May 27, 2025
Microsoft Investigates Itself and a Customer: Finding? Nothing to See Here
May 26, 2025
No AI, just a dinobaby and his itty bitty computer.
GeekWire, creator of the occasional podcast, published “Microsoft: No Evidence Israeli Military Used Technology to Harm Civilians, Reviews Find.” When an outfit emits occasional podcasts published a story, I know that the information is 100 percent accurate. GeekWire has written about Microsoft and its outstanding software. Like Windows Central, the enthusiasm for what the Softies do is a key feature of the information.
What did I learn included:
- Israel’s military uses Microsoft technology
- Israel may have used Microsoft technology to harm non-civilians
- The study was conducted by the detail-oriented and consistently objective company. Self-study is known to be reliable, a bit like research papers from Harvard which are a bit dicey in the reproducible results department
- The data available for the self-study was limited; that is, Microsoft relied on an incomplete data set because certain information was presumably classified
- Microsoft “provided limited emergency support to the Israeli government following the October 7, 2023, Hamas attacks.”
Yeah, that sounds rock solid to me.
Why did the creator of Bob and Clippy sit down and study its navel? The write up reported:
Microsoft said it launched the reviews in response to concerns from employees and the public over media reports alleging that its Azure cloud platform and AI technologies were being used by the Israeli military to harm civilians.
The Microsoft investigation concluded:
its recent reviews found no evidence that the Israeli Ministry of Defense has failed to comply with its terms of service or AI Code of Conduct.
That’s a fact. More than rock solid, the fact is like one of those pre-Inca megaliths. That’s really solid.
GeekWire goes out on a limb in my opinion when it includes in the write up a statement from an individual who does not see eye to eye with the Softies’ investigation. Here’s that passage:
A former Microsoft employee who was fired after protesting the company’s ties to the Israeli military, he said the company’s statement is “filled with both lies and contradictions.”
What’s with the allegation of “lies and contradictions”? Get with the facts. Skip the bogus alternative facts.
I do recall that several years ago I was told by an Israeli intelware company that their service was built on Microsoft technology. Now here’s the key point. I asked if the cloud system worked on Amazon? The response was total confusion. In that English language meeting, I wondered if I had suffered a neural malfunction and posed the question, “Votre système fonctionne-t-il sur le service cloud d’Amazon?” in French, not English.
The idea that this firm’s state-of-the-art intelware would be anything other than Microsoft centric was a total surprise to those in the meeting. It seemed to me that this company’s intelware like others developed in Israel would be non Microsoft was inconceivable.
Obviously these professionals were not aware that intelware systems (some of which failed to detect threats prior to the October 2023 attack) would be modified so that only adversary military personnel would be harmed. That’s what the Microsoft investigation just proved.
Based on my experience, Israel’s military innovations are robust despite that October 2023 misstep. Furthermore, warfighting systems if they do run on Microsoft software and systems have the ability to discriminate between combatants and non-combatants. This is an important technical capability and almost on a par with the Bob interface, Clippy, and AI in Notepad.
I don’t know about you, but the Microsoft investigation put my mind at ease.
Stephen E Arnold, May 26, 2025
Ten Directories of AI Tools
May 26, 2025
Just the dinobaby operating without Copilot or its ilk.
I scan DailyHunt, an India-based news summarizer powered by AI I think. The link I followed landed me on a story titled “Best 10 AI Directories to Promote.” I looked for a primary source, an author, and links to each service. Zippo. Therefore, I assembled the list, provided links, and generated with my dinobaby paws and claws the list below. Enjoy or ignore. I am weary of AI, but many others are not. I am not sure why, but that is our current reality, replete with alternative facts, cheating college professors, and oodles of crypto activity. Remember. The list is not my “best of”; I am simply presenting incomplete information in a slightly more useful format.
AIxploria https://www.aixploria.com/en/ [Another actual directory. Its promotional language says “largest list”. Yeah, I believe that]
AllAITool.ai at https://allaitool.ai/
FamouseAITools.ai https://famousaitools.ai/ [Another marketing outfit sucking up AI tool submissions]
Futurepedia.io https://www.futurepedia.io/
TheMangoAI.co https://themangoai.co/ [Not a directory, an advertisement of sorts for an AI-powered marketing firm]
NeonRev https://www.neonrev.com/ [Another actual directory. It looks like a number of Telegram bot directories]
Spiff Store https://spiff.store/ [Another directory. I have no idea how many tools are included]
StackViv https://stackviv.ai/ [An actual directory with 10,000 tools. No I did not count them. Are you kidding me?]
TheresanAIforThat https://theresanaiforthat.com/ [You have to register to look at the listings. A turn off for me]
Toolify.ai https://www.toolify.ai/ [An actual listing of more than 25,000 AI tools organized into categories probably by AI, not a professional indexing specialist]
When I looked at each of these “directories”, marketing is something the AI crowd finds important. A bit more effort in the naming of some of these services might help. Just a thought. Enjoy.
Stephen E Arnold, May 26, 2025
Microsoft: Did It Really Fork This Fellow?
May 26, 2025
Just the dinobaby operating without Copilot or its ilk.
Forked doesn’t quite communicate the exact level of frustration Philip Laine experienced while working on a Microsoft project. He details the incident in his post, “Getting Forked By Microsoft.” Laine invented a solution for image scalability without a stateful component and needed minimal operation oversight. He dubbed his project Spegel, made it open source, and was contacted by Microsoft.
Microsoft was pleased with Spegel. Laine worked with Microsoft engineers to implement Spegel into its architecture. Everything went well until Microsoft stopped working with him. He figured the moved onto other projects. Microsoft did move on but the engineers developed their own version of Spegel. They have the grace to thank Laine and in a README file. It gets worse:
"While looking into Peerd, my enthusiasm for understanding different approaches in this problem space quickly diminished. I saw function signatures and comments that looked very familiar, as if I had written them myself. Digging deeper I found test cases referencing Spegel and my previous employer, test cases that have been taken directly from my project. References that are still present to this day. The project is a forked version of Spegel, maintained by Microsoft, but under Microsoft’s MIT license.”
Microsoft plagiarized…no…downright stole Spegel’s base coding from Laine. He, however, published Spegel with Microsoft’s MIT licensing. The MIT licensing means:
“Software released under an MIT license allows for forking and modifications, without any requirement to contribute these changes back. I default to using the MIT license as it is simple and permissive.”
It does require this:
“The license does not allow removing the original license and purport that the code was created by someone else. It looks as if large parts of the project were copied directly from Spegel without any mention of the original source.”
Laine wanted to work with Microsoft and have their engineers contribute to his open source project. He’s dedicated his energy, time, and resources to Spegel and continues to do so without much contribution other than GitHub sponsors and the thanks of its users. Laine is considering changing Spegel’s licensing as it’s the only way to throw a stone at Microsoft.
If true, the pulsing AI machine is a forker.
Whitney Grace, May 26, 2025
Censorship Gains Traction at an Individual Point
May 23, 2025
No AI, just the dinobaby expressing his opinions to Zillennials.
I read a somewhat sad biographical essay titled “The Great Displacement Is Already Well Underway: It’s Not a Hypothetical, I’ve Already Lost My Job to AI For The Last Year.” The essay explains that a 40 something software engineer lost his job. Despite what strike me as heroic efforts, no offers ensued. I urge you to take a look at this essay because the push to remove humans from “work” is accelerating. I think with my 80 year old neuro-structures that the lack of “work” will create some tricky social problems.
I spotted one passage in the essay which struck me as significant. The idea of censorship is a popular topic in central Kentucky. Quite a few groups and individuals have quite specific ideas about what books should be available for students and others to read. Here is the quote about censorship from the cited “Great Displacement” essay:
I [the author of the essay] have gone back and deleted 95% of those articles and vlogs, because although many of the ideas they presented were very forward-thinking and insightful at the time, they may now be viewed as pedestrian to AI insiders merely months later due to the pace of AI progress. I don’t want the wrong person with a job lead to see a take like that as their first exposure to me and think that I’m behind the last 24 hours of advancements on my AI takes.
Self-censorship was used to create a more timely version of the author. I have been writing articles with titles like “The Red Light on the Green Board” for years. This particular gem points out that public school teachers sell themselves and their ideas out. The prostitution analogy was intentional. I caught a bit of criticism from an educator in the public high school in which I “taught” for 18 months. Now people just ignore what I write. Thankfully my lectures about online fraud evoke a tiny bit of praise because the law enforcement, crime analysts, and cyber attorneys don’t throw conference snacks at me when I offer one of my personal observations about bad actors.
The cited essay presents a person who is deleting content into to present an “improved” or “shaped” version of himself. I think it is important to have in original form essays, poems, technical reports, and fiction — indeed, any human-produced artifact — available. These materials I think will provide future students and researchers with useful material to mine for insights and knowledge.
Deletion means that information is lost. I am not sure that is a good thing. What’s notable is that censorship is taking place by the author for the express purpose of erasing the past and shaping an impression of the present individual. Will that work? Based on the information in the essay, it had not when I read the write up.
Censorship may be one facet of what the author calls a “displacement.” I am not too keen on censorship regardless of the decider or the rationalization. But I am a real dinobaby, not a 40-something dinobaby like the author of the essay.
Stephen E Arnold, May 23, 2025