DAILY NEWS

Stay Ahead, Stay Informed – Every Day

Advertisement
Anthropic Has Added Several More Religions on Its Quest to Inject Perfect Morals into Claude



            The original mysterious black box wasn’t an AI model at all, but the Kaaba, the black cube at the center of the Sacred Mosque of Mecca. Prior to Muhammad’s conquest of Mecca, the Kaaba was a sort of all-purpose repository of 360 sacred symbols from around the region. If you were, say, a busy merchant on his way to Medina, whatever the great spiritual truths of the universe may be, they were in there somewhere, so a prayer to the Kaaba had you covered in the god department and you were good to go. Anthropic seems to be doing something along these lines with Claude. Last week, representatives from Anthropic—along with OpenAI—attended an event in New York called the “Faith-AI Covenant” roundtable. The New York Board of Rabbis, the Hindu Temple Society of North America, the Church of Jesus Christ of Latter-day Saints, the U.S.-based Sikh Coalition, and the Greek Orthodox Archdiocese of America were all in attendance.

Last month, I wrote about a series of meetings and dinners Anthropic organized with a collection of 15 Christian leaders. Anthropic was looking for advice from the Christians, and guidance on the supposed “spiritual development” of its Claude AI model. At the time Anthropic said it was working on arranging meetings with moral thinkers who represented other groups. It’s not clear from a fresh Associated Press piece about the Faith-AI Covenant meeting whether these latest conversations with religious leaders and the earlier meetings with Christians were part of a single coherent program at Anthropic, and whether the staff members who participated in the Christian summit participated in this one as well. Gizmodo asked Anthropic for clarity about this on Saturday, but Anthropic did not return our request as of this writing.

The Associated Press also says OpenAI and Anthropic “initiated outreach,” but also that a Swiss NGO called the Interfaith Alliance for Safer Communities organized it, and has plans for future events along similar lines in China, Kenya and the United Arab Emirates. Also mentioned as a “key partner” was Baroness Joanna Shields, a member of the British House of Lords.

There’s not a single clear takeaway in the AP story—no religious instructions laid out by all these spiritual leaders. But what Anthropic calls Claude’s constitution includes a dissection of the philosophically fraught moral work Anthropic is at least trying to do by injecting morals into a machine: getting it to make the decision of a person with perfect values when there’s no way to write a rule for a situation that arises, and the consequences of making the wrong decision could be dire. This, Anthropic writes, is “centrally because we worry that our efforts to give Claude good enough ethical values will fail.” To this end, the Associated Press story extracts some quietly devastating commentary from Rumman Chowdhury, CEO of a nonprofit called Humane Intelligence: “I think a very naive take that Silicon Valley has had for a couple of years related to generative AI was that we could arrive at some sort of universal principles of ethics,” Chowdhury told the AP, adding, “They have very quickly realized that that’s just not true. That’s not real. So now they’re looking at maybe religion as a way of dealing with the ambiguity of ethically gray situations.”

They are indeed looking at maybe religion. But it’s hard to picture Anthropic coming away from these meetings converted, and inserting one set of specific religious doctrines into Claude. They’re just trying to glean high order ethical truths, and demonstrating to the world that they’ve—ostensibly—left no stone unturned in searching for them. Your mileage will vary on whether you think a machine charged with making decisions or giving important advice would, when the chips are down, be able to synthesize ideal morals thanks to meetings its creators held with administrators from some of humanity’s premier religions. It probably can’t hurt, sorta like nodding at the pre-Islamic Kaaba. But then again, only God knows for sure.



Source link

Trump’s Potential New AI Executive Order May Take a Swipe at Anthropic



An executive order reportedly being mulled by President Donald Trump could deepen—or, who knows, resolve?—its ongoing conflict with Anthropic. Yesterday I wrote about reports that Trump was working on creating an AI “working group” by executive order. This group would be made up of government officials and members of the tech industry, and one of its roles might be to devise a review process for unreleased AI models. In other words, after promising a light regulatory touch, Trump might be dipping his toe into creating some AI guardrails. I noted that the Times’ sources compared Trump’s potential working group to a similar group in the process of being created in the U.K., and that that group was spurred into existence by the revelations of security vulnerabilities brought about by Anthropic’s Claude Mythos Preview model.

Yesterday afternoon, the AI companies Microsoft, xAI, and Google all signed deals allowing a Biden-created arm of the Commerce Department called the Center for AI Standards and Innovation (CAISI), to inspect their new models prior to release. Anthropic wasn’t included, but it did sign a similar agreement with CAISI under President Biden in 2024. But new information reported by Politico about the in-progress order says it may prohibit companies from “interfering” with government uses of AI. This is according to four of Politico’s seven anonymous sources cited in the story.

To refresh your memory, Anthropic has been blacklisted by the Pentagon—ostensibly for interfering with the government’s use of its models. It’s a bizarre story with many unanswered questions, but on its face, Anthropic appears to have refused to lift guardrails aimed at preventing the Pentagon from engaging in mass surveillance or full automation of weapons systems, which resulted in a standoff, threats, and ultimately the designation of Anthropic as a supply chain risk, and the requirement that all Pentagon contractors cut all business ties with it.

It’s not clear what this even is. Language about “interfering” could either reinforce Anthropic’s pariah status within the Trump Administration, attempt to circumvent it somehow, seek to resolve the matter while saving face, or be purely symbolic. So far the White House has not addressed any of the specific reports about this order, telling Politico discussion before the order is announced is “speculation.”



Source link