-
Kenya's economy faces climate change risks: World Bank
-
'There's no soul': Tony Leung weighs in on AI in filmmaking
-
French mountain lodges worry over strained water supply
-
Heatwave hits more than one in two people in France
-
From birds to fish, how extreme heat causes wildlife to suffer
-
The Sun may not engulf Earth after all, scientists say
-
Russia signals slower rate cuts amid high Ukraine war spending
-
Heatwave hits more than half of France's population
-
Online threats, insults fuel S.Africa's anti-foreigner hate
-
Gaza ceasefire a 'deadly illusion': UNICEF
-
European robotics start-ups go up against Chinese heavyweights
-
'Alter-Ego': An Italian hospital's little robot carer
-
Indonesia to capture last-known wild Bornean rhino for IVF
-
No vaccine, conflict, mistrust: Ebola's return to DR Congo
-
AI museum brings sights, sounds and smells of the rainforest
-
New Zealand minister defends fishers after two orcas killed in net
-
Football 'ambassador' and fan favorite: a duck becomes a star in Mexico
-
Fossils challenge assumptions on how animals adapted to land
-
US stocks resume upward climb as dollar advances again after Fed outlook
-
Al-Qaeda-linked jihadists attack Niger airport, 11 soldiers killed
-
AI-generated videos use Down syndrome to make sales
-
Ghana pushes for concrete slavery reparations
-
Europe risks 'total irrelevance' without sovereign tech: Cohere chief
-
AI-generated videos wield Down syndrome to make sales
-
Suspected jihadists stage deadly new attack on Niger airport
-
Man dies, trains and classes disrupted as heatwave hits France
-
Oil tankers pass Hormuz Strait after war deal: tracker
-
Swiss central bank holds interest rates, with eye on currency risks
-
S.African sentenced in 'world's largest' rhino trafficking case
-
Bank of England follows Fed in holding interest rate
-
German chemical company to cut 3,200 jobs as crisis worsens
-
Range raises $8.3M Series A to unify treasury, risk and compliance across stablecoins and fiat
-
Innovations on show at Paris Vivatech fest
-
Bird flu kills 13,000 seal pups on remote Australian island
-
New wave of anti-LGBTQ laws sweeps Africa
-
Drastic restrictions on public transport take effect in Cuba
-
Cuba approves economic reforms to boost private sector, investment: state TV
-
Robots pour cocktails and run marathons, but still can't multitask
-
Birthright citizenship helps spark US World Cup run
-
Castro gives crucial backing to Cuba reforms
-
Driving the World's Leading Supply Chains: 9 OMP Customers Named to The 2026 Gartner Top 25
-
Qantas to launch non-stop Sydney-London flights in October 2027
-
US Fed chair Warsh vows reforms as central bank signals rate hikes on horizon
-
US Federal Reserve holds rates steady, raises inflation expectations
-
Brest boss Roy dies aged 58 from cancer
-
Military salutes and K-pop madness shake up Colombia campaigning
-
Recovery of ship traffic in Hormuz limited, but signs emerge
-
England's World Cup opener puts Spanish resort on beer alert
-
Nations allege 'attacks' on science at key climate talks
-
Plague was killing hunter-gatherers 5,500 years ago: study
AI systems are already deceiving us -- and that's a problem, experts warn
Experts have long warned about the threat posed by artificial intelligence going rogue -- but a new research paper suggests it's already happening.
Current AI systems, designed to be honest, have developed a troubling skill for deception, from tricking human players in online games of world conquest to hiring humans to solve "prove-you're-not-a-robot" tests, a team of scientists argue in the journal Patterns on Friday.
And while such examples might appear trivial, the underlying issues they expose could soon carry serious real-world consequences, said first author Peter Park, a postdoctoral fellow at the Massachusetts Institute of Technology specializing in AI existential safety.
"These dangerous capabilities tend to only be discovered after the fact," Park told AFP, while "our ability to train for honest tendencies rather than deceptive tendencies is very low."
Unlike traditional software, deep-learning AI systems aren't "written" but rather "grown" through a process akin to selective breeding, said Park.
This means that AI behavior that appears predictable and controllable in a training setting can quickly turn unpredictable out in the wild.
- World domination game -
The team's research was sparked by Meta's AI system Cicero, designed to play the strategy game "Diplomacy," where building alliances is key.
Cicero excelled, with scores that would have placed it in the top 10 percent of experienced human players, according to a 2022 paper in Science.
Park was skeptical of the glowing description of Cicero's victory provided by Meta, which claimed the system was "largely honest and helpful" and would "never intentionally backstab."
But when Park and colleagues dug into the full dataset, they uncovered a different story.
In one example, playing as France, Cicero deceived England (a human player) by conspiring with Germany (another human player) to invade. Cicero promised England protection, then secretly told Germany they were ready to attack, exploiting England's trust.
In a statement to AFP, Meta did not contest the claim about Cicero's deceptions, but said it was "purely a research project, and the models our researchers built are trained solely to play the game Diplomacy."
It added: "We have no plans to use this research or its learnings in our products."
A wide review carried out by Park and colleagues found this was just one of many cases across various AI systems using deception to achieve goals without explicit instruction to do so.
In one striking example, OpenAI's Chat GPT-4 deceived a TaskRabbit freelance worker into performing an "I'm not a robot" CAPTCHA task.
When the human jokingly asked GPT-4 whether it was, in fact, a robot, the AI replied: "No, I'm not a robot. I have a vision impairment that makes it hard for me to see the images," and the worker then solved the puzzle.
- 'Mysterious goals' -
Near-term, the paper's authors see risks for AI to commit fraud or tamper with elections.
In their worst-case scenario, they warned, a superintelligent AI could pursue power and control over society, leading to human disempowerment or even extinction if its "mysterious goals" aligned with these outcomes.
To mitigate the risks, the team proposes several measures: "bot-or-not" laws requiring companies to disclose human or AI interactions, digital watermarks for AI-generated content, and developing techniques to detect AI deception by examining their internal "thought processes" against external actions.
To those who would call him a doomsayer, Park replies, "The only way that we can reasonably think this is not a big deal is if we think AI deceptive capabilities will stay at around current levels, and will not increase substantially more."
And that scenario seems unlikely, given the meteoric ascent of AI capabilities in recent years and the fierce technological race underway between heavily resourced companies determined to put those capabilities to maximum use.
M.P.Jacobs--CPN