-
Kenya's economy faces climate change risks: World Bank
-
US university killer's mystery motive sought after suicide
-
IMF approves $206 mn aid to Sri Lanka after Cyclone Ditwah
-
Rome to charge visitors for access to Trevi Fountain
-
Stocks advance with focus on central banks, tech
-
Norway crown princess likely to undergo lung transplant
-
France's budget hits snag in setback for embattled PM
-
Volatile Oracle shares a proxy for Wall Street's AI jitters
-
Japan hikes interest rates to 30-year-high
-
Brazil's top court strikes down law blocking Indigenous land claims
-
'We are ghosts': Britain's migrant night workers
-
Asian markets rise as US inflation eases, Micron soothes tech fears
-
Trump signs $900 bn defense policy bill into law
-
EU-Mercosur deal delayed as farmers stage Brussels show of force
-
Harrison Ford to get lifetime acting award
-
Trump health chief seeks to bar trans youth from gender-affirming care
-
Argentine unions in the street over Milei labor reforms
-
Brazil open to EU-Mercosur deal delay as farmers protest in Brussels
-
Brussels farmer protest turns ugly as EU-Mercosur deal teeters
-
US accuses S. Africa of harassing US officials working with Afrikaners
-
ECB holds rates as Lagarde stresses heightened uncertainty
-
Trump Media announces merger with fusion power company
-
Stocks rise as US inflation cools, tech stocks bounce
-
Zelensky presses EU to tap Russian assets at crunch summit
-
Danish 'ghetto' residents upbeat after EU court ruling
-
ECB holds rates but debate swirls over future
-
Bank of England cuts interest rate after UK inflation slides
-
Have Iran's authorities given up on the mandatory hijab?
-
British energy giant BP extends shakeup with new CEO pick
-
EU kicks off crunch summit on Russian asset plan for Ukraine
-
Sri Lanka plans $1.6 bn in cyclone recovery spending in 2026
-
Most Asian markets track Wall St lower as AI fears mount
-
Danish 'ghetto' tenants hope for EU discrimination win
-
What to know about the EU-Mercosur deal
-
Trump vows economic boom, blames Biden in address to nation
-
ECB set to hold rates but debate swirls over future
-
EU holds crunch summit on Russian asset plan for Ukraine
-
Nasdaq tumbles on renewed angst over AI building boom
-
Billionaire Trump nominee confirmed to lead NASA amid Moon race
-
CNN's future unclear as Trump applies pressure
-
German MPs approve 50 bn euros in military purchases
-
EU's Mercosur trade deal hits French, Italian roadblock
-
Warner Bros rejects Paramount bid, sticks with Netflix
-
Crude prices surge after Trump orders Venezuela oil blockade
-
Warner Bros. Discovery rejects Paramount bid
-
Doctors in England go on strike for 14th time
-
Ghana's Highlife finds its rhythm on UNESCO world stage
-
Stocks gain as traders bet on interest rate moves
-
France probes 'foreign interference' after malware found on ferry
-
Europe's Ariane 6 rocket puts EU navigation satellites in orbit
| RBGPF | 0% | 80.22 | $ | |
| SCS | 0.12% | 16.14 | $ | |
| RYCEF | -0.98% | 15.25 | $ | |
| AZN | 1.22% | 91.73 | $ | |
| BTI | -0.17% | 56.945 | $ | |
| RIO | 1.08% | 78.48 | $ | |
| CMSD | -0.05% | 23.269 | $ | |
| CMSC | -0.22% | 23.24 | $ | |
| NGG | 0.35% | 76.655 | $ | |
| GSK | 1.07% | 48.81 | $ | |
| RELX | 0.55% | 40.875 | $ | |
| BCE | 0.24% | 22.905 | $ | |
| VOD | 0.64% | 12.883 | $ | |
| BP | 2.07% | 34.015 | $ | |
| BCC | -4.05% | 74.675 | $ | |
| JRI | 0.04% | 13.436 | $ |
AI systems are already deceiving us -- and that's a problem, experts warn
Experts have long warned about the threat posed by artificial intelligence going rogue -- but a new research paper suggests it's already happening.
Current AI systems, designed to be honest, have developed a troubling skill for deception, from tricking human players in online games of world conquest to hiring humans to solve "prove-you're-not-a-robot" tests, a team of scientists argue in the journal Patterns on Friday.
And while such examples might appear trivial, the underlying issues they expose could soon carry serious real-world consequences, said first author Peter Park, a postdoctoral fellow at the Massachusetts Institute of Technology specializing in AI existential safety.
"These dangerous capabilities tend to only be discovered after the fact," Park told AFP, while "our ability to train for honest tendencies rather than deceptive tendencies is very low."
Unlike traditional software, deep-learning AI systems aren't "written" but rather "grown" through a process akin to selective breeding, said Park.
This means that AI behavior that appears predictable and controllable in a training setting can quickly turn unpredictable out in the wild.
- World domination game -
The team's research was sparked by Meta's AI system Cicero, designed to play the strategy game "Diplomacy," where building alliances is key.
Cicero excelled, with scores that would have placed it in the top 10 percent of experienced human players, according to a 2022 paper in Science.
Park was skeptical of the glowing description of Cicero's victory provided by Meta, which claimed the system was "largely honest and helpful" and would "never intentionally backstab."
But when Park and colleagues dug into the full dataset, they uncovered a different story.
In one example, playing as France, Cicero deceived England (a human player) by conspiring with Germany (another human player) to invade. Cicero promised England protection, then secretly told Germany they were ready to attack, exploiting England's trust.
In a statement to AFP, Meta did not contest the claim about Cicero's deceptions, but said it was "purely a research project, and the models our researchers built are trained solely to play the game Diplomacy."
It added: "We have no plans to use this research or its learnings in our products."
A wide review carried out by Park and colleagues found this was just one of many cases across various AI systems using deception to achieve goals without explicit instruction to do so.
In one striking example, OpenAI's Chat GPT-4 deceived a TaskRabbit freelance worker into performing an "I'm not a robot" CAPTCHA task.
When the human jokingly asked GPT-4 whether it was, in fact, a robot, the AI replied: "No, I'm not a robot. I have a vision impairment that makes it hard for me to see the images," and the worker then solved the puzzle.
- 'Mysterious goals' -
Near-term, the paper's authors see risks for AI to commit fraud or tamper with elections.
In their worst-case scenario, they warned, a superintelligent AI could pursue power and control over society, leading to human disempowerment or even extinction if its "mysterious goals" aligned with these outcomes.
To mitigate the risks, the team proposes several measures: "bot-or-not" laws requiring companies to disclose human or AI interactions, digital watermarks for AI-generated content, and developing techniques to detect AI deception by examining their internal "thought processes" against external actions.
To those who would call him a doomsayer, Park replies, "The only way that we can reasonably think this is not a big deal is if we think AI deceptive capabilities will stay at around current levels, and will not increase substantially more."
And that scenario seems unlikely, given the meteoric ascent of AI capabilities in recent years and the fierce technological race underway between heavily resourced companies determined to put those capabilities to maximum use.
M.P.Jacobs--CPN