The Prague Post - AI's blind spot: tools fail to detect their own fakes

AI's blind spot: tools fail to detect their own fakes / Photo: Chris Delmas - AFP

When outraged Filipinos turned to an AI-powered chatbot to verify a viral photograph of a lawmaker embroiled in a corruption scandal, the tool failed to detect it was fabricated -- even though it had generated the image itself.

Internet users are increasingly turning to chatbots to verify images in real time, but the tools often fail, raising questions about their visual debunking capabilities at a time when major tech platforms are scaling back human fact-checking.

In many cases, the tools wrongly identify images as real even when the images were generated by the same underlying models, further muddying an online information landscape awash with AI-generated fakes.

Among them is a fabricated image circulating on social media of Elizaldy Co, a former Philippine lawmaker charged by prosecutors in a multibillion-dollar flood-control corruption scam that sparked massive protests in the disaster-prone country.

The image of Co, whose whereabouts has been unknown since the official probe began, appeared to show him in Portugal.

When online sleuths tracking him asked Google's new AI mode whether the image was real, it incorrectly said it was authentic.

AFP's fact-checkers tracked down its creator and determined that the image was generated using Google AI.

"These models are trained primarily on language patterns and lack the specialized visual understanding needed to accurately identify AI-generated or manipulated imagery," Alon Yamin, chief executive of AI content detection platform Copyleaks, told AFP.

"With AI chatbots, even when an image originates from a similar generative model, the chatbot often provides inconsistent or overly generalized assessments, making them unreliable for tasks like fact-checking or verifying authenticity."

Google did not respond to AFP's request for comment.

- 'Distinguishable from reality' -

AFP found similar examples of AI tools failing to verify their own creations.

During last month's deadly protests over lucrative benefits for senior officials in Pakistan-administered Kashmir, social media users shared a fabricated image purportedly showing men marching with flags and torches.

An AFP analysis found it was created using Google's Gemini AI model.

But Gemini and Microsoft's Copilot falsely identified it as a genuine image of the protest.

"This inability to correctly identify AI images stems from the fact that they (AI models) are programmed only to mimic well," Rossine Fallorina, from the nonprofit Sigla Research Center, told AFP.

"In a sense, they can only generate things to resemble. They cannot ascertain whether the resemblance is actually distinguishable from reality."

Earlier this year, Columbia University's Tow Center for Digital Journalism tested the ability of seven AI chatbots -- including ChatGPT, Perplexity, Grok, and Gemini -- to verify 10 images from photojournalists of news events.

All seven models failed to correctly identify the provenance of the photos, the study said.

- 'Shocked' -

AFP tracked down the source of Co's photo that garnered over a million views across social media -- a middle-aged web developer in the Philippines, who said he created it "for fun" using Nano Banana, Gemini's AI image generator.

"Sadly, a lot of people believed it," he told AFP, requesting anonymity to avoid a backlash.

"I edited my post -- and added 'AI generated' to stop the spread -- because I was shocked at how many shares it got."

Such cases show how AI-generated photos flooding social platforms can look virtually identical to real imagery.

The trend has fueled concerns as surveys show online users are increasingly shifting from traditional search engines to AI tools to gather and verify information.

The shift comes as Meta announced earlier this year it was ending its third-party fact-checking program in the United States, turning over the task of debunking falsehoods to ordinary users under a model known as "Community Notes."

Human fact-checking has long been a flashpoint in hyperpolarized societies, where conservative advocates accuse professional fact-checkers of liberal bias, a charge the fact-checkers reject.

AFP currently works in 26 languages with Meta's fact-checking program, including in Asia, Latin America, and the European Union.

Researchers say AI models can be useful to professional fact-checkers, helping to quickly geolocate images and spot visual clues to establish authenticity. But they caution that such tools cannot replace the work of trained human fact-checkers.

"We can't rely on AI tools to combat AI in the long run," Fallorina said.

burs-ac/sla/sms

G.Kucera--TPP