The Prague Post - Grok shows 'flaws' in fact-checking Israel-Iran war: study

EUR -
AED 4.311949
AFN 78.774474
ALL 96.785497
AMD 449.925555
ANG 2.102142
AOA 1076.665434
ARS 1671.941563
AUD 1.778439
AWG 1.651102
AZN 2.00066
BAM 1.955911
BBD 2.363934
BDT 142.838113
BGN 1.956185
BHD 0.442525
BIF 3457.296368
BMD 1.174117
BND 1.513186
BOB 8.110461
BRL 6.268027
BSD 1.173667
BTN 104.276923
BWP 15.601886
BYN 3.979526
BYR 23012.687081
BZD 2.360534
CAD 1.639713
CDF 2976.386228
CHF 0.927733
CLF 0.02871
CLP 1126.283971
CNY 8.359129
CNH 8.378509
COP 4560.459026
CRC 589.833502
CUC 1.174117
CUP 31.114092
CVE 110.271263
CZK 24.260073
DJF 209.011872
DKK 7.468022
DOP 73.364167
DZD 151.963631
EGP 56.040183
ERN 17.61175
ETB 169.619634
FJD 2.644468
FKP 0.87099
GBP 0.865262
GEL 3.199515
GGP 0.87099
GHS 14.78884
GIP 0.87099
GMD 86.302098
GNF 10181.578296
GTQ 8.996511
GYD 245.553947
HKD 9.13363
HNL 30.819751
HRK 7.533255
HTG 153.578723
HUF 388.257361
IDR 19449.595168
ILS 3.881165
IMP 0.87099
INR 104.185601
IQD 1537.587332
IRR 49371.607136
ISK 142.009866
JEP 0.87099
JMD 188.510707
JOD 0.832495
JPY 173.117681
KES 151.638613
KGS 102.664298
KHR 4710.267535
KMF 493.12942
KPW 1056.675295
KRW 1652.651876
KWD 0.359116
KYD 0.978056
KZT 642.776509
LAK 25440.444973
LBP 105105.169792
LKR 355.030165
LRD 213.0321
LSL 20.232349
LTL 3.466862
LVL 0.710212
LYD 6.34336
MAD 10.683607
MDL 19.659117
MGA 5240.29764
MKD 61.6285
MMK 2464.790802
MNT 4223.559618
MOP 9.408634
MRU 46.785657
MUR 53.199671
MVR 17.968375
MWK 2035.215597
MXN 21.602225
MYR 4.940729
MZN 75.030435
NAD 20.232349
NGN 1727.807078
NIO 43.192453
NOK 11.704446
NPR 166.843476
NZD 2.017903
OMR 0.451256
PAB 1.173667
PEN 4.078732
PGK 4.994284
PHP 67.985705
PKR 332.569089
PLN 4.253967
PYG 8247.468442
QAR 4.278143
RON 5.088039
RSD 117.176655
RUB 96.535483
RWF 1702.496699
SAR 4.399623
SBD 9.664166
SCR 17.170975
SDG 706.235504
SEK 11.005659
SGD 1.513911
SHP 0.922671
SLE 27.369095
SLL 24620.644187
SOS 670.738097
SRD 44.739761
STD 24301.844905
STN 24.501392
SVC 10.269583
SYP 15265.52387
SZL 20.227149
THB 37.965108
TJS 10.933021
TMT 4.109408
TND 3.419294
TOP 2.749903
TRY 48.628982
TTD 7.958452
TWD 35.742872
TZS 2884.363827
UAH 48.431151
UGX 4067.231012
USD 1.174117
UYU 46.822659
UZS 14201.806638
VES 217.407137
VND 30979.068808
VUV 141.883276
WST 3.267748
XAF 655.994259
XAG 0.024466
XAU 0.000302
XCD 3.17311
XCG 2.11532
XDR 0.815846
XOF 655.994259
XPF 119.331742
YER 280.67302
ZAR 20.227871
ZMK 10568.463339
ZMW 27.963588
ZWL 378.065094
  • CMSC

    -0.0700

    23.87

    -0.29%

  • SCS

    -0.0100

    17.21

    -0.06%

  • CMSD

    0.0600

    24.45

    +0.25%

  • NGG

    0.9000

    73.43

    +1.23%

  • GSK

    -0.3400

    43.35

    -0.78%

  • BTI

    -0.3609

    51.24

    -0.7%

  • AZN

    1.6600

    85.31

    +1.95%

  • BP

    0.3000

    34.16

    +0.88%

  • RIO

    -0.1500

    66.11

    -0.23%

  • BCC

    -0.4100

    77.63

    -0.53%

  • JRI

    0.0300

    14.3

    +0.21%

  • BCE

    0.1300

    23.36

    +0.56%

  • RYCEF

    0.0800

    15.76

    +0.51%

  • RELX

    -0.2000

    46.41

    -0.43%

  • VOD

    -0.0600

    11.36

    -0.53%

  • RBGPF

    0.0000

    78.22

    0%

Grok shows 'flaws' in fact-checking Israel-Iran war: study
Grok shows 'flaws' in fact-checking Israel-Iran war: study / Photo: Lionel BONAVENTURE - AFP

Grok shows 'flaws' in fact-checking Israel-Iran war: study

Elon Musk's AI chatbot Grok produced inaccurate and contradictory responses when users sought to fact-check the Israel-Iran conflict, a study said Tuesday, raising fresh doubts about its reliability as a debunking tool.

Text size:

With tech platforms reducing their reliance on human fact-checkers, users are increasingly utilizing AI-powered chatbots -- including xAI's Grok -- in search of reliable information, but their responses are often themselves prone to misinformation.

"The investigation into Grok's performance during the first days of the Israel-Iran conflict exposes significant flaws and limitations in the AI chatbot's ability to provide accurate, reliable, and consistent information during times of crisis," said the study from the Digital Forensic Research Lab (DFRLab) of the Atlantic Council, an American think tank.

"Grok demonstrated that it struggles with verifying already-confirmed facts, analyzing fake visuals, and avoiding unsubstantiated claims."

The DFRLab analyzed around 130,000 posts in various languages on the platform X, where the AI assistant is built in, to find that Grok was "struggling to authenticate AI-generated media."

Following Iran's retaliatory strikes on Israel, Grok offered vastly different responses to similar prompts about an AI-generated video of a destroyed airport that amassed millions of views on X, the study found.

It oscillated -- sometimes within the same minute -- between denying the airport's destruction and confirming it had been damaged by strikes, the study said.

In some responses, Grok cited the a missile launched by Yemeni rebels as the source of the damage. In others, it wrongly identified the AI-generated airport as one in Beirut, Gaza, or Tehran.

When users shared another AI-generated video depicting buildings collapsing after an alleged Iranian strike on Tel Aviv, Grok responded that it appeared to be real, the study said.

The Israel-Iran conflict, which led to US air strikes against Tehran's nuclear program over the weekend, has churned out an avalanche of online misinformation including AI-generated videos and war visuals recycled from other conflicts.

AI chatbots also amplified falsehoods.

As the Israel-Iran war intensified, false claims spread across social media that China had dispatched military cargo planes to Tehran to offer its support.

When users asked the AI-operated X accounts of AI companies Perplexity and Grok about its validity, both wrongly responded that the claims were true, according to disinformation watchdog NewsGuard.

Researchers say Grok has previously made errors verifying information related to crises such as the recent India-Pakistan conflict and anti-immigration protests in Los Angeles.

Last month, Grok was under renewed scrutiny for inserting "white genocide" in South Africa, a far-right conspiracy theory, into unrelated queries.

Musk's startup xAI blamed an "unauthorized modification" for the unsolicited response.

Musk, a South African-born billionaire, has previously peddled the unfounded claim that South Africa's leaders were "openly pushing for genocide" of white people.

Musk himself blasted Grok after it cited Media Matters -- a liberal media watchdog he has targeted in multiple lawsuits -- as a source in some of its responses about misinformation.

"Shame on you, Grok," Musk wrote on X. "Your sourcing is terrible."

A.Novak--TPP