The Prague Post - ChatGPT's taste for literary nonsense sparks alarm

EUR -
AED 4.241855
AFN 72.754432
ALL 96.085419
AMD 435.786045
ANG 2.067238
AOA 1058.976619
ARS 1584.416613
AUD 1.668657
AWG 2.081577
AZN 1.963562
BAM 1.958501
BBD 2.324366
BDT 141.598951
BGN 1.973957
BHD 0.4371
BIF 3429.837876
BMD 1.154828
BND 1.483084
BOB 7.992229
BRL 6.039519
BSD 1.154021
BTN 108.748324
BWP 15.866361
BYN 3.465669
BYR 22634.620324
BZD 2.321041
CAD 1.59793
CDF 2639.364949
CHF 0.916119
CLF 0.026908
CLP 1062.27995
CNY 7.978876
CNH 7.987226
COP 4265.678972
CRC 535.051764
CUC 1.154828
CUP 30.602931
CVE 110.419186
CZK 24.48783
DJF 205.509637
DKK 7.471699
DOP 69.577759
DZD 153.567517
EGP 60.919445
ERN 17.322414
ETB 178.357225
FJD 2.596341
FKP 0.863621
GBP 0.864129
GEL 3.112263
GGP 0.863621
GHS 12.616672
GIP 0.863621
GMD 84.881166
GNF 10116.864079
GTQ 8.828404
GYD 241.439229
HKD 9.036947
HNL 30.644056
HRK 7.535594
HTG 151.132345
HUF 387.707374
IDR 19533.908305
ILS 3.605952
IMP 0.863621
INR 108.504369
IQD 1511.824159
IRR 1516461.819995
ISK 142.794582
JEP 0.863621
JMD 181.370119
JOD 0.818764
JPY 184.255628
KES 150.011361
KGS 100.990148
KHR 4621.4733
KMF 493.110949
KPW 1039.411558
KRW 1738.569596
KWD 0.354798
KYD 0.961751
KZT 555.968746
LAK 24926.915142
LBP 103344.902703
LKR 362.949956
LRD 211.76754
LSL 19.74324
LTL 3.409906
LVL 0.698544
LYD 7.369162
MAD 10.774645
MDL 20.270569
MGA 4809.737001
MKD 61.728412
MMK 2425.11916
MNT 4138.703025
MOP 9.299606
MRU 46.033882
MUR 53.849906
MVR 17.842152
MWK 2001.120298
MXN 20.502867
MYR 4.612359
MZN 73.795522
NAD 19.74324
NGN 1600.175159
NIO 42.469671
NOK 11.138601
NPR 173.997719
NZD 1.996437
OMR 0.444039
PAB 1.154016
PEN 3.993912
PGK 4.986964
PHP 69.450197
PKR 322.123193
PLN 4.272562
PYG 7553.009814
QAR 4.207018
RON 5.097294
RSD 117.41827
RUB 93.810626
RWF 1685.267852
SAR 4.332547
SBD 9.287166
SCR 15.993858
SDG 694.05154
SEK 10.849022
SGD 1.482671
SHP 0.86642
SLE 28.350504
SLL 24216.169179
SOS 659.529514
SRD 43.377631
STD 23902.59906
STN 24.534472
SVC 10.098101
SYP 128.697299
SZL 19.737732
THB 37.904329
TJS 11.044217
TMT 4.041896
TND 3.39495
TOP 2.780547
TRY 51.230572
TTD 7.833006
TWD 36.827525
TZS 2967.974997
UAH 50.639111
UGX 4293.013226
USD 1.154828
UYU 46.784924
UZS 14056.506376
VES 533.634686
VND 30430.861232
VUV 137.451427
WST 3.175234
XAF 656.877088
XAG 0.016748
XAU 0.000259
XCD 3.12098
XCG 2.079913
XDR 0.814663
XOF 656.87424
XPF 119.331742
YER 275.599659
ZAR 19.643269
ZMK 10394.833581
ZMW 21.667349
ZWL 371.854006
  • CMSC

    -0.1200

    22.79

    -0.53%

  • BCC

    -0.9900

    73.66

    -1.34%

  • GSK

    -0.3300

    54.37

    -0.61%

  • RIO

    -2.0750

    85.465

    -2.43%

  • AZN

    -3.6350

    183.505

    -1.98%

  • BCE

    -0.0750

    25.415

    -0.3%

  • NGG

    -1.8100

    82.48

    -2.19%

  • BTI

    -0.1600

    58.29

    -0.27%

  • BP

    0.8550

    46.265

    +1.85%

  • CMSD

    -0.0840

    22.596

    -0.37%

  • JRI

    -0.0020

    12.098

    -0.02%

  • RBGPF

    -13.5000

    69

    -19.57%

  • VOD

    -0.0150

    14.705

    -0.1%

  • RYCEF

    -0.6000

    15.3

    -3.92%

  • RELX

    -0.2800

    32.19

    -0.87%

ChatGPT's taste for literary nonsense sparks alarm
ChatGPT's taste for literary nonsense sparks alarm / Photo: Anna Moneymaker - GETTY IMAGES NORTH AMERICA/AFP

ChatGPT's taste for literary nonsense sparks alarm

OpenAI's GPT models can often be fooled into declaring that "pseudo-literary" nonsense is great, a German researcher has found.

Text size:

Christoph Heilig said he discovered that they consistently rated "nonsense" higher -- including when their so-called "reasoning" features were activated -- which could have stark implications for the development of artificial intelligence.

"It's very important that we talk about what happens when we don't build AI as a neutral, robotic helper or assistant" and seek to instil human-like aesthetic and moral judgements, the academic at Munich's Ludwig Maximilian University told AFP.

His research presented the models with increasingly far-fetched variations of a simple text, asking them to rate sentences out of 10 for literary quality.

He started with a very simple text: "The man walked down the street. It was raining. He saw a surveillance camera."

He repeated the tests many times, altering the phrases to include words drawn from categories such as bodily references, film noir-style atmosphere and technical jargon.

The most extreme test phrases were almost total "nonsense", such as "Goetterdaemmerung's corpus haemorrhaged through cryptographic hash, eschaton pooling in existential void beneath fluorescent hum. Photons whispering prayers" -- which it rated highly.

"Nonsense" could also positively or negatively influence GPT's responses when it was added to an argument the AI was asked to evaluate.

"What my experiment definitely shows is that the more we move towards independently acting (AI) agents... the more we bring aesthetics into play, the more we'll have agents that seem irrational to us human beings," Heilig said.

He added that since AI models are increasingly used to judge each other's work as companies develop new systems, this and similar effects could be passed on through multiple versions -- as he found in his testing.

His research, which is yet to be peer-reviewed, tested OpenAI's latest GPT models, from GPT-5 -- released in August -- to the very latest GPT-5.4.

After publishing details of a similar experiment in August, Heilig said he noticed GPT calling some of his specific test phrases a "literary experiment" -- suggesting someone at OpenAI had taken notice and modified the chatbot to recognise them.

- 'Ripe for exploitation' -

"This is a way in which AI can have its rational judgment short circuited," said Henry Shevlin, associate director of the University of Cambridge's Leverhulme Centre for the Future of Intelligence, who was not involved in the research.

"But it's just not clear to me that it's so very different for human beings," he added.

"We should expect LLMs (large language models) to have reasoning and cognitive biases and limitations... because almost all forms of intelligence, almost all forms of reasoning are going to exhibit blind spots and biases."

The specific effect found by Heilig could mean that "processes with little human oversight" of AI work are left "ripe for exploitation", Shevlin said -- giving the example of academic journals that use LLMs to review submissions.

K.Pokorny--TPP