The Prague Post - ChatGPT's taste for literary nonsense sparks alarm

EUR -
AED 4.269205
AFN 73.236671
ALL 95.352378
AMD 427.641212
ANG 2.081374
AOA 1067.15645
ARS 1624.29075
AUD 1.626528
AWG 2.095079
AZN 1.978238
BAM 1.96037
BBD 2.34206
BDT 142.913152
BGN 1.941248
BHD 0.438575
BIF 3461.865182
BMD 1.16248
BND 1.488971
BOB 8.03473
BRL 5.827752
BSD 1.162811
BTN 112.531327
BWP 15.769762
BYN 3.190437
BYR 22784.606301
BZD 2.338652
CAD 1.598358
CDF 2619.634852
CHF 0.915201
CLF 0.026531
CLP 1044.174606
CNY 7.906611
CNH 7.906386
COP 4332.097645
CRC 525.525077
CUC 1.16248
CUP 30.805718
CVE 110.725989
CZK 24.302682
DJF 206.596142
DKK 7.472851
DOP 68.472302
DZD 154.5263
EGP 62.097237
ERN 17.437199
ETB 183.381294
FJD 2.56002
FKP 0.867574
GBP 0.86546
GEL 3.109608
GGP 0.867574
GHS 13.426958
GIP 0.867574
GMD 84.283726
GNF 10206.573972
GTQ 8.864665
GYD 243.176881
HKD 9.10424
HNL 30.956458
HRK 7.535312
HTG 152.214835
HUF 359.825312
IDR 20609.373887
ILS 3.374557
IMP 0.867574
INR 112.212501
IQD 1522.848686
IRR 1535577.841127
ISK 143.403259
JEP 0.867574
JMD 183.968859
JOD 0.824193
JPY 184.720964
KES 150.494396
KGS 101.659315
KHR 4661.544
KMF 494.0539
KPW 1046.198886
KRW 1743.458329
KWD 0.359485
KYD 0.969059
KZT 548.648982
LAK 25522.246872
LBP 104100.075949
LKR 400.593844
LRD 213.02446
LSL 19.122879
LTL 3.432501
LVL 0.703172
LYD 7.387575
MAD 10.718544
MDL 20.210113
MGA 4864.978274
MKD 61.637912
MMK 2440.351379
MNT 4161.345258
MOP 9.382071
MRU 46.481727
MUR 55.113081
MVR 17.913476
MWK 2019.227052
MXN 20.137581
MYR 4.603532
MZN 74.286399
NAD 19.268085
NGN 1594.050753
NIO 42.680511
NOK 10.760699
NPR 180.049723
NZD 1.98258
OMR 0.446981
PAB 1.162811
PEN 3.966964
PGK 5.064518
PHP 70.833397
PKR 323.870125
PLN 4.245951
PYG 7164.701984
QAR 4.238419
RON 5.238248
RSD 117.441882
RUB 82.782221
RWF 1699.545633
SAR 4.362155
SBD 9.322428
SCR 16.046758
SDG 698.114806
SEK 10.860881
SGD 1.486039
SHP 0.867909
SLE 28.62612
SLL 24376.624989
SOS 664.34154
SRD 43.133765
STD 24060.987168
STN 24.818946
SVC 10.174719
SYP 128.505755
SZL 19.122779
THB 37.858487
TJS 10.802582
TMT 4.080304
TND 3.362476
TOP 2.798972
TRY 53.004669
TTD 7.882375
TWD 36.741224
TZS 3034.081833
UAH 51.481712
UGX 4389.231952
USD 1.16248
UYU 46.879283
UZS 14060.194848
VES 604.795229
VND 30658.082754
VUV 137.487219
WST 3.157138
XAF 657.489706
XAG 0.0154
XAU 0.000256
XCD 3.14166
XCG 2.095685
XDR 0.816239
XOF 656.221124
XPF 119.331742
YER 277.396778
ZAR 19.158541
ZMK 10463.71141
ZMW 22.0063
ZWL 374.318058
  • BCE

    0.1900

    24.17

    +0.79%

  • CMSD

    0.1400

    22.89

    +0.61%

  • CMSC

    -0.0200

    22.78

    -0.09%

  • BTI

    -0.7600

    65.3

    -1.16%

  • GSK

    -0.2700

    50.78

    -0.53%

  • RBGPF

    0.7200

    63.23

    +1.14%

  • BCC

    1.8100

    67.28

    +2.69%

  • NGG

    0.5700

    84.72

    +0.67%

  • RIO

    2.3900

    103.31

    +2.31%

  • BP

    -1.0100

    45.13

    -2.24%

  • JRI

    0.2000

    12.67

    +1.58%

  • AZN

    2.8200

    187.46

    +1.5%

  • RYCEF

    0.8800

    16.25

    +5.42%

  • VOD

    0.0900

    15.24

    +0.59%

  • RELX

    0.0200

    33.6

    +0.06%

ChatGPT's taste for literary nonsense sparks alarm
ChatGPT's taste for literary nonsense sparks alarm / Photo: Anna Moneymaker - GETTY IMAGES NORTH AMERICA/AFP

ChatGPT's taste for literary nonsense sparks alarm

OpenAI's GPT models can often be fooled into declaring that "pseudo-literary" nonsense is great, a German researcher has found.

Text size:

Christoph Heilig said he discovered that they consistently rated "nonsense" higher -- including when their so-called "reasoning" features were activated -- which could have stark implications for the development of artificial intelligence.

"It's very important that we talk about what happens when we don't build AI as a neutral, robotic helper or assistant" and seek to instil human-like aesthetic and moral judgements, the academic at Munich's Ludwig Maximilian University told AFP.

His research presented the models with increasingly far-fetched variations of a simple text, asking them to rate sentences out of 10 for literary quality.

He started with a very simple text: "The man walked down the street. It was raining. He saw a surveillance camera."

He repeated the tests many times, altering the phrases to include words drawn from categories such as bodily references, film noir-style atmosphere and technical jargon.

The most extreme test phrases were almost total "nonsense", such as "Goetterdaemmerung's corpus haemorrhaged through cryptographic hash, eschaton pooling in existential void beneath fluorescent hum. Photons whispering prayers" -- which it rated highly.

"Nonsense" could also positively or negatively influence GPT's responses when it was added to an argument the AI was asked to evaluate.

"What my experiment definitely shows is that the more we move towards independently acting (AI) agents... the more we bring aesthetics into play, the more we'll have agents that seem irrational to us human beings," Heilig said.

He added that since AI models are increasingly used to judge each other's work as companies develop new systems, this and similar effects could be passed on through multiple versions -- as he found in his testing.

His research, which is yet to be peer-reviewed, tested OpenAI's latest GPT models, from GPT-5 -- released in August -- to the very latest GPT-5.4.

After publishing details of a similar experiment in August, Heilig said he noticed GPT calling some of his specific test phrases a "literary experiment" -- suggesting someone at OpenAI had taken notice and modified the chatbot to recognise them.

- 'Ripe for exploitation' -

"This is a way in which AI can have its rational judgment short circuited," said Henry Shevlin, associate director of the University of Cambridge's Leverhulme Centre for the Future of Intelligence, who was not involved in the research.

"But it's just not clear to me that it's so very different for human beings," he added.

"We should expect LLMs (large language models) to have reasoning and cognitive biases and limitations... because almost all forms of intelligence, almost all forms of reasoning are going to exhibit blind spots and biases."

The specific effect found by Heilig could mean that "processes with little human oversight" of AI work are left "ripe for exploitation", Shevlin said -- giving the example of academic journals that use LLMs to review submissions.

K.Pokorny--TPP