The Prague Post - Anthropic's Claude AI gets smarter -- and mischievious

EUR -
AED 4.270462
AFN 76.735326
ALL 96.500375
AMD 445.353536
ANG 2.081122
AOA 1066.15044
ARS 1731.475339
AUD 1.786219
AWG 2.09277
AZN 1.981121
BAM 1.958107
BBD 2.341759
BDT 142.457246
BGN 1.954874
BHD 0.437525
BIF 3429.81738
BMD 1.16265
BND 1.511281
BOB 8.033466
BRL 6.266456
BSD 1.16267
BTN 102.01921
BWP 16.599559
BYN 3.962469
BYR 22787.939203
BZD 2.338355
CAD 1.628001
CDF 2569.456831
CHF 0.925157
CLF 0.027914
CLP 1095.042324
CNY 8.27987
CNH 8.285032
COP 4495.095405
CRC 583.888
CUC 1.16265
CUP 30.810224
CVE 110.742867
CZK 24.31927
DJF 206.626608
DKK 7.471775
DOP 74.468187
DZD 151.513102
EGP 55.237998
ERN 17.439749
ETB 176.868172
FJD 2.641313
FKP 0.874433
GBP 0.873779
GEL 3.156641
GGP 0.874433
GHS 12.643865
GIP 0.874433
GMD 85.459249
GNF 10089.47676
GTQ 8.905493
GYD 243.246619
HKD 9.033349
HNL 30.403748
HRK 7.534558
HTG 152.249397
HUF 390.057885
IDR 19308.767333
ILS 3.817974
IMP 0.874433
INR 102.103978
IQD 1523.071447
IRR 48918.497449
ISK 143.192418
JEP 0.874433
JMD 186.439683
JOD 0.824365
JPY 177.659936
KES 150.218794
KGS 101.674186
KHR 4691.292993
KMF 492.96399
KPW 1046.403068
KRW 1673.030484
KWD 0.356515
KYD 0.968942
KZT 626.027653
LAK 25241.131023
LBP 104115.304266
LKR 353.096056
LRD 213.118123
LSL 20.067782
LTL 3.433004
LVL 0.703276
LYD 6.325258
MAD 10.724329
MDL 19.904454
MGA 5266.804719
MKD 61.624998
MMK 2440.864264
MNT 4178.343982
MOP 9.305164
MRU 46.593242
MUR 52.947519
MVR 17.792891
MWK 2018.945998
MXN 21.46374
MYR 4.911079
MZN 74.297668
NAD 20.067777
NGN 1697.736788
NIO 42.557316
NOK 11.648711
NPR 163.230336
NZD 2.022475
OMR 0.44629
PAB 1.16267
PEN 3.934993
PGK 4.901777
PHP 68.311543
PKR 326.705036
PLN 4.244545
PYG 8226.693576
QAR 4.233616
RON 5.086249
RSD 117.430016
RUB 92.569097
RWF 1685.261116
SAR 4.360096
SBD 9.561428
SCR 16.259909
SDG 699.338224
SEK 10.926356
SGD 1.510403
SHP 0.872289
SLE 26.927404
SLL 24380.187775
SOS 664.45871
SRD 46.195615
STD 24064.506778
STN 24.822577
SVC 10.172943
SYP 12855.611086
SZL 20.044514
THB 38.024511
TJS 10.841775
TMT 4.080901
TND 3.408313
TOP 2.723047
TRY 48.759848
TTD 7.8923
TWD 35.865779
TZS 2893.539317
UAH 48.895614
UGX 4045.767158
USD 1.16265
UYU 46.374644
UZS 14102.944395
VES 246.694981
VND 30583.507181
VUV 141.916058
WST 3.256743
XAF 656.730831
XAG 0.023914
XAU 0.000283
XCD 3.14212
XCG 2.095369
XDR 0.81639
XOF 655.15743
XPF 119.331742
YER 277.761248
ZAR 20.070598
ZMK 10465.248981
ZMW 25.665242
ZWL 374.372813
  • CMSD

    -0.0500

    24.65

    -0.2%

  • JRI

    0.1200

    14.07

    +0.85%

  • BCC

    1.1200

    73.09

    +1.53%

  • BCE

    -0.0500

    23.81

    -0.21%

  • SCS

    0.0400

    16.78

    +0.24%

  • NGG

    0.2500

    76.95

    +0.32%

  • CMSC

    0.0900

    24.28

    +0.37%

  • RIO

    -0.0800

    70.54

    -0.11%

  • GSK

    -2.3000

    43.24

    -5.32%

  • RYCEF

    0.1300

    14.88

    +0.87%

  • BTI

    0.2200

    52.07

    +0.42%

  • RBGPF

    0.0000

    79.09

    0%

  • RELX

    0.6200

    46.57

    +1.33%

  • VOD

    0.0700

    11.73

    +0.6%

  • BP

    -0.4600

    34.54

    -1.33%

  • AZN

    -0.1100

    83.29

    -0.13%

Anthropic's Claude AI gets smarter -- and mischievious
Anthropic's Claude AI gets smarter -- and mischievious / Photo: Julie JAMMOT - AFP

Anthropic's Claude AI gets smarter -- and mischievious

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

Text size:

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.

C.Zeman--TPP