The Prague Post - Anthropic's Claude AI gets smarter -- and mischievious

EUR -
AED 4.272483
AFN 80.853073
ALL 97.665328
AMD 445.199497
ANG 2.082185
AOA 1066.812425
ARS 1581.61212
AUD 1.78504
AWG 2.094071
AZN 1.975687
BAM 1.955813
BBD 2.342866
BDT 141.500942
BGN 1.954687
BHD 0.438495
BIF 3429.041155
BMD 1.163373
BND 1.498842
BOB 8.038157
BRL 6.362025
BSD 1.163233
BTN 102.621683
BWP 15.681172
BYN 3.92421
BYR 22802.105732
BZD 2.339466
CAD 1.603878
CDF 3334.804697
CHF 0.936829
CLF 0.028874
CLP 1132.706449
CNY 8.305902
CNH 8.307087
COP 4659.482335
CRC 588.116552
CUC 1.163373
CUP 30.829378
CVE 110.461781
CZK 24.490109
DJF 206.754844
DKK 7.463182
DOP 73.525797
DZD 151.126655
EGP 56.445917
ERN 17.450591
ETB 165.95518
FJD 2.630334
FKP 0.859049
GBP 0.86958
GEL 3.135235
GGP 0.859049
GHS 13.670006
GIP 0.859049
GMD 83.762384
GNF 10074.808025
GTQ 8.916059
GYD 243.262664
HKD 9.082917
HNL 30.709629
HRK 7.534697
HTG 152.184283
HUF 395.651515
IDR 19139.633836
ILS 3.942729
IMP 0.859049
INR 102.375464
IQD 1524.018291
IRR 48919.823774
ISK 143.607129
JEP 0.859049
JMD 186.591242
JOD 0.824808
JPY 172.840543
KES 150.657511
KGS 101.657948
KHR 4659.308025
KMF 492.68993
KPW 1047.007807
KRW 1624.068646
KWD 0.355946
KYD 0.969311
KZT 628.19418
LAK 25230.652726
LBP 104188.750357
LKR 351.529893
LRD 235.059908
LSL 20.603155
LTL 3.435137
LVL 0.703712
LYD 6.293838
MAD 10.52276
MDL 19.332212
MGA 5206.092501
MKD 61.52827
MMK 2442.319185
MNT 4185.24298
MOP 9.352902
MRU 46.511241
MUR 53.340489
MVR 17.919176
MWK 2020.778395
MXN 21.792996
MYR 4.92165
MZN 74.351705
NAD 20.602969
NGN 1788.929793
NIO 42.813318
NOK 11.676266
NPR 164.195093
NZD 1.985127
OMR 0.447315
PAB 1.163233
PEN 4.116593
PGK 4.923974
PHP 66.70781
PKR 327.896619
PLN 4.263865
PYG 8402.05591
QAR 4.235609
RON 5.079748
RSD 117.181944
RUB 93.708437
RWF 1682.236984
SAR 4.365348
SBD 9.575246
SCR 16.503102
SDG 698.599579
SEK 11.010503
SGD 1.499756
SHP 0.914228
SLE 27.10726
SLL 24395.34234
SOS 664.867895
SRD 44.967851
STD 24079.466908
STN 24.867092
SVC 10.178068
SYP 15125.62995
SZL 20.602897
THB 37.603739
TJS 10.94572
TMT 4.071805
TND 3.354877
TOP 2.724733
TRY 47.895822
TTD 7.896222
TWD 35.784225
TZS 2905.339585
UAH 48.107607
UGX 4119.027409
USD 1.163373
UYU 46.54051
UZS 14454.906063
VES 173.669272
VND 30649.054873
VUV 138.914638
WST 3.096669
XAF 655.969823
XAG 0.028438
XAU 0.00033
XCD 3.144073
XCG 2.096459
XDR 0.810902
XOF 650.907152
XPF 119.331742
YER 279.384123
ZAR 20.600155
ZMK 10471.775646
ZMW 27.562776
ZWL 374.605548
  • CMSC

    -0.0810

    23.659

    -0.34%

  • BCC

    -1.0000

    85.78

    -1.17%

  • SCS

    0.0300

    16.77

    +0.18%

  • BCE

    -0.5300

    24.43

    -2.17%

  • GSK

    -0.7100

    38.96

    -1.82%

  • AZN

    0.2900

    80.19

    +0.36%

  • BTI

    -1.6500

    55.24

    -2.99%

  • RIO

    -0.8300

    61.89

    -1.34%

  • RBGPF

    -1.0000

    76

    -1.32%

  • NGG

    -2.5900

    67.98

    -3.81%

  • CMSD

    0.0100

    23.63

    +0.04%

  • BP

    0.0000

    35.23

    0%

  • JRI

    -0.0900

    13.51

    -0.67%

  • RYCEF

    0.0800

    14.35

    +0.56%

  • RELX

    -1.2300

    45.44

    -2.71%

  • VOD

    -0.2400

    11.72

    -2.05%

Anthropic's Claude AI gets smarter -- and mischievious
Anthropic's Claude AI gets smarter -- and mischievious / Photo: Julie JAMMOT - AFP

Anthropic's Claude AI gets smarter -- and mischievious

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

Text size:

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.

C.Zeman--TPP