The Prague Post - Anthropic's Claude AI gets smarter -- and mischievious

EUR -
AED 4.239213
AFN 72.708767
ALL 95.386618
AMD 425.323465
ANG 2.066384
AOA 1059.471478
ARS 1669.331744
AUD 1.636279
AWG 2.077395
AZN 1.961708
BAM 1.956925
BBD 2.322484
BDT 141.539493
BGN 1.927268
BHD 0.435243
BIF 3445.01293
BMD 1.154108
BND 1.486054
BOB 7.996595
BRL 5.997787
BSD 1.153038
BTN 110.30295
BWP 15.649925
BYN 3.235359
BYR 22620.520413
BZD 2.319082
CAD 1.609756
CDF 2654.449107
CHF 0.920167
CLF 0.026996
CLP 1062.589273
CNY 7.808292
CNH 7.828892
COP 4144.564065
CRC 532.094231
CUC 1.154108
CUP 30.583867
CVE 110.967111
CZK 24.206615
DJF 205.108345
DKK 7.47404
DOP 67.226823
DZD 154.290387
EGP 60.068907
ERN 17.311623
ETB 183.280746
FJD 2.558539
FKP 0.864742
GBP 0.864571
GEL 3.069643
GGP 0.864742
GHS 13.635853
GIP 0.864742
GMD 84.249662
GNF 10130.189534
GTQ 8.790899
GYD 241.247583
HKD 9.04515
HNL 30.780263
HRK 7.535865
HTG 150.764021
HUF 355.58823
IDR 20956.007884
ILS 3.380418
IMP 0.864742
INR 110.322759
IQD 1511.881721
IRR 1587043.016681
ISK 143.409458
JEP 0.864742
JMD 182.034602
JOD 0.818246
JPY 184.826987
KES 149.30745
KGS 100.926411
KHR 4630.861153
KMF 493.958018
KPW 1038.530307
KRW 1761.862465
KWD 0.357035
KYD 0.960948
KZT 561.58297
LAK 25390.379769
LBP 103350.388122
LKR 388.736643
LRD 210.653608
LSL 19.100661
LTL 3.407781
LVL 0.698109
LYD 7.334352
MAD 10.688146
MDL 20.087456
MGA 4847.25442
MKD 61.630898
MMK 2422.8188
MNT 4130.308878
MOP 9.307027
MRU 46.204688
MUR 55.085534
MVR 17.830794
MWK 2004.686122
MXN 20.139876
MYR 4.70091
MZN 73.759008
NAD 19.100245
NGN 1570.360016
NIO 42.251951
NOK 10.924107
NPR 176.48665
NZD 1.984073
OMR 0.44375
PAB 1.153143
PEN 4.006198
PGK 5.032064
PHP 71.052677
PKR 321.418795
PLN 4.238927
PYG 7096.077614
QAR 4.198067
RON 5.242305
RSD 117.369371
RUB 84.221435
RWF 1688.460274
SAR 4.332289
SBD 9.288936
SCR 15.584271
SDG 693.040598
SEK 10.879172
SGD 1.486399
SHP 0.861658
SLE 28.393591
SLL 24201.074001
SOS 658.996031
SRD 43.105364
STD 23887.709281
STN 24.813326
SVC 10.089579
SYP 127.565999
SZL 19.10021
THB 37.859387
TJS 10.787352
TMT 4.039379
TND 3.949647
TOP 2.778815
TRY 53.2161
TTD 7.810319
TWD 36.416034
TZS 3029.531661
UAH 51.474279
UGX 4347.479354
USD 1.154108
UYU 46.446891
UZS 13811.787496
VES 649.284051
VND 30404.980117
VUV 136.507437
WST 3.147269
XAF 656.331304
XAG 0.016844
XAU 0.000266
XCD 3.119035
XCG 2.078149
XDR 0.817577
XOF 651.497317
XPF 119.331742
YER 275.39902
ZAR 19.05323
ZMK 10388.356246
ZMW 20.265645
ZWL 371.622364
  • RYCEF

    -0.3300

    16.52

    -2%

  • RBGPF

    1.4900

    61.5

    +2.42%

  • GSK

    -0.8800

    50.64

    -1.74%

  • RELX

    -0.6300

    34.52

    -1.83%

  • BCC

    -0.1100

    67.97

    -0.16%

  • BCE

    -0.2300

    24.18

    -0.95%

  • BTI

    -0.0300

    59.69

    -0.05%

  • CMSC

    -0.1100

    22.36

    -0.49%

  • RIO

    0.2400

    100.93

    +0.24%

  • VOD

    0.1100

    14.81

    +0.74%

  • JRI

    -0.1400

    12.46

    -1.12%

  • CMSD

    -0.1100

    22.41

    -0.49%

  • AZN

    -4.4000

    181.55

    -2.42%

  • BP

    0.7500

    43.72

    +1.72%

  • NGG

    -1.6900

    80.17

    -2.11%

Anthropic's Claude AI gets smarter -- and mischievious
Anthropic's Claude AI gets smarter -- and mischievious / Photo: Julie JAMMOT - AFP

Anthropic's Claude AI gets smarter -- and mischievious

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

Text size:

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.

C.Zeman--TPP