The Prague Post - Graid Technology Launches Agentic AI Storage Portfolio to Eliminate KV Cache Bottlenecks

EUR -
AED 4.228481
AFN 71.967082
ALL 95.162493
AMD 424.333374
ANG 2.061519
AOA 1056.976279
ARS 1671.113607
AUD 1.638481
AWG 2.075382
AZN 1.963455
BAM 1.935095
BBD 2.3198
BDT 141.367426
BGN 1.92273
BHD 0.434218
BIF 3434.598136
BMD 1.151391
BND 1.477667
BOB 7.957752
BRL 5.95511
BSD 1.151737
BTN 109.295062
BWP 15.473507
BYN 3.2308
BYR 22567.255603
BZD 2.316326
CAD 1.604786
CDF 2648.198237
CHF 0.917082
CLF 0.02675
CLP 1052.819726
CNY 7.789906
CNH 7.819623
COP 4148.609984
CRC 529.759398
CUC 1.151391
CUP 30.511851
CVE 110.706288
CZK 24.188184
DJF 204.625462
DKK 7.47595
DOP 67.06878
DZD 155.102791
EGP 60.048294
ERN 17.270859
ETB 182.90938
FJD 2.55528
FKP 0.863082
GBP 0.864083
GEL 3.06293
GGP 0.863082
GHS 13.603664
GIP 0.863082
GMD 84.051694
GNF 10106.33036
GTQ 8.779068
GYD 240.883689
HKD 9.020121
HNL 30.707771
HRK 7.528828
HTG 150.596527
HUF 355.619617
IDR 20835.218735
ILS 3.373747
IMP 0.863082
INR 109.325111
IQD 1508.321676
IRR 1583305.987793
ISK 143.509664
JEP 0.863082
JMD 182.091693
JOD 0.816309
JPY 184.490782
KES 148.943996
KGS 100.689136
KHR 4619.950388
KMF 492.794533
KPW 1036.084868
KRW 1795.432879
KWD 0.356148
KYD 0.959735
KZT 560.422908
LAK 25330.59262
LBP 103983.608886
LKR 387.553338
LRD 210.157607
LSL 19.054982
LTL 3.399757
LVL 0.696464
LYD 7.317038
MAD 10.663043
MDL 19.970411
MGA 4835.840495
MKD 61.623792
MMK 2416.900549
MNT 4118.573956
MOP 9.292613
MRU 46.095853
MUR 54.771565
MVR 17.788637
MWK 1999.965691
MXN 20.162392
MYR 4.639531
MZN 73.585399
NAD 19.055559
NGN 1566.236077
NIO 42.152757
NOK 10.849646
NPR 174.879969
NZD 1.990202
OMR 0.445939
PAB 1.151682
PEN 3.996764
PGK 5.020023
PHP 71.154209
PKR 320.668178
PLN 4.24714
PYG 7039.678324
QAR 4.188182
RON 5.232723
RSD 116.487319
RUB 84.897533
RWF 1684.484436
SAR 4.327176
SBD 9.267063
SCR 15.900549
SDG 691.411546
SEK 10.924831
SGD 1.485176
SHP 0.859629
SLE 28.327065
SLL 24144.087442
SOS 657.444188
SRD 42.960114
STD 23831.460606
STN 24.754898
SVC 10.077178
SYP 127.265617
SZL 19.0557
THB 37.682679
TJS 10.745474
TMT 4.029867
TND 3.359177
TOP 2.772272
TRY 53.063566
TTD 7.802919
TWD 36.358149
TZS 3025.276505
UAH 51.08699
UGX 4339.568395
USD 1.151391
UYU 46.502443
UZS 13779.269707
VES 647.755174
VND 30316.114287
VUV 136.936689
WST 3.139855
XAF 649.004463
XAG 0.016974
XAU 0.000266
XCD 3.111691
XCG 2.075798
XDR 0.815652
XOF 649.961659
XPF 119.331742
YER 274.750621
ZAR 19.112852
ZMK 10363.890591
ZMW 20.247449
ZWL 370.747301
  • CMSC

    -0.1384

    22.47

    -0.62%

  • CMSD

    -0.1300

    22.52

    -0.58%

  • GSK

    0.2500

    51.52

    +0.49%

  • BCE

    0.3300

    24.41

    +1.35%

  • NGG

    0.4800

    81.86

    +0.59%

  • BP

    -1.0700

    42.97

    -2.49%

  • BCC

    -0.4000

    68.08

    -0.59%

  • BTI

    1.8700

    59.72

    +3.13%

  • AZN

    4.1500

    185.95

    +2.23%

  • RIO

    -4.7100

    100.69

    -4.68%

  • JRI

    -0.2100

    12.6

    -1.67%

  • RBGPF

    0.5500

    60.56

    +0.91%

  • RYCEF

    -0.4400

    16.7

    -2.63%

  • VOD

    -0.4000

    14.7

    -2.72%

  • RELX

    0.6900

    35.15

    +1.96%

Graid Technology Launches Agentic AI Storage Portfolio to Eliminate KV Cache Bottlenecks
Graid Technology Launches Agentic AI Storage Portfolio to Eliminate KV Cache Bottlenecks

Graid Technology Launches Agentic AI Storage Portfolio to Eliminate KV Cache Bottlenecks

From edge inference to NVIDIA STX, purpose-built KV cache infrastructure for consistent performance at scale.

Text size:

SUNNYVALE, CA / ACCESS Newswire / April 21, 2026 / Graid Technology, the pioneer in GPU-accelerated NVMe storage, today announced its Agentic AI Storage Portfolio: a purpose-built family of KV cache solutions designed to eliminate the storage bottleneck that stalls "always-on" production AI. The portfolio spans three deployment tiers: KV Cache Server, KV Cache Rack, and KV Cache Platform, all built on SupremeRAID technology. KV Cache Platform, the portfolio's highest tier, is purpose-aligned to NVIDIA's STX reference architecture, with native BlueField-4 DPU execution on the roadmap for H2 2026.

As agentic AI moves from experimentation to production, the infrastructure assumptions that underpinned single-shot inference have broken down. Models running continuous multi-step tasks and maintaining context across hours of operation generate KV cache demands that overwhelm GPU HBM. The result: latency spikes up to 18x, GPU utilization as low as 50%, and model-level failures, including hallucinations and reasoning degradation, that are difficult to detect and costly to recover from.

SupremeRAID addresses this directly, aggregating up to 32 NVMe drives into a single 280 GB/s virtual pool, bypassing the CPU via GPU Direct Storage, and delivering KV cache reads at 1.3ms- 77x faster than standard NVMe. The three portfolio tiers bring this capability to every deployment scale:

KV Cache Server - single-node NVMe acceleration for individual inference servers and edge AI deployments. Available now.

KV Cache Rack - rack-scale, partner-validated solutions co-engineered with leading server OEM partners for enterprise multi-GPU clusters. Available now.

KV Cache Platform - Purpose-built for NVIDIA's STX reference architecture, with native BlueField-4 DPU execution and rack-scale storage expansion on the roadmap.

"A year ago, at GTC 2025, Jensen Huang predicted that storage would become GPU-accelerated for the first time. This year, NVIDIA turned that concept into an architecture with STX and CMX," said Leander Yu, CEO of Graid Technology. "Our KV Cache Portfolio is built for precisely this moment, delivering the storage performance that agentic AI demands, at storage-tier economics."

For enterprises and infrastructure teams evaluating agentic AI deployments, the full deployment architecture, technical specifications, and NVIDIA STX compatibility details are available in the solution brief: Graid Technology Agentic AI Storage Portfolio: Purpose-built KV Cache Solutions for Inference at Scale

To learn more about Graid Technology's AI offerings, visit graidtech.com/ai

Media Inquiries:
Andrea Eaken, Sr. Director of Marketing, Americas & EMEA
[email protected]

____________________________________

About Graid Technology
Graid Technology is building the storage backbone for the future of AI, enterprise, and high-performance computing. As the creator of SupremeRAID, the world's first and only GPU-based RAID, and the global steward of Intel® Virtual RAID on CPU (Intel® VROC), Graid Technology delivers flexible RAID solutions that maximize NVMe performance while ensuring resilient, scalable data protection for modern data infrastructure. Headquartered in Silicon Valley with global operations and R&D in Taiwan, Graid Technology is advancing RAID innovation for the next generation of data-intensive workloads. To learn more, visit graidtech.com.

SOURCE: Graid Technology Inc.



View the original press release on ACCESS Newswire

C.Zeman--TPP