Nvidia's KV Cache Transform Coding (KVTC) compresses the LLM key-value cache by 20x without model changes, cutting GPU memory costs and reducing time-to-first-token by up to 8x for multi-turn AI applications.
Nvidia's BlueField-4 STX reference architecture inserts a dedicated context memory layer between GPUs and traditional storage, claiming 5x token throughput and 4x energy efficiency for agentic AI ...
Nvidia shows off its next-generation Kyber rack-scale solution, which will be powered by Rubin Ultra GPUs with four compute chiplets and 1 TB of HBM4E memory per package.
NVIDIA has launched the new compact single-slot RTX PRO 4500 Blackwell Server Edition with 32GB of GDDR7 memory for servers ...
VDURA today announced three major advances at NVIDIA GTC 2026: availability of Remote Direct Memory Access (RDMA) capability, the upcoming first phase of its Context-Aware Tiering ...
XDA Developers on MSN: You could be buying the wrong GPU. Don't waste your money (and power).
"Kioxia fully supports the NVIDIA Storage-Next initiative and will deliver purpose-built SSDs to effectively address the need for GPU-accessible memory," said Makoto Hamada, Senior Director of the SSD ...
Phison Electronics (8299TT), a global leader in NAND flash controllers and storage solutions, today announced its GTC ...
Lightbits Labs Ltd. today is introducing a new architecture aimed at addressing one of the most stubborn bottlenecks in large-scale artificial intelligence inference: the growing mismatch between the ...