~/devreads

#ai infrastructure

66 posts

9 Jun

Avinash Ahuja 1 min read

NVIDIA GPUs with Confidential Computing are now used for confidential inference in Apple’s Private Cloud Compute (PCC), as it expands beyond Apple’s data centers to Google Cloud. Unveiled during Apple’s annual WWDC gathering for developers from around the globe, NVIDIA GPUs will support server-side inference for Apple Foundation Models, custom-built by Apple and Google, leveraging […]

ai infrastructureartificial intelligencecybersecurityhardwareinference

8 Jun

Madison Huang 4 min read

NVIDIA and LG Group are building an AI factory to accelerate LG Group’s next wave of AI-driven businesses, spanning robotics, autonomous driving, data center technologies and GPU cloud services. The AI factory will provide LG Group with accelerated computing infrastructure to train, simulate, validate and deploy AI-based applications across its key businesses. The collaboration brings […]

ai infrastructureroboticsagentic aiai factoryartificial intelligence

7 Jun

Madison Huang 3 min read

NVIDIA and Doosan Group are expanding their collaboration to advance new opportunities across physical AI, robotics and AI factory infrastructure, spanning Doosan Robotics, Doosan Bobcat, Doosan Enerbility and Doosan Corporation Electro-Materials BG. The collaboration will bring together NVIDIA’s full-stack accelerated computing platforms with Doosan Group’s capabilities in industrial automation, power generation and advanced electronics materials […]

ai infrastructureroboticsagentic aiai factoryartificial intelligence

5 Jun

2 Jun

Dave Salvator 5 min read

The agentic AI moment has arrived, but delivering on its promise requires more than good models. It also takes fast hardware, secure runtimes, a responsive data layer and models tuned for long-running reasoning. NVIDIA and Microsoft are bringing that full stack to developers across Windows devices, Azure cloud and local deployments. At Microsoft Build, NVIDIA […]

aiai infrastructurehardwarenetworkingsoftware

1 Jun

Dion Harris 6 min read

The NVIDIA AI Cloud ecosystem is accelerating the global buildout of AI factory infrastructure. Partners are expanding capacity to meet growing demand from enterprises, startups, nations, AI labs and developers scaling agentic AI applications. NVIDIA AI Clouds are a growing ecosystem of purpose-built clouds serving the exploding token demand behind today’s most popular AI applications. […]

ai infrastructurecloud

Esther Lee 4 min read

As factories move from isolated automation to plant-wide intelligence, manufacturers need AI systems that can connect live machine signals, quality systems, work instructions and operational alerts into a unified decision layer. Today at GTC Taipei at COMPUTEX, NVIDIA announced the NVIDIA Factory Operations Blueprint (FOX) — a reference design for building an autonomous factory manager […]

ai infrastructureroboticsagentic aicomputex 2026industrial and manufacturing

Timothy Costa 3 min read

Taiwan is home to more than 500 NVIDIA ecosystem partners. More than 1 million NVIDIA MGX rack components for NVIDIA Vera Rubin infrastructure come together in Taiwan, from across 25 factory sites. As Vera Rubin ramps into full production to power agentic AI factories worldwide, that ecosystem spans the full supply chain — from key […]

ai infrastructureagentic aiai factorycomputex 2026industrial and manufacturing

27 May

26 May

Diana Aung 3 min read

The shift to agentic AI creates a new CPU requirement for the AI factory: fast cores, massive memory bandwidth and the ability to sustain high performance when all cores are active. Initial benchmark results published by Phoronix today show that the NVIDIA Vera CPU meets this need. For this first public look, the benchmark scope […]

ai infrastructureagentic aiai factorynvidia vera

18 May

NVIDIA Writers 7 min read

Agentic AI inference at one-tenth the cost per token with NVIDIA Vera Rubin NVL72. Agent sandboxes run 50% faster on NVIDIA Vera than traditional CPUs — while enterprise data queries are up to 3x faster with the Vera CPU. And 5,000 enterprises like Lilly, Samsung and Honeywell are running AI workloads on Dell AI Factories […]

ai infrastructureagentic aicuda-xnemotronnvidia blueprints

13 May

NVIDIA Writers 2 min read

Reinforcement-learning agents — AI systems that learn by trial and error — can convert computation into new knowledge. That’s the focus of a new engineering-level collaboration between NVIDIA and Ineffable Intelligence, the London-based AI lab founded by AlphaGo architect David Silver in the wake of Ineffable’s emergence from stealth last week. “The next frontier of […]

ai infrastructure

7 May

Brian Caulfield 4 min read

AI will help build the energy it needs. That’s the case U.S. Energy Secretary Chris Wright and NVIDIA Vice President of Hyperscale and High-Performance Computing Ian Buck made Thursday morning at the SCSP AI+ Expo. The 30-minute fireside chat, moderated by SCSP president Ylli Bajraktari, was called “Powering the Next American Century.” Their argument: American […]

ai infrastructurecorporatehardwareresearchsupercomputing

23 Apr

Justin Boitano 3 min read

AI agents have revolutionized developer workflows, and their next frontier is knowledge work: processing information, solving complex problems, coming up with new ideas and driving innovation. Codex, OpenAI’s agentic coding application, is enabling this new frontier. It’s now powered by GPT-5.5, OpenAI’s latest frontier model, which runs on NVIDIA GB200 NVL72 rack-scale systems. Over 10,000 […]

aiai infrastructureagentic ainvidia blackwell

22 Apr

Ian Buck 6 min read

NVIDIA and Google Cloud have collaborated for more than a decade, co‑engineering a full‑stack AI platform that spans every technology layer — from performance‑optimized libraries and frameworks to enterprise‑grade cloud services. This foundation enables developers, startups and enterprises to push agentic and physical AI out of the lab and into production — from agents that […]

ai infrastructurecloudagentic aiartificial intelligencecloud services

15 Apr

Shruti Koparkar 5 min read

Traditional data centers only stored, retrieved and processed data. In the generative and agentic AI era, these facilities have evolved into AI token factories. With AI inference becoming their primary workload, their primary output is intelligence manufactured in the form of tokens. This transformation demands a corresponding shift in how the economics of AI infrastructure, […]

ai infrastructureinferencenvidia blackwellthink smart

31 Mar

Vladimir Troy 4 min read

CERAWeek — dubbed the Davos of energy — is where policymakers, producers, technologists and financiers gather to discuss how the world powers itself next. NVIDIA and Emerald AI unveiled at the conference last week a new way forward — treating AI factories not as static power loads but as flexible, intelligent grid assets. This collaboration […]

ai infrastructurecorporateai for goodomniverse enterprise

25 Mar

Josh Parker 4 min read

At the half-time whistle of the UEFA EURO 2020 round of 16 football match between England and Germany, millions of viewers stepped away from their screens in the U.K. to do the same thing at the same time — turn on their kettles. National Grid, which provides electricity for England and Wales, saw a demand […]

ai infrastructureai factoryai for goodartificial intelligenceenergy

24 Mar

Justin Boitano 4 min read

Artificial intelligence has rapidly emerged as one of the most critical workloads in modern computing. For the vast majority of enterprises, this workload runs on Kubernetes, an open source platform that automates the deployment, scaling and management of containerized applications. To help the global developer community manage high-performance AI infrastructure with greater transparency and efficiency, […]

ai infrastructurecloudsoftwareartificial intelligenceevents

20 Mar

17 Mar

Kanika Atri 5 min read

As AI‑native applications scale to more users, agents and devices, the telecommunications network is becoming the next frontier for distributing AI. At NVIDIA GTC 2026, leading operators in the U.S. and Asia showed that this shift is underway, announcing AI grids — geographically distributed and interconnected AI infrastructure — using their network footprint to power […]

ai infrastructureartificial intelligencegtc 2026inferencenvidia rtx

16 Mar

Scott Martin 5 min read

Setting up AI factories in simulation — decreasing deployment time from months to days — is accelerating the next industrial revolution. Nowhere was that more apparent than at GTC 2026, in San Jose, where NVIDIA founder and CEO Jensen Huang introduced NVIDIA DSX Air. Part of NVIDIA DSX Sim in the DSX platform, NVIDIA’s blueprint […]

ai infrastructureai factorygtc 2026nvidia dgx

10 Mar

NVIDIA Newsroom 1 min read

NVIDIA and Thinking Machines Lab announced today a multiyear strategic partnership to deploy at least one gigawatt of next-generation NVIDIA Vera Rubin systems to support Thinking Machines’ frontier model training and platforms delivering customizable AI at scale. Deployment on the NVIDIA Vera Rubin platform is targeted for early next year. The partnership also includes an […]

ai infrastructurecorporateai trainingopen source

26 Feb

23 Feb

Itay Ozery 5 min read

As technologies and systems become more digitalized and connected across the world, operational technology (OT) environments and industrial control systems (ICS) — from energy and manufacturing to transportation and utilities — are increasingly depending on enterprise networks and the cloud. This expands OT and ICS capabilities — but also their exposure to cyber threats. Unlike […]

ai infrastructurecybersecuritynvidia bluefieldtrustworthy ai

18 Feb

16 Feb

Ashraf Eassa 4 min read

The NVIDIA Blackwell platform has been widely adopted by leading inference providers such as Baseten, DeepInfra, Fireworks AI and Together AI to reduce cost per token by up to 10x. Now, the NVIDIA Blackwell Ultra platform is taking this momentum further for agentic AI. AI agents and coding assistants are driving explosive growth in software-programming-related […]

ai infrastructurecloudhardwarenetworkingsoftware

12 Feb

Shruti Koparkar 6 min read

A diagnostic insight in healthcare. A character’s dialogue in an interactive game. An autonomous resolution from a customer service agent. Each of these AI-powered interactions is built on the same unit of intelligence: a token. Scaling these AI interactions requires businesses to consider whether they can afford more tokens. The answer lies in better tokenomics […]

aiai infrastructureagentic aidynamoinference

Max Starubinskiy 5 min read

At leading institutions across the globe, the NVIDIA DGX Spark desktop supercomputer is bringing data‑center‑class AI to lab benches, faculty offices and students’ systems. There’s even a DGX Spark hard at work in the South Pole, at the IceCube Neutrino Observatory run by the University of Wisconsin-Madison. The compact supercomputer’s petaflop‑class performance enables local deployment […]

aiai infrastructurehardwareartificial intelligenceeducation

5 Jan

Itay Ozery 3 min read

AI is powering breakthroughs across industries, helping enterprises operate with greater intelligence and speed. As AI factories scale, the next generation of enterprise AI depends on infrastructure that can efficiently manage data, secure every stage of the pipeline and accelerate the core services that move, protect and process information alongside AI workloads. NVIDIA has expanded […]

ai infrastructurenetworkingartificial intelligenceces 2026cybersecurity

Charlie Boyle 5 min read

NVIDIA DGX SuperPOD is paving the way for large-scale system deployments built on the NVIDIA Rubin platform — the next leap forward in AI computing. At the CES trade show in Las Vegas, NVIDIA today introduced the Rubin platform, comprising six new chips designed to deliver one incredible AI supercomputer, and engineered to accelerate agentic […]

ai infrastructureces 2026nvidia bluefieldnvidia dgxnvidia rubin

Chris Marriott 6 min read

Open-source AI is accelerating innovation across industries, and NVIDIA DGX Spark and DGX Station are built to help developers turn innovation into impact. NVIDIA today unveiled at the CES trade show how the DGX Spark and DGX Station deskside AI supercomputers let developers harness the latest open and frontier AI models on a local deskside […]

ai infrastructureai trainingartificial intelligenceces 2026cuda-x

18 Dec 2025

17 Dec 2025

Zoe Kessler 4 min read

The Hao AI Lab research team at the University of California San Diego — at the forefront of pioneering AI model innovation — recently received an NVIDIA DGX B200 system to elevate their critical work in large language model inference. Many LLM inference platforms in production today, such as NVIDIA Dynamo, use research concepts that […]

aiai infrastructureresearchsupercomputingartificial intelligence

15 Dec 2025

NVIDIA Newsroom 2 min read

NVIDIA today announced it has acquired SchedMD — the leading developer of Slurm, an open-source workload management system for high-performance computing (HPC) and AI — to help strengthen the open-source software ecosystem and drive AI innovation for researchers, developers and enterprises. NVIDIA will continue to develop and distribute Slurm as open-source, vendor-neutral software, making it […]

ai infrastructurecorporatesoftwareartificial intelligenceopen source

10 Dec 2025

Prachi Goel 4 min read

The world’s top-performing system for graph processing at scale was built on a commercially available cluster. NVIDIA last month announced a record-breaking benchmark result of 410 trillion traversed edges per second (TEPS), ranking No. 1 on the 31st Graph500 breadth-first search (BFS) list. Performed on an accelerated computing cluster hosted in a CoreWeave data center […]

ai infrastructurenetworkingcloud servicescudahardware

3 Dec 2025

Shruti Koparkar 8 min read

The top 10 most intelligent open-source models all use a mixture-of-experts architecture. Kimi K2 Thinking, DeepSeek-R1, Mistral Large 3 and others run 10x faster to enable one-tenth the cost per token on NVIDIA GB200 NVL72. A look under the hood of virtually any frontier model today will reveal a mixture-of-experts (MoE) model architecture that mimics […]

ai infrastructureartificial intelligencedynamoinferencenvidia blackwell

2 Dec 2025

Kari Briski 2 min read

Today, Mistral AI announced the Mistral 3 family of open-source multilingual, multimodal models, optimized across NVIDIA supercomputing and edge platforms. Mistral Large 3 is a mixture-of-experts (MoE) model — instead of firing up every neuron for every token, it only activates the parts of the model with the most impact. The result is efficiency […]

ai infrastructureinferencenvidia blackwellnvlinktensorrt

Ian Buck 5 min read

At AWS re:Invent, NVIDIA and Amazon Web Services expanded their strategic collaboration with new technology integrations across interconnect technology, cloud infrastructure, open models and physical AI. As part of this expansion, AWS will support NVIDIA NVLink Fusion — a platform for custom AI infrastructure — for deploying its custom-designed silicon, including next-generation Trainium4 chips for […]

aiai infrastructurecloudhardwarenetworking

24 Nov 2025

18 Nov 2025

Ian Buck 5 min read

Timed with the Microsoft Ignite conference running this week, NVIDIA is expanding its collaboration with Microsoft, including through the adoption of next-generation NVIDIA Spectrum-X Ethernet switches for the new Microsoft Fairwater AI superfactory, powered by the NVIDIA Blackwell platform. The collaboration brings new integrations across Microsoft 365 Copilot, as well as the public preview of […]

aiai infrastructurecloudhardwarenetworking

Jacob Liberman 4 min read

AI agents have the potential to become indispensable tools for automating complex tasks. But bringing agents to production remains challenging. According to Gartner, “about 40% of AI prototypes make it into production, and participants reported data availability and quality as a top barrier to AI adoption.1” Just like human workers, AI agents need secure, relevant, […]

ai infrastructurehardwarenetworkingagentic ai

NVIDIA Newsroom 1 min read

Today, Microsoft, NVIDIA and Anthropic announced new strategic partnerships. Anthropic is scaling its rapidly growing Claude AI model on Microsoft Azure, powered by NVIDIA, which will broaden access to Claude and provide Azure enterprise customers with expanded model choice and new capabilities. Anthropic has committed to purchase $30 billion of Azure compute capacity and to […]

ai infrastructurecorporateartificial intelligence

17 Nov 2025

Scott Martin 12 min read

At SC25, NVIDIA unveiled advances across NVIDIA BlueField DPUs, next-generation networking, quantum computing, national research, AI physics and more — as accelerated systems drive the next chapter in AI supercomputing. NVIDIA also highlighted storage innovations powered by the NVIDIA BlueField-4 data processing unit, part of the full-stack BlueField platform that accelerates gigascale AI infrastructure. More […]

ai infrastructurecorporatehardwarenetworkingsupercomputing

Chris Porter 6 min read

Across quantum physics, digital biology and climate research, the world’s researchers are harnessing a universal scientific instrument to chart new frontiers of discovery: accelerated computing. At this week’s SC25 conference in St. Louis, Missouri, NVIDIA announced that over 80 new scientific systems powered by the NVIDIA accelerated computing platform have been unveiled around the globe […]

ai infrastructuresupercomputingai factorycuda-xhigh-performance computing

Kibibi Moseley 5 min read

To power future technologies including liquid-cooled data centers, high-resolution digital displays and long-lasting batteries, scientists are searching for novel chemicals and materials optimized for factors like energy use, durability and efficacy. New NVIDIA-accelerated data processing pipelines and AI microservices unveiled at the SC25 conference in St. Louis are advancing chemistry and material science to support […]

ai infrastructureresearchsupercomputingartificial intelligencecuda-x

Timothy Costa 4 min read

NVIDIA Apollo — a family of open models for accelerating industrial and computational engineering — was introduced today at the SC25 conference in St. Louis. Accelerated by NVIDIA AI infrastructure, the new AI physics models will enable developers to integrate real-time capabilities into their simulation software across a broad range of industries. The NVIDIA Apollo […]

ai infrastructuresupercomputingaerospaceartificial intelligencecuda-x

14 Nov 2025

John Kim 4 min read

Today’s AI workloads are data-intensive, requiring more scalable and affordable storage than ever. By 2028, enterprises are projected to generate nearly 400 zettabytes of data annually, with 90% of new data being unstructured, comprising audio, video, PDFs, images and more. This massive scale, combined with the need for data portability between on-premises infrastructure and the […]

ai infrastructurecloudnetworkingcuda

13 Nov 2025

Shruti Koparkar 4 min read

Editor’s note: This post is part of Think SMART, a series focused on how leading AI service providers, developers and enterprises can boost their inference performance and return on investment with the latest advancements from NVIDIA’s full-stack inference platform. NVIDIA Blackwell delivers the highest performance and efficiency, and lowest total cost of ownership across every […]

ai infrastructuredynamoinferencethink smart

12 Nov 2025

Dave Salvator 3 min read

In the age of AI reasoning, training smarter, more capable models is critical to scaling intelligence. Delivering the massive performance to meet this new age requires breakthroughs across GPUs, CPUs, NICs, scale-up and scale-out networking, system architectures, and mountains of software and algorithms. In MLPerf Training v5.1 — the latest round in a long-running series […]

ai infrastructurehardwarenetworkingsoftwareai training

4 Nov 2025

Markus Hacker 4 min read

In Berlin on Tuesday, Deutsche Telekom and NVIDIA unveiled the world’s first Industrial AI Cloud, a sovereign, enterprise-grade platform set to go live in early 2026. The partnership brings together Deutsche Telekom’s trusted infrastructure and operations and NVIDIA AI and Omniverse digital twin platforms to power the AI era of Germany’s industrial transformation. “We have […]

ai infrastructurecloudcorporateindustrial and manufacturingnvidia dgx

31 Oct 2025

Scott Martin 4 min read

Amidst Gyeongju, South Korea’s ancient temples and modern skylines, Jensen Huang hit the stage at the APEC Summit with historic news: South Korea is leaping into the future with sovereign AI supported by more than a quarter-million NVIDIA GPUs. “It’s vital that we build the ecosystem, not just the AI infrastructure, of Korea,” he said. […]

ai infrastructurecorporategamingcudainception

28 Oct 2025

NVIDIA Writers 2 min read

Monday, Oct. 27, 12:30 p.m. ET How Medium-Sized Cities Are Tackling AI Readiness 🔗 A panel discussion today at GTC Washington, D.C., highlighted a public-private initiative to invigorate the economy of Rancho Cordova, California, with a focus on AI. To bolster innovation, the city is working with the Human Machine Collaboration Institute and NVIDIA on […]

aiai infrastructurecorporatedrivingrobotics

Timothy Costa 2 min read

Leading technology companies in aerospace and automotive are accelerating their engineering design processes with the NVIDIA DoMINO NIM microservice, part of the NVIDIA PhysicsNeMo AI physics framework. By integrating GPU-accelerated computing, NVIDIA PhysicsNeMo and interactive digital twin technologies, enterprises are accelerating their modeling and simulation workflows by up to 500x over traditional methods, speeding innovation […]

ai infrastructuresoftwarecuda-xgtc 2025industrial and manufacturing

Louis Stewart 5 min read

To democratize access to AI technology nationwide, AI education and deployment can’t be limited to a few urban tech hubs — it must reach every community, university and state. That’s why NVIDIA is working with cities, states and educational institutions to embed AI education and innovation across the U.S., with the goal of helping the […]

aiai infrastructurecorporateroboticsai factory

Scott Martin 4 min read

Along the Pacific Ocean in Monterey, California, the Naval Postgraduate School (NPS) is making a splash all the way to Washington, D.C.: It’s using artificial intelligence to solve operational challenges while educating tomorrow’s leaders in AI skills. Like Silicon Valley, it’s not uncommon for NPS, the U.S. Navy’s flagship academic graduate university, to hold hackathons, […]

ai infrastructurehardwareartificial intelligencedeep learning institutedigital twin

Chris Porter 4 min read

The race to bottle a star now runs on AI. NVIDIA, General Atomics and a team of international partners have built a three dimensional, interactive AI-enabled digital twin for a fusion reactor, with technical support from San Diego Supercomputer Center at UC San Diego School of Computing, Information and Data Sciences, the Argonne Leadership Computing […]

ai infrastructuresoftwaresupercomputingcuda-xdigital twin

Kanika Atri 4 min read

NVIDIA is delivering the telecom industry a major boost in open-source software for building AI-native 5G and 6G networks. NVIDIA Aerial software will soon be released as open source, making it available on a variety of NVIDIA platforms, including on NVIDIA DGX Spark. With open-source software and a powerful and accessible supercomputer to run it […]

ai infrastructuremobile5g6gai-ran

Justin Boitano 4 min read

Governments everywhere are racing to harness the power of AI — but legacy infrastructure isn’t built for the velocity, complexity or trust that mission-critical action now demands. Massive data streams, cyber threats and urgent operations require a new blueprint for creating AI factories purpose-built for the public sector’s unique standards and scale. At NVIDIA GTC […]

ai infrastructuresoftwaregtc 2025nvidia ai enterprisenvidia spectrum-x ethernet

James Mills 4 min read

During the GTC Washington, D.C., keynote today, NVIDIA founder and CEO Jensen Huang introduced NVIDIA Omniverse DSX, a comprehensive, open blueprint for designing and operating gigawatt-scale AI factories — validated at the new AI Factory Research Center at Digital Realty’s site in Manassas, Virginia. The blueprint brings together ecosystem partners across the industry that are […]

aiai infrastructuresoftwareai factorydigital twin

Itay Ozery 3 min read

AI factories continue to grow at unprecedented scale, processing structured, unstructured and emerging AI-native data. With demand for trillion-token workloads exploding, a new class of infrastructure is required to keep pace. At NVIDIA GTC Washington, D.C, NVIDIA revealed the NVIDIA BlueField-4 data processing unit, part of the full-stack BlueField platform that accelerates gigascale AI infrastructure, […]

ai infrastructurenetworkingcybersecuritygtc 2025nvidia bluefield