#ai infrastructure

96 posts

23 Jul

23 Jul 2026 8 min read

Minimize idle accelerators: Native RL job interleaving with co-operative time-slicing in llm-d

The math behind reinforcement learning (RL) post-training for large language models (LLMs) is notoriously unforgiving. As frontier AI labs push the boundaries of reasoning and coding models using RL post-training algorithms like Group Relative Policy Optimization (GRPO), they routinely hit hard architectural and infrastructure constraints. While much of the industry's focus remains on acquiring raw accelerator capacity, infrastructure efficiency is…

ai infrastructure llm-d containers kubernetes

23 Jul 2026 4 min read

Your AI agents are ready. Is your data?

What’s one of the biggest bottlenecks stopping organizations from scaling their AI initiatives? It isn’t the capabilities of today’s models — it’s their access to business context and semantic meaning. In the agentic era, enterprises need to go beyond simply storing data to activating it with trusted context, moving from passive systems of record to proactive systems of action. But…

data analytics databases ai infrastructure

NVIDIA Writers 23 Jul 2026 3 min read

NVIDIA AI Supercomputer Comes Online at Naval Postgraduate School

NVIDIA founder and CEO Jensen Huang today visited the Naval Postgraduate School in Monterey, California, to commission an NVIDIA DGX GB300 system — bringing one of the world’s most powerful AI platforms fully online for the students, researchers and faculty at the U.S. military’s flagship graduate university. “Our nation depends on our men and women […]

ai infrastructure education nvidia blackwell nvidia dgx omniverse

21 Jul

NVIDIA Writers 21 Jul 2026 8 min read

NVIDIA Vera Rubin Driving Performance Per Watt, Lowest Token Cost for Partners Worldwide

NVIDIA Vera Rubin is here, and it’s going gigascale. Vera Rubin NVL72 production is ramping up with racks running at partners CoreWeave, Google Cloud, Microsoft Azure and Oracle Cloud Infrastructure. Spanning 350-plus factory sites in 30 countries, Vera Rubin has the largest, most mature rack-scale supply chain ever assembled to meet customer compute demand. The […]

ai infrastructure nvidia vera rubin nvlink

Scot Schultz 21 Jul 2026 4 min read

Built for Vera Rubin, NVIDIA Spectrum-6 Arrives in Gigascale AI Factories

AI has entered the gigascale era. The world’s most advanced AI factories are bringing together hundreds of thousands of GPUs and CPUs to train frontier models, power agentic AI and generate intelligence at unprecedented scale. At this level, networking becomes a critical computing power multiplier in driving token generation. Marking a networking milestone, NVIDIA Spectrum-6 […]

ai infrastructure networking ai factory artificial intelligence cloud services

20 Jul

Brian Caulfield 20 Jul 2026 5 min read

Bristol Myers Squibb Building Life Science Industry’s Most Advanced AI Factory on NVIDIA Vera Rubin

Erin Davis calls it the “SuperDuperPOD.” That’s two things in one name: pharmaceutical giant Bristol Myers Squibb (BMS) already runs one of the largest AI clusters in life sciences, with serious results to show for it. And they’re doubling down. BMS announced today it is deploying its second NVIDIA DGX SuperPOD, this one built on […]

ai infrastructure hardware software supercomputing agentic ai

15 Jul

15 Jul 2026 4 min read

IDC: Why the right networking approach is foundational to agentic AI

Editor’s note: Today we hear from IDC on the results of its 2026 AI in Networking Special Report Survey, which explores enterprises' concerns about the networking infrastructure needed to support the rise of agentic AI in their organizations. The survey was sponsored by Google Cloud. Enterprises are moving quickly on AI pilots, but the move from pilot to production remains uneven. While…

ai machine learning ai infrastructure networking

NVIDIA Writers 15 Jul 2026 1 min read

NVIDIA and Japan Bring Full-Stack AI and Robotics to Every Industry

Home to leading manufacturers, robotics pioneers and infrastructure builders, Japan is one of the world’s centers of AI — building across the full stack with NVIDIA technologies. NVIDIA and its partners in Japan are this week showcasing the AI ecosystem’s latest advancements. Check back here for updates. NVIDIA and SEGA Celebrate 30 Years of […]

ai infrastructure corporate

14 Jul

14 Jul 2026 5 min read

Claude at scale on Google Cloud: Frontier AI, built for enterprise production

Running frontier AI in production is demanding — accelerators to manage, latency to hold steady across continents, regulated data to keep in-region, and long-context requests to serve reliably. Claude on Google Cloud is built for exactly this. Like Monet and water lilies, frontier models and the enterprise platforms are often better together. In our case, Claude brings the reasoning, and…

partners ai infrastructure ai machine learning

Shruti Koparkar 14 Jul 2026 4 min read

Why Performance per Watt Is the Ultimate Metric for AI Infrastructure Efficiency

Power is AI infrastructure’s inescapable constraint. How many tokens an AI factory can generate within a fixed power budget determines its revenue and profitability. Because of this, performance per watt — a metric that can’t be gamed, only earned through real-world results — is the foundation for AI factories. As agentic AI drives token demand […]

ai infrastructure hardware networking software inference

8 Jul

8 Jul 2026 5 min read

Google Cloud named Leader in the 2026 Gartner® Magic Quadrant™ for AI Infrastructure

In the agentic era, AI is evolving from answering questions to reasoning and taking action. Companies who want to lead in this next phase of AI need computing infrastructure that’s designed and optimized for these new requirements, helping them innovate faster, deliver compelling user and customer experiences, and optimize for cost and energy efficiency — all at massive scale. Today,…

compute storage data transfer tpus ai infrastructure

7 Jul

Ian Buck 7 Jul 2026 5 min read

AI Innovators Adopt NVIDIA Vera — Why Max Single-Threaded CPU at Scale Matters

Max single-threaded CPUs at scale are a new category of CPUs built for the agentic AI era. Across the creation and deployment of an agentic system, the CPU is on the critical path for reasoning, response time and learning. CPUs are the processors that execute the work the AI model commands: the tool calling, code […]

ai infrastructure agentic ai ai factory artificial intelligence hardware

2 Jul

Colette Kress 2 Jul 2026 2 min read

NVIDIA Unlocks AI Compute at Scale, Inviting Partners to Power the AI Infrastructure Buildout

As AI moves from model development to production inference, compute demand is accelerating and shifting toward continuously operating AI factories that generate tokens at scale. This shift requires access to large‑scale, multi‑tenant accelerated computing that can come online quickly, stay highly utilized and support the economics of token‑scale AI services. Emerging AI companies historically have […]

ai infrastructure cloud ai factory nvidia blackwell

30 Jun

Amr Elmeleegy 30 Jun 2026 4 min read

How NVIDIA’s Inference Software Stack Powers the Lowest Token Cost

As organizations move from AI pilots to production AI factories, infrastructure decisions have shifted from peak chip specifications to cost per token: how many useful tokens they can deliver per dollar, per watt and within required latency targets. Codesigned with NVIDIA GPUs, CPUs, networking and systems, and strengthened by a broad open source ecosystem, NVIDIA’s […]

ai infrastructure hardware networking software cuda

29 Jun

Dave Salvator 29 Jun 2026 1 min read

Claude Meets Blackwell Ultra: Anthropic’s Models Now Run on NVIDIA GB300 in Azure

Anthropic’s Claude models in Microsoft Foundry — hosted on Microsoft Azure and running on NVIDIA GB300 Blackwell Ultra GPUs — are now generally available, giving Azure-native enterprises a powerful new way to build autonomous and domain-specific AI agents. As agentic AI continues to drive enterprise innovation and becomes more autonomous, organizations need access to computing […]

ai infrastructure hardware networking agentic ai nvidia blackwell

24 Jun

Josiah Byers 24 Jun 2026 3 min read

NVIDIA and AWS Collaborate to Bring AI to Production at Scale

Building AI systems at scale is demanding, requiring low-latency inference, fast vector search, strong GPU price-performance and infrastructure that can grow without multiplying operational complexity. NVIDIA’s latest work with Amazon Web Services (AWS) addresses each of those constraints. Across Amazon OpenSearch and Amazon EC2, NVIDIA AI infrastructure is giving enterprises more practical paths to deploy […]

ai infrastructure cloud agentic ai nvidia blackwell

23 Jun

Justin Boitano 23 Jun 2026 2 min read

How Businesses Are Building Specialized AI They Can Trust

Companies are asking how to build specialized AI that fits with the way their workflows actually run. The first wave of enterprise AI was about access. Companies experimented with new frontier and open models, ran pilots and explored how AI can help. Now, specialized agents — systems of models that can reason, use tools and […]

ai ai infrastructure agentic ai healthcare and life sciences nemotron

Chris Porter 23 Jun 2026 3 min read

NVIDIA Powers Over 400 of the World’s 500 Fastest Supercomputers

News Highlights: NVIDIA technology runs 81% of the TOP500 and 90% of the systems new to the list. 26 systems on the TOP500 adopted the NVIDIA Grace CPU, up eight from the previous list. The top eight systems on the Green500 run on NVIDIA GPUs and nine of the top 10 use NVIDIA technologies. No. […]

ai infrastructure hardware networking supercomputing

22 Jun

Chris Porter 22 Jun 2026 4 min read

At ISC, JUPITER Shows What Exascale Science Looks Like

JUPITER, Europe’s first exascale supercomputer at Germany’s Forschungszentrum Jülich, runs on NVIDIA Grace Hopper Superchips and NVIDIA Quantum-X800 InfiniBand networking — and it’s had a busy year. As the international supercomputing community gathers at ISC in Hamburg this week, four projects running on JUPITER point to what exascale computing can actually do: map the human […]

ai infrastructure supercomputing 6g agentic ai artificial intelligence

Zoe Kessler 22 Jun 2026 4 min read

NAIRR Science Program Reshapes Scientific Research, Powered by NVIDIA AI Infrastructure

For the past two years, the U.S. National Science Foundation’s National Artificial Intelligence Research Resource (NAIRR) pilot program has driven innovative research across the U.S. for over 700 projects — spanning protein prediction and infectious disease outbreak management. NVIDIA contributed to the NAIRR pilot through a cloud-based resource that gives researchers dedicated access to a […]

ai infrastructure supercomputing nvidia dgx science

Chris Porter 22 Jun 2026 2 min read

NVIDIA Vera CPU Opens the Way for Agentic Scientific AI at Los Alamos National Laboratory

Mission, Vision and Veritas — new Los Alamos National Laboratory (LANL) supercomputers to be built with HPE and NVIDIA — are tapping NVIDIA Vera CPUs to accelerate scientific discovery, unlocking agentic AI for science. The supercomputers will use the HPE Cray Supercomputing GX5000 architecture with the NVIDIA Vera Rubin platform, combining NVIDIA Vera CPUs, NVIDIA […]

ai infrastructure supercomputing agentic ai artificial intelligence high-performance computing

Josh Parker 22 Jun 2026 6 min read

Hotter Than a Hot Tub: The 45°C Breakthrough to Cool AI’s Biggest Machines

Hot tubs sit at about 38 to 40 degrees Celsius, warm enough that most people can only soak for about 15 minutes. NVIDIA’s newest AI servers can run their cooling liquid even hotter — up to 45 degrees Celsius, or 113 degrees Fahrenheit. That higher temperature limit is precisely what makes them more energy efficient. […]

ai infrastructure ai factory energy nvidia rubin

18 Jun

Vladimir Troy 18 Jun 2026 3 min read

How FERC’s Large-Load Interconnection Actions Help Address Grid Stress, Improve Affordability

In a consequential grid infrastructure decision, the Federal Energy Regulatory Commission (FERC) today issued a major milestone on large-load interconnection impacting how those building AI factories, semiconductor fabrication support systems and advanced manufacturing facilities can connect to the grid. In the era of AI, which NVIDIA founder and CEO Jensen Huang has described as a […]

ai infrastructure corporate economic development energy public sector

18 Jun 2026 3 min read

Scaling Ray Serve LLM on GKE: Performance without losing the developer experience

Developers looking for LLM inference and model serving often turn to Ray Serve, a scalable model serving library with developer-friendly, Python-native APIs built by Anyscale. Combined with Google Kubernetes Engine (GKE), developers have a powerful, unified platform optimized for demanding LLM serving use cases, spanning from initial model development to online production serving. However, that flexibility and feature set used…

ai infrastructure gke containers kubernetes

Nat Ives 18 Jun 2026 5 min read

France Advances Europe’s AI Future With NVIDIA Technologies

A year ago at NVIDIA GTC Paris at VivaTech, France laid out plans to advance local AI — from new AI factories and national compute capacity to open frontier models and industrial platforms. Now, that AI infrastructure is coming online. AI agents are running in production, startups are deploying applications and the French AI ecosystem […]

ai infrastructure agentic ai nemotron open models

16 Jun

NVIDIA Newsroom 16 Jun 2026 5 min read

Coherent Breaks Ground on Expanded Texas Facility, Scaling AI’s Optical Backbone

AI runs at the speed of light. More and more, that light is made in Texas. Coherent broke ground today on an expanded manufacturing building in Sherman, Texas. The company makes the lasers, optical components and compound semiconductors that wire AI systems together — and runs what it calls the world’s first 6-inch indium phosphide […]

ai ai infrastructure corporate networking ai factory

Chris Marriott 16 Jun 2026 4 min read

HPE AI Factory With NVIDIA Expands for the Era of Agents

Enterprises are moving agentic AI from proof of concept to production — and the next generation of AI factories are built for the era of agents. At HPE Discover Las Vegas, running through Thursday, June 18, NVIDIA and HPE are expanding the HPE AI Factory with NVIDIA, including NVIDIA Vera CPU and NVIDIA Agent Toolkit […]

ai infrastructure hardware networking software agentic ai

Shruti Koparkar 16 Jun 2026 4 min read

Fastest, Largest, Strongest: NVIDIA Blackwell Sweeps MLPerf Training 6.0

Every breakthrough AI model starts the same way: with a training run. The infrastructure running those training jobs shapes everything: how fast teams can iterate, what scale of model they can build and whether those jobs complete reliably. As models grow in size, complexity and intelligence, the demands on training infrastructure are also rising. In […]

ai infrastructure hardware networking software ai training

12 Jun

Shruti Koparkar 12 Jun 2026 3 min read

NVIDIA Blackwell Leads on First Agentic AI Infrastructure Benchmark

AgentPerf from Artificial Analysis, the industry’s first agentic AI benchmark, gives developers, enterprises and infrastructure providers a clear way to compare systems for agentic AI. In the first round of published results, the NVIDIA Blackwell Ultra NVL72 platform delivers leading performance across the agentic AI workloads tested, running 20x more agents per megawatt than NVIDIA […]

ai infrastructure hardware networking software agentic ai

9 Jun

Avinash Ahuja 9 Jun 2026 1 min read

NVIDIA Confidential Computing to Help Expand Apple’s Private Cloud Compute

NVIDIA GPUs with Confidential Computing are now used for confidential inference in Apple’s Private Cloud Compute (PCC), as it expands beyond Apple’s data centers to Google Cloud. Unveiled during Apple’s annual WWDC gathering for developers from around the globe, NVIDIA GPUs will support server-side inference for Apple Foundation Models, custom-built by Apple and Google, leveraging […]

ai infrastructure artificial intelligence cybersecurity hardware inference

9 Jun 2026 5 min read

Report: GKE Inference Gateway delivers up to 92% faster AI responses

As generative AI moves from experimental pilots to massive production environments, the efficiency of your infrastructure becomes the ultimate differentiator. One way to get the most out of it and minimize costly accelerator idle time is to leverage the Google Kubernetes Engine (GKE) Inference Gateway, which intelligently routes generative AI workloads based on real-time model server metrics. Instead of relying…

networking ai machine learning ai infrastructure gke containers kubernetes

8 Jun

Madison Huang 8 Jun 2026 4 min read

NVIDIA and LG Group Build an AI Factory to Advance Physical AI, Mobility and AI Infrastructure

NVIDIA and LG Group are building an AI factory to accelerate LG Group’s next wave of AI-driven businesses, spanning robotics, autonomous driving, data center technologies and GPU cloud services. The AI factory will provide LG Group with accelerated computing infrastructure to train, simulate, validate and deploy AI-based applications across its key businesses. The collaboration brings […]

ai infrastructure robotics agentic ai ai factory artificial intelligence

7 Jun

Madison Huang 7 Jun 2026 3 min read

NVIDIA and Doosan Group Collaborate to Advance Physical AI and AI Factory Infrastructure

NVIDIA and Doosan Group are expanding their collaboration to advance new opportunities across physical AI, robotics and AI factory infrastructure, spanning Doosan Robotics, Doosan Bobcat, Doosan Enerbility and Doosan Corporation Electro-Materials BG. The collaboration will bring together NVIDIA’s full-stack accelerated computing platforms with Doosan Group’s capabilities in industrial automation, power generation and advanced electronics materials […]

ai infrastructure robotics agentic ai ai factory artificial intelligence

5 Jun

NVIDIA Writers 5 Jun 2026 7 min read

Seoul Purpose: How NVIDIA and South Korea Are Building the Future of AI

Home to cutting-edge sovereign AI infrastructure and robotics innovators, as well as one of the world’s most passionate gaming communities, South Korea is one of the world’s centers of AI. NVIDIA founder and CEO Jensen Huang is in Seoul this week to meet the partners and builders behind that work. Monday, June 8, 10:00 a.m. […]

ai infrastructure corporate

2 Jun

Dave Salvator 2 Jun 2026 5 min read

NVIDIA Partners With Microsoft on Unified Stack for Agentic AI Deployment, From Windows Devices to Cloud to Local

The agentic AI moment has arrived, but delivering on its promise requires more than good models. It also takes fast hardware, secure runtimes, a responsive data layer and models tuned for long-running reasoning. NVIDIA and Microsoft are bringing that full stack to developers across Windows devices, Azure cloud and local deployments. At Microsoft Build, NVIDIA […]

ai ai infrastructure hardware networking software

1 Jun

Dion Harris 1 Jun 2026 6 min read

NVIDIA AI Cloud Ecosystem Expands Worldwide to Meet Global AI Compute Demand

The NVIDIA AI Cloud ecosystem is accelerating the global buildout of AI factory infrastructure. Partners are expanding capacity to meet growing demand from enterprises, startups, nations, AI labs and developers scaling agentic AI applications. NVIDIA AI Clouds are a growing ecosystem of purpose-built clouds serving the exploding token demand behind today’s most popular AI applications. […]

ai infrastructure cloud

Esther Lee 1 Jun 2026 4 min read

NVIDIA Factory Operations Blueprint Gives Factories a New AI Brain

As factories move from isolated automation to plant-wide intelligence, manufacturers need AI systems that can connect live machine signals, quality systems, work instructions and operational alerts into a unified decision layer. Today at GTC Taipei at COMPUTEX, NVIDIA announced the NVIDIA Factory Operations Blueprint (FOX) — a reference design for building an autonomous factory manager […]

ai infrastructure robotics agentic ai computex 2026 industrial and manufacturing

Timothy Costa 1 Jun 2026 3 min read

Taiwan’s Industry Titans Turbocharge World’s AI Infrastructure Buildout With NVIDIA

Taiwan is home to more than 500 NVIDIA ecosystem partners. More than 1 million NVIDIA MGX rack components for NVIDIA Vera Rubin infrastructure come together in Taiwan, from across 25 factory sites. As Vera Rubin ramps into full production to power agentic AI factories worldwide, that ecosystem spans the full supply chain — from key […]

ai infrastructure agentic ai ai factory computex 2026 industrial and manufacturing

27 May

Jeremy Graybill 27 May 2026

AI Factories: The New Infrastructure of Intelligence

AI factories are token factories, converting power into intelligence in real time. And as agentic AI scales and autonomous, always-on special agents are deployed in the enterprise, performance per watt and cost per token become the economics that matter.

ai infrastructure agentic ai ai factory nvidia blackwell nvidia rubin

26 May

Diana Aung 26 May 2026 3 min read

NVIDIA Vera CPU Is ‘Packing a Heavy-Hitting Punch’ Against Competition

The shift to agentic AI creates a new CPU requirement for the AI factory: fast cores, massive memory bandwidth and the ability to sustain high performance when all cores are active. Initial benchmark results published by Phoronix today show that the NVIDIA Vera CPU meets this need. For this first public look, the benchmark scope […]

ai infrastructure agentic ai ai factory nvidia vera

18 May

NVIDIA Writers 18 May 2026 7 min read

NVIDIA CEO Jensen Huang at Dell Technologies World: ‘Demand Is Going Parabolic, Utterly Parabolic’

Agentic AI inference at one-tenth the cost per token with NVIDIA Vera Rubin NVL72. Agent sandboxes run 50% faster on NVIDIA Vera than traditional CPUs — while enterprise data queries are up to 3x faster with the Vera CPU. And 5,000 enterprises like Lilly, Samsung and Honeywell are running AI workloads on Dell AI Factories […]

ai infrastructure agentic ai cuda-x nemotron nvidia blueprints

Ian Finder 18 May 2026

Vera Arrives: NVIDIA’s First CPU Built for Agents Lands at Top AI Labs

The first NVIDIA Vera CPUs arrived at three of the world's leading AI labs on Friday — Anthropic in San Francisco, OpenAI in Mission Bay, SpaceXAI in Palo Alto — followed by a delivery to Oracle Cloud Infrastructure in Santa Clara on Monday. NVIDIA Vice President of Hyperscale and High-Performance Computing Ian Buck hand-delivered them.

ai infrastructure corporate agentic ai nvidia vera

13 May

NVIDIA Writers 13 May 2026 2 min read

NVIDIA, Ineffable Intelligence Team Up to Build the Future of Reinforcement Learning Infrastructure

Reinforcement-learning agents — AI systems that learn by trial and error — can convert computation into new knowledge. That’s the focus of a new engineering-level collaboration between NVIDIA and Ineffable Intelligence, the London-based AI lab founded by AlphaGo architect David Silver in the wake of Ineffable’s emergence from stealth last week. “The next frontier of […]

ai infrastructure

7 May

Brian Caulfield 7 May 2026 4 min read

Powering the Next American Century: US Energy Secretary Chris Wright and NVIDIA’s Ian Buck on the Genesis Mission

AI will help build the energy it needs. That’s the case U.S. Energy Secretary Chris Wright and NVIDIA Vice President of Hyperscale and High-Performance Computing Ian Buck made Thursday morning at the SCSP AI+ Expo. The 30-minute fireside chat, moderated by SCSP president Ylli Bajraktari, was called “Powering the Next American Century.” Their argument: American […]

ai infrastructure corporate hardware research supercomputing

23 Apr

Justin Boitano 23 Apr 2026 3 min read

OpenAI’s New GPT-5.5 Powers Codex on NVIDIA Infrastructure — and NVIDIA Is Already Putting It to Work

AI agents have revolutionized developer workflows, and their next frontier is knowledge work: processing information, solving complex problems, coming up with new ideas and driving innovation. Codex, OpenAI’s agentic coding application, is enabling this new frontier. It’s now powered by GPT-5.5, OpenAI’s latest frontier model, which runs on NVIDIA GB200 NVL72 rack-scale systems. Over 10,000 […]

ai ai infrastructure agentic ai nvidia blackwell

22 Apr

Ian Buck 22 Apr 2026 6 min read

NVIDIA and Google Cloud Collaborate to Advance Agentic and Physical AI

NVIDIA and Google Cloud have collaborated for more than a decade, co‑engineering a full‑stack AI platform that spans every technology layer — from performance‑optimized libraries and frameworks to enterprise‑grade cloud services. This foundation enables developers, startups and enterprises to push agentic and physical AI out of the lab and into production — from agents that […]

ai infrastructure cloud agentic ai artificial intelligence cloud services

15 Apr

Shruti Koparkar 15 Apr 2026 5 min read

Rethinking AI TCO: Why Cost per Token Is the Only Metric That Matters

Traditional data centers only stored, retrieved and processed data. In the generative and agentic AI era, these facilities have evolved into AI token factories. With AI inference becoming their primary workload, their primary output is intelligence manufactured in the form of tokens. This transformation demands a corresponding shift in how the economics of AI infrastructure, […]

ai infrastructure inference nvidia blackwell think smart

31 Mar

Vladimir Troy 31 Mar 2026 4 min read

Efficiency at Scale: NVIDIA, Energy Leaders Accelerating Power‑Flexible AI Factories to Fortify the Grid

CERAWeek — dubbed the Davos of energy — is where policymakers, producers, technologists and financiers gather to discuss how the world powers itself next. NVIDIA and Emerald AI unveiled at the conference last week a new way forward — treating AI factories not as static power loads but as flexible, intelligent grid assets. This collaboration […]

ai infrastructure corporate ai for good omniverse enterprise

25 Mar

Josh Parker 25 Mar 2026 4 min read

Blowing Off Steam: How Power-Flexible AI Factories Can Stabilize the Global Energy Grid

At the half-time whistle of the UEFA EURO 2020 round of 16 football match between England and Germany, millions of viewers stepped away from their screens in the U.K. to do the same thing at the same time — turn on their kettles. National Grid, which provides electricity for England and Wales, saw a demand […]

ai infrastructure ai factory ai for good artificial intelligence energy

24 Mar

Justin Boitano 24 Mar 2026 4 min read

Advancing Open Source AI, NVIDIA Donates Dynamic Resource Allocation Driver for GPUs to Kubernetes Community

Artificial intelligence has rapidly emerged as one of the most critical workloads in modern computing. For the vast majority of enterprises, this workload runs on Kubernetes, an open source platform that automates the deployment, scaling and management of containerized applications. To help the global developer community manage high-performance AI infrastructure with greater transparency and efficiency, […]

ai infrastructure cloud software artificial intelligence events

20 Mar

NVIDIA Writers 20 Mar 2026

NVIDIA GTC 2026: Live Updates on What’s Next in AI

Rolling coverage from San Jose, including NVIDIA CEO Jensen Huang’s keynote, news highlights, live demos and on‑the‑ground color through March 19.

ai ai infrastructure corporate robotics gtc 2026

17 Mar

Kanika Atri 17 Mar 2026 5 min read

NVIDIA, Telecom Leaders Build AI Grids to Optimize Inference on Distributed Networks

As AI‑native applications scale to more users, agents and devices, the telecommunications network is becoming the next frontier for distributing AI. At NVIDIA GTC 2026, leading operators in the U.S. and Asia showed that this shift is underway, announcing AI grids — geographically distributed and interconnected AI infrastructure — using their network footprint to power […]

ai infrastructure artificial intelligence gtc 2026 inference nvidia rtx

16 Mar

Constantin Landers 16 Mar 2026 1 min read

Roche Scales NVIDIA AI Factories Globally to Accelerate Drug Discovery, Diagnostic Solutions and Manufacturing Breakthroughs

Roche's new deployment spans more than 3,500 NVIDIA Blackwell GPUs across its worldwide operations and is embedded across the entire value chain, massively scaling R&D productivity, next-generation diagnostics and manufacturing efficiencies.

ai infrastructure ai factory ai training artificial intelligence digital twin

Scott Martin 16 Mar 2026 5 min read

NVIDIA DSX Air Boosts Time to Token With Accelerated Simulation for AI Factories

Setting up AI factories in simulation — decreasing deployment time from months to days — is accelerating the next industrial revolution. Nowhere was that more apparent than at GTC 2026, in San Jose, where NVIDIA founder and CEO Jensen Huang introduced NVIDIA DSX Air. Part of NVIDIA DSX Sim in the DSX platform, NVIDIA’s blueprint […]

ai infrastructure ai factory gtc 2026 nvidia dgx

10 Mar

NVIDIA Newsroom 10 Mar 2026 1 min read

NVIDIA and Thinking Machines Lab Announce Long-Term Gigawatt-Scale Strategic Partnership

NVIDIA and Thinking Machines Lab announced today a multiyear strategic partnership to deploy at least one gigawatt of next-generation NVIDIA Vera Rubin systems to support Thinking Machines’ frontier model training and platforms delivering customizable AI at scale. Deployment on the NVIDIA Vera Rubin platform is targeted for early next year. The partnership also includes an […]

ai infrastructure corporate ai training open source

Jensen Huang 10 Mar 2026

AI Is a 5-Layer Cake

AI is one of the most powerful forces shaping the world today. It is not a clever app or a single model; it is essential infrastructure, like electricity and the internet.

ai ai infrastructure ai factory artificial intelligence

26 Feb

Rory Kelleher 26 Feb 2026

Now Live: The World’s Most Powerful AI Factory for Pharmaceutical Discovery and Development

Lilly this week launched the most powerful AI factory wholly owned and operated by a pharmaceutical company to help its teams make meaningful medical advancements faster, more accurately and at unprecedented scale. Dubbed LillyPod, it’s the world’s first NVIDIA DGX SuperPOD with DGX B300 systems.

ai ai infrastructure ai factory ai for good ai training

23 Feb

Itay Ozery 23 Feb 2026 5 min read

NVIDIA Brings AI-Powered Cybersecurity to World’s Critical Infrastructure

As technologies and systems become more digitalized and connected across the world, operational technology (OT) environments and industrial control systems (ICS) — from energy and manufacturing to transportation and utilities — are increasingly depending on enterprise networks and the cloud. This expands OT and ICS capabilities — but also their exposure to cyber threats. Unlike […]

ai infrastructure cybersecurity nvidia bluefield trustworthy ai

18 Feb

Jay Puri 18 Feb 2026 7 min read

India Fuels Its AI Mission With NVIDIA

From AI infrastructure leaders to frontier model developers, India is teaming with NVIDIA to drive AI transformation across the nation.

ai ai infrastructure ai training artificial intelligence economic development

16 Feb

Ashraf Eassa 16 Feb 2026 4 min read

New SemiAnalysis InferenceX Data Shows NVIDIA Blackwell Ultra Delivers up to 50x Better Performance and 35x Lower Costs for Agentic AI

The NVIDIA Blackwell platform has been widely adopted by leading inference providers such as Baseten, DeepInfra, Fireworks AI and Together AI to reduce cost per token by up to 10x. Now, the NVIDIA Blackwell Ultra platform is taking this momentum further for agentic AI. AI agents and coding assistants are driving explosive growth in software-programming-related […]

ai infrastructure cloud hardware networking software

12 Feb

Shruti Koparkar 12 Feb 2026 6 min read

Leading Inference Providers Achieve Lowest Token Cost With Open Source Models on NVIDIA Blackwell

A diagnostic insight in healthcare. A character’s dialogue in an interactive game. An autonomous resolution from a customer service agent. Each of these AI-powered interactions is built on the same unit of intelligence: a token. Scaling these AI interactions requires businesses to consider whether they can afford more tokens. The answer lies in better tokenomics […]

ai ai infrastructure agentic ai dynamo inference

Max Starubinskiy 12 Feb 2026 5 min read

NVIDIA DGX Spark Powers Big Projects in Higher Education

At leading institutions across the globe, the NVIDIA DGX Spark desktop supercomputer is bringing data‑center‑class AI to lab benches, faculty offices and students’ systems. There’s even a DGX Spark hard at work in the South Pole, at the IceCube Neutrino Observatory run by the University of Wisconsin-Madison. The compact supercomputer’s petaflop‑class performance enables local deployment […]

ai ai infrastructure hardware artificial intelligence education

5 Jan

Itay Ozery 5 Jan 2026 3 min read

NVIDIA BlueField-Powered Cybersecurity and Acceleration Arrive on NVIDIA Enterprise AI Factory Validated Design

AI is powering breakthroughs across industries, helping enterprises operate with greater intelligence and speed. As AI factories scale, the next generation of enterprise AI depends on infrastructure that can efficiently manage data, secure every stage of the pipeline and accelerate the core services that move, protect and process information alongside AI workloads. NVIDIA has expanded […]

ai infrastructure networking artificial intelligence ces 2026 cybersecurity

Charlie Boyle 5 Jan 2026 5 min read

NVIDIA DGX SuperPOD Sets the Stage for Rubin-Based Systems

NVIDIA DGX SuperPOD is paving the way for large-scale system deployments built on the NVIDIA Rubin platform — the next leap forward in AI computing. At the CES trade show in Las Vegas, NVIDIA today introduced the Rubin platform, comprising six new chips designed to deliver one incredible AI supercomputer, and engineered to accelerate agentic […]

ai infrastructure ces 2026 nvidia bluefield nvidia dgx nvidia rubin

Chris Marriott 5 Jan 2026 6 min read

NVIDIA DGX Spark and DGX Station Power the Latest Open-Source and Frontier Models From the Desktop

Open-source AI is accelerating innovation across industries, and NVIDIA DGX Spark and DGX Station are built to help developers turn innovation into impact. NVIDIA today unveiled at the CES trade show how the DGX Spark and DGX Station deskside AI supercomputers let developers harness the latest open and frontier AI models on a local deskside […]

ai infrastructure ai training artificial intelligence ces 2026 cuda-x

18 Dec 2025

Stacy Ozorio 18 Dec 2025 4 min read

Now Generally Available, NVIDIA RTX PRO 5000 72GB Blackwell GPU Expands Memory Options for Desktop Agentic AI

The NVIDIA RTX PRO 5000 72GB Blackwell GPU is now generally available, bringing robust agentic and generative AI capabilities powered by the NVIDIA Blackwell architecture to more desktops and professionals across the world.

ai ai infrastructure pro graphics agentic ai artificial intelligence

17 Dec 2025

Zoe Kessler 17 Dec 2025 4 min read

UC San Diego Lab Advances Generative AI Research With NVIDIA DGX B200 System

The Hao AI Lab research team at the University of California San Diego — at the forefront of pioneering AI model innovation — recently received an NVIDIA DGX B200 system to elevate their critical work in large language model inference. Many LLM inference platforms in production today, such as NVIDIA Dynamo, use research concepts that […]

ai ai infrastructure research supercomputing artificial intelligence

15 Dec 2025

NVIDIA Newsroom 15 Dec 2025 2 min read

NVIDIA Acquires Open-Source Workload Management Provider SchedMD

NVIDIA today announced it has acquired SchedMD — the leading developer of Slurm, an open-source workload management system for high-performance computing (HPC) and AI — to help strengthen the open-source software ecosystem and drive AI innovation for researchers, developers and enterprises. NVIDIA will continue to develop and distribute Slurm as open-source, vendor-neutral software, making it […]

ai infrastructure corporate software artificial intelligence open source

10 Dec 2025

Prachi Goel 10 Dec 2025 4 min read

How NVIDIA H100 GPUs on CoreWeave’s AI Cloud Platform Delivered a Record-Breaking Graph500 Run

The world’s top-performing system for graph processing at scale was built on a commercially available cluster. NVIDIA last month announced a record-breaking benchmark result of 410 trillion traversed edges per second (TEPS), ranking No. 1 on the 31st Graph500 breadth-first search (BFS) list. Performed on an accelerated computing cluster hosted in a CoreWeave data center […]

ai infrastructure networking cloud services cuda hardware

3 Dec 2025

Shruti Koparkar 3 Dec 2025 8 min read

Mixture of Experts Powers the Most Intelligent Frontier AI Models, Runs 10x Faster to Deliver 1/10 the Token Cost on NVIDIA Blackwell NVL72

The top 10 most intelligent open-source models all use a mixture-of-experts architecture. Kimi K2 Thinking, DeepSeek-R1, Mistral Large 3 and others run 10x faster to enable one-tenth the cost per token on NVIDIA GB200 NVL72. A look under the hood of virtually any frontier model today will reveal a mixture-of-experts (MoE) model architecture that mimics […]

ai infrastructure artificial intelligence dynamo inference nvidia blackwell

2 Dec 2025

Kari Briski 2 Dec 2025 2 min read

NVIDIA Partners With Mistral AI to Accelerate New Family of Open Models

Today, Mistral AI announced the Mistral 3 family of open-source multilingual, multimodal models, optimized across NVIDIA supercomputing and edge platforms. Mistral Large 3 is a mixture-of-experts (MoE) model — instead of firing up every neuron for every token, it only activates the parts of the model with the most impact. The result is efficiency […]

ai infrastructure inference nvidia blackwell nvlink tensorrt

Ian Buck 2 Dec 2025 5 min read

NVIDIA and AWS Expand Full-Stack Partnership, Providing the Secure, High-Performance Compute Platform Vital for Future Innovation

At AWS re:Invent, NVIDIA and Amazon Web Services expanded their strategic collaboration with new technology integrations across interconnect technology, cloud infrastructure, open models and physical AI. As part of this expansion, AWS will support NVIDIA NVLink Fusion — a platform for custom AI infrastructure — for deploying its custom-designed silicon, including next-generation Trainium4 chips for […]

ai ai infrastructure cloud hardware networking

24 Nov 2025

Amanda Saunders 24 Nov 2025 3 min read

Nemotron Labs: 3 Ways Specialized AI Agents Are Reshaping Businesses

Built on open-source models, today’s AI agents can be tailored for unique workflows and business needs to boost productivity and return on investment.

ai ai infrastructure agentic ai artificial intelligence cybersecurity

18 Nov 2025

Ian Buck 18 Nov 2025 5 min read

Powering AI Superfactories, NVIDIA and Microsoft Integrate Latest Technologies for Inference, Cybersecurity, Physical AI

Timed with the Microsoft Ignite conference running this week, NVIDIA is expanding its collaboration with Microsoft, including through the adoption of next-generation NVIDIA Spectrum-X Ethernet switches for the new Microsoft Fairwater AI superfactory, powered by the NVIDIA Blackwell platform. The collaboration brings new integrations across Microsoft 365 Copilot, as well as the public preview of […]

ai ai infrastructure cloud hardware networking

Jacob Liberman 18 Nov 2025 4 min read

Delivering AI-Ready Enterprise Data With GPU-Accelerated AI Storage

AI agents have the potential to become indispensable tools for automating complex tasks. But bringing agents to production remains challenging. According to Gartner, “about 40% of AI prototypes make it into production, and participants reported data availability and quality as a top barrier to AI adoption.” Just like human workers, AI agents need secure, relevant, […]

ai infrastructure hardware networking agentic ai

NVIDIA Newsroom 18 Nov 2025 1 min read

Microsoft, NVIDIA and Anthropic Announce Strategic Partnerships

Today, Microsoft, NVIDIA and Anthropic announced new strategic partnerships. Anthropic is scaling its rapidly growing Claude AI model on Microsoft Azure, powered by NVIDIA, which will broaden access to Claude and provide Azure enterprise customers with expanded model choice and new capabilities. Anthropic has committed to purchase $30 billion of Azure compute capacity and to […]

ai infrastructure corporate artificial intelligence

Dion Harris 18 Nov 2025 3 min read

The Great Flip: How Accelerated Computing Redefined Scientific Systems — and What Comes Next

Where CPUs once ruled, power efficiency — and then AI — flipped the balance. Extreme co-design across GPUs, networking and software now drives the frontier of science.

ai infrastructure networking research supercomputing cuda-x

17 Nov 2025

Scott Martin 17 Nov 2025 12 min read

Accelerated Computing, Networking Drive Supercomputing in Age of AI

At SC25, NVIDIA unveiled advances across NVIDIA BlueField DPUs, next-generation networking, quantum computing, national research, AI physics and more — as accelerated systems drive the next chapter in AI supercomputing. NVIDIA also highlighted storage innovations powered by the NVIDIA BlueField-4 data processing unit, part of the full-stack BlueField platform that accelerates gigascale AI infrastructure. More […]

ai infrastructure corporate hardware networking supercomputing

Chris Porter 17 Nov 2025 6 min read

NVIDIA Accelerates AI for Over 80 New Science Systems Worldwide

Across quantum physics, digital biology and climate research, the world’s researchers are harnessing a universal scientific instrument to chart new frontiers of discovery: accelerated computing. At this week’s SC25 conference in St. Louis, Missouri, NVIDIA announced that over 80 new scientific systems powered by the NVIDIA accelerated computing platform have been unveiled around the globe […]

ai infrastructure supercomputing ai factory cuda-x high-performance computing

Kibibi Moseley 17 Nov 2025 5 min read

NVIDIA Accelerated Computing Enables Scientific Breakthroughs for Materials Discovery

To power future technologies including liquid-cooled data centers, high-resolution digital displays and long-lasting batteries, scientists are searching for novel chemicals and materials optimized for factors like energy use, durability and efficacy. New NVIDIA-accelerated data processing pipelines and AI microservices unveiled at the SC25 conference in St. Louis are advancing chemistry and material science to support […]

ai infrastructure research supercomputing artificial intelligence cuda-x

Timothy Costa 17 Nov 2025 4 min read

One Giant Leap for AI Physics: NVIDIA Apollo Unveiled as Open Model Family for Scientific Simulation

NVIDIA Apollo — a family of open models for accelerating industrial and computational engineering — was introduced today at the SC25 conference in St. Louis. Accelerated by NVIDIA AI infrastructure, the new AI physics models will enable developers to integrate real-time capabilities into their simulation software across a broad range of industries. The NVIDIA Apollo […]

ai infrastructure supercomputing aerospace artificial intelligence cuda-x

14 Nov 2025

John Kim 14 Nov 2025 4 min read

How to Unlock Accelerated AI Storage Performance With RDMA for S3-Compatible Storage

Today’s AI workloads are data-intensive, requiring more scalable and affordable storage than ever. By 2028, enterprises are projected to generate nearly 400 zettabytes of data annually, with 90% of new data being unstructured, comprising audio, video, PDFs, images and more. This massive scale, combined with the need for data portability between on-premises infrastructure and the […]

ai infrastructure cloud networking cuda

13 Nov 2025

Shruti Koparkar 13 Nov 2025 4 min read

AWS, Google, Microsoft and OCI Boost AI Inference Performance for Cloud Customers With NVIDIA Dynamo

Editor’s note: This post is part of Think SMART, a series focused on how leading AI service providers, developers and enterprises can boost their inference performance and return on investment with the latest advancements from NVIDIA’s full-stack inference platform. NVIDIA Blackwell delivers the highest performance and efficiency, and lowest total cost of ownership across every […]

ai infrastructure dynamo inference think smart

12 Nov 2025

Dave Salvator 12 Nov 2025 3 min read

NVIDIA Wins Every MLPerf Training v5.1 Benchmark

In the age of AI reasoning, training smarter, more capable models is critical to scaling intelligence. Delivering the massive performance to meet this new age requires breakthroughs across GPUs, CPUs, NICs, scale-up and scale-out networking, system architectures, and mountains of software and algorithms. In MLPerf Training v5.1 — the latest round in a long-running series […]

ai infrastructure hardware networking software ai training

4 Nov 2025

Markus Hacker 4 Nov 2025 4 min read

Deutsche Telekom and NVIDIA Launch Industrial AI Cloud — a New Era for Germany’s Industrial Transformation

In Berlin on Tuesday, Deutsche Telekom and NVIDIA unveiled the world’s first Industrial AI Cloud, a sovereign, enterprise-grade platform set to go live in early 2026. The partnership brings together Deutsche Telekom’s trusted infrastructure and operations and NVIDIA AI and Omniverse digital twin platforms to power the AI era of Germany’s industrial transformation. “We have […]

ai infrastructure cloud corporate industrial and manufacturing nvidia dgx

31 Oct 2025

Scott Martin 31 Oct 2025 4 min read

Korea Joins AI Industrial Revolution: NVIDIA CEO Jensen Huang Unveils Historic Partnership at APEC Summit

Amidst Gyeongju, South Korea’s ancient temples and modern skylines, Jensen Huang hit the stage at the APEC Summit with historic news: South Korea is leaping into the future with sovereign AI supported by more than a quarter-million NVIDIA GPUs. “It’s vital that we build the ecosystem, not just the AI infrastructure, of Korea,” he said. […]

ai infrastructure corporate gaming cuda inception

28 Oct 2025

NVIDIA Writers 28 Oct 2025 2 min read

NVIDIA GTC Washington, DC: Live Updates on What’s Next in AI

Monday, Oct. 27, 12:30 p.m. ET: How Medium-Sized Cities Are Tackling AI Readiness. A panel discussion today at GTC Washington, D.C., highlighted a public-private initiative to invigorate the economy of Rancho Cordova, California, with a focus on AI. To bolster innovation, the city is working with the Human Machine Collaboration Institute and NVIDIA on […]

ai ai infrastructure corporate driving robotics

Timothy Costa 28 Oct 2025 2 min read

NVIDIA AI Physics Transforms Aerospace and Automotive Design, Accelerating Engineering by 500x

Leading technology companies in aerospace and automotive are accelerating their engineering design processes with the NVIDIA DoMINO NIM microservice, part of the NVIDIA PhysicsNeMo AI physics framework. By integrating GPU-accelerated computing, NVIDIA PhysicsNeMo and interactive digital twin technologies, enterprises are accelerating their modeling and simulation workflows by up to 500x over traditional methods, speeding innovation […]

ai infrastructure software cuda-x gtc 2025 industrial and manufacturing

Louis Stewart 28 Oct 2025 5 min read

Fueling Economic Development Across the US: How NVIDIA Is Empowering States, Municipalities and Universities to Drive Innovation

To democratize access to AI technology nationwide, AI education and deployment can’t be limited to a few urban tech hubs — it must reach every community, university and state. That’s why NVIDIA is working with cities, states and educational institutions to embed AI education and innovation across the U.S., with the goal of helping the […]

ai ai infrastructure corporate robotics ai factory

Scott Martin 28 Oct 2025 4 min read

NVIDIA, NPS Commission the Navy’s AI Flagship for Training Tomorrow’s Leaders

Along the Pacific Ocean in Monterey, California, the Naval Postgraduate School (NPS) is making a splash all the way to Washington, D.C.: It’s using artificial intelligence to solve operational challenges while educating tomorrow’s leaders in AI skills. Like Silicon Valley, it’s not uncommon for NPS, the U.S. Navy’s flagship academic graduate university, to hold hackathons, […]

ai infrastructure hardware artificial intelligence deep learning institute digital twin

Chris Porter 28 Oct 2025 4 min read

NVIDIA and General Atomics Advance Commercial Fusion Energy

The race to bottle a star now runs on AI. NVIDIA, General Atomics and a team of international partners have built a three-dimensional, interactive AI-enabled digital twin for a fusion reactor, with technical support from San Diego Supercomputer Center at UC San Diego School of Computing, Information and Data Sciences, the Argonne Leadership Computing […]

ai infrastructure software supercomputing cuda-x digital twin

Kanika Atri 28 Oct 2025 4 min read

NVIDIA Open Sources Aerial Software to Accelerate AI-Native 6G

NVIDIA is delivering the telecom industry a major boost in open-source software for building AI-native 5G and 6G networks. NVIDIA Aerial software will soon be released as open source, making it available on a variety of NVIDIA platforms, including on NVIDIA DGX Spark. With open-source software and a powerful and accessible supercomputer to run it […]

ai infrastructure mobile 5g 6g ai-ran

Rory Kelleher 28 Oct 2025

Lilly Deploys World’s Largest, Most Powerful AI Factory for Drug Discovery Using NVIDIA Blackwell-Based DGX SuperPOD

Lilly, a pioneer in medicine, is deploying the largest, most powerful AI factory wholly owned and operated by a pharmaceutical company: the world’s first NVIDIA DGX SuperPOD with DGX B300 systems.

ai ai infrastructure ai factory ai for good artificial intelligence

Justin Boitano 28 Oct 2025 4 min read

NVIDIA and US Technology Leaders Unveil AI Factory Design to Modernize Government and Secure the Nation

Governments everywhere are racing to harness the power of AI — but legacy infrastructure isn’t built for the velocity, complexity or trust that mission-critical action now demands. Massive data streams, cyber threats and urgent operations require a new blueprint for creating AI factories purpose-built for the public sector’s unique standards and scale. At NVIDIA GTC […]

ai infrastructure software gtc 2025 nvidia ai enterprise nvidia spectrum-x ethernet

James Mills 28 Oct 2025 4 min read

NVIDIA Launches Omniverse DSX Blueprint, Enabling Global AI Infrastructure Ecosystem to Build Gigawatt-Scale AI Factories

During the GTC Washington, D.C., keynote today, NVIDIA founder and CEO Jensen Huang introduced NVIDIA Omniverse DSX, a comprehensive, open blueprint for designing and operating gigawatt-scale AI factories — validated at the new AI Factory Research Center at Digital Realty’s site in Manassas, Virginia. The blueprint brings together ecosystem partners across the industry that are […]

ai ai infrastructure software ai factory digital twin

Itay Ozery 28 Oct 2025 3 min read

NVIDIA Launches BlueField-4: The Processor Powering the Operating System of AI Factories

AI factories continue to grow at unprecedented scale, processing structured, unstructured and emerging AI-native data. With demand for trillion-token workloads exploding, a new class of infrastructure is required to keep pace. At NVIDIA GTC Washington, D.C., NVIDIA revealed the NVIDIA BlueField-4 data processing unit, part of the full-stack BlueField platform that accelerates gigascale AI infrastructure, […]

ai infrastructure networking cybersecurity gtc 2025 nvidia bluefield