The Anatomy of AI Power

Moritz Maibaum

Rejected for the following reason(s):

This is an automated rejection. No LLM generated, assisted/co-written, or edited work.

Read full explanation

This is a crosspost from our blog.

The rapid scaling of AI has shifted the bottleneck of compute from data throughput to energy delivery. While a traditional data center consumes 10 to 50 MW, the first AI clusters are crossing 1 GW. Constructing a 1 GW AI data center requires an estimated 35 billion USD in upfront CapEx, with a lot of it going towards energy delivery.

We map the journey of electricity from the utility grid to the GPU core.

1. The Macro Level (Facility)

Path: Grid → Substation → Switchgear → UPS

To train LLMs, data centers act as a single, unified supercomputer. This is why when a training sequence initiates, the facility experiences a massive, synchronized demand for power.

The journey begins at the utility grid, carrying raw, unconditioned power at extreme voltages (110kV+). An on-site Substation steps this down to a medium voltage (typically 11kV to 35kV).

The most critical facility-level component is the Uninterruptible Power Supply (UPS). If the grid fluctuates or fails, diesel or gas generators take several seconds to boot. The UPS utilizes massive battery arrays to bridge this gap, ensuring the AI clusters never lose power.

Companies like Schneider Electric, Eaton, and ABB dominate this infrastructure. Semiconductor companies (e.g., Infineon, Wolfspeed) provide the Silicon Carbide (SiC) switches inside the UPS, with some modern UPS products advertising up to 99% efficiency.

2. The Distribution Level

Path: UPS → Floor PDUs → Busways → Rack

Once power is conditioned by the UPS, it is distributed across the data center floor. It travels through large transformers inside floor-level Power Distribution Units (PDUs), which step the voltage down to 415V or 240V AC.

This power is transported down the aisles of server racks via overhead or underfloor copper bars known as Busways. Because AI racks consume significantly more power than traditional web-hosting racks (up to 120kW per rack compared to a traditional 10kW), these busways must carry immense current.

Traditional mechanical circuit breakers are being replaced by Solid-State Circuit Breakers (SSCBs) that can cut power in microseconds to prevent catastrophic arc flashes. AI datacenter operators are willing to significantly increase their spending for these components, because a single power malfunction in their facility can be incredibly expensive.

3. The Rack Level (48V Pivot)

Path: Rack PDUs → Power Shelves

This stage represents the most significant architectural pivot in modern AI infrastructure. For decades, traditional servers operated on a 12-Volt DC backplane. However, the sheer power density of AI GPUs renders 12V economically and physically highly impractical due to transmission limits.

To facilitate this, racks now feature Power Shelves: centralized banks of power supply units (PSUs). These shelves take the incoming AC power and convert it into highly stable 48V DC. Inside these PSUs, Gallium Nitride (GaN) and Superjunction MOSFETs switch at incredibly high frequencies, allowing power supplies to remain compact while delivering Titanium-grade efficiency. NVIDIA’s NVL72 architecture has emerged as the de-facto reference for 120 kW liquid-cooled racks at hyperscale, anchoring the industry’s transition to the 48 V DC backplane.

The Power Formula: P = V × I
If an AI rack requires 100,000 Watts at 12 Volts, it must push 8,333 Amps of current. This causes extreme heat and copper losses (I²R). By increasing the rack voltage to 48V DC, the current drops to 2,083 Amps. This reduces power losses by a factor of 16.

4. The Board/Chip Level

Path: 48V DC → VRMs → AI Silicon Core

The final stage is one of the the most demanding power engineering environment in the world. The 48V DC power arrives at the motherboard and must be immediately stepped down to the exact voltage the silicon logic requires (usually between 0.6V and 0.8V).

This transition is handled by Point-of-Load (PoL) converters and Voltage Regulator Modules (VRMs). A single flagship AI GPU (like NVIDIA’s Blackwell) can consume over 1,200 Watts. At 0.7 Volts, a single chip demands nearly 1,700 Amps of current.

Traditional Lateral Power Delivery (LPD) clusters VRMs around the silicon. To combat transient resistance, next-generation architectures are replacing this with Vertical Power Delivery (VPD), placing the VRMs directly underneath the GPU rather than next to it. Semiconductor companies supply the specialized smart power stages and digital controllers that orchestrate this rapid-fire energy delivery.

The Transient Challenge
AI workloads are highly “bursty”. A GPU may demand its full 1,700 Amps in a matter of microseconds. If the VRMs cannot respond instantly, the voltage drops, and the GPU crashes.

Conclusion

The AI buildout is inextricably linked to power infrastructure. As model sizes scale, the physical limit of computation is bound by our ability to efficiently convert 110,000 Volts from the grid into 0.7 Volts at the silicon core. The companies that master this conversion across the entire macro-to-micro value chain will be essential during the next era of high-performance computing.