Tag: data centers

  • AI’s next bottleneck may not be intelligence. It may be Earth.

    AI’s next bottleneck may not be intelligence. It may be Earth.

    For the last two years, the AI debate has been mostly about intelligence.

    Which model is ahead? How fast are capabilities improving? Will agents replace tasks, jobs, or whole workflows? Can Europe regulate the technology fast enough?

    All valid questions.

    But the next constraint may be less abstract. It may be physical.

    Power. Grid capacity. Land. Cooling. Permits. Transmission lines. Water. Construction time. Capital allocation.

    The AI race is turning into a gigawatt race. And if the space-data-center discussion is any signal, the next frontier may not just be cloud regions. It may be orbit.

    My read: the executive conversation has to move from "Which AI model should we use?" to "What physical infrastructure does our AI strategy depend on?"

    The scale shift

    Chart showing typical data center power use from 5-10 MW to 100 MW and 1 GW
    The scale jump matters: 10 MW is a facility, 100 MW is industrial infrastructure, and 1 GW becomes a regional energy strategy.

    A modern hyperscale data center is not a large office building with servers. It is an industrial energy asset.

    The International Energy Agency says average data centers draw around 5-10 megawatts. Large hyperscale facilities increasingly require 100 megawatts or more. That number sounds technical, so translate it.

    One megawatt running continuously for a year equals 8.76 gigawatt-hours. A 100 MW data center therefore consumes 876 GWh per year, or 0.876 TWh. At 90% utilization, still roughly 0.8 TWh per year. The IEA compares this to the annual electricity demand of about 350,000 to 400,000 electric cars.

    A 1 GW AI campus is ten 100 MW hyperscale data centers. Running continuously, it consumes 8.76 TWh per year.

    For comparison, Germany's annual electricity consumption is roughly 500 TWh. The EU is around 2,700 TWh. The US is around 4,000 TWh. So one 1 GW AI campus would be small at continental scale – about 0.3% of EU electricity consumption or 0.2% of US consumption – but huge at local grid scale.

    That local point matters.

    Put a 1 GW load in the wrong county, with weak transmission and slow permitting, and it is not "0.2% of America." It is a grid emergency, a political fight, and a capital allocation problem.

    Now consider the language around terawatts. Elon Musk's recent "Terafab" discussion was about chip manufacturing, not a conventional data center, but the vocabulary matters. AI infrastructure ambition is moving from mega to giga to tera. A theoretical 1 TW compute or manufacturing footprint running continuously would consume 8,760 TWh per year. That is more electricity than the US and EU combined.

    That does not mean a 1 TW data center is around the corner. It means the ambition curve is now colliding with the energy system.

    The current footprint

    The IEA estimates global data center electricity consumption at 240-340 TWh in 2022, excluding crypto mining. That was around 1-1.3% of global final electricity demand.

    In large economies such as the United States, China and the European Union, data centers already account for around 2-4% of total electricity consumption. That is the average.

    The local reality is more extreme.

    The IEA notes that data centers have already surpassed 10% of electricity consumption in at least five US states. In Ireland, data centers account for more than 20% of electricity consumption. Denmark projects data center electricity use could rise sixfold by 2030 and approach 15% of national electricity consumption.

    This is the important distinction: globally, data centers are still a manageable share of electricity. Locally, they can become one of the dominant loads on the system.

    Goldman Sachs Research estimates data center power demand could grow 160% by 2030, with global data centers rising from roughly 1-2% of power consumption today to 3-4% by the end of the decade. It also estimates AI could add around 200 TWh per year of data center power demand between 2023 and 2030.

    Two hundred TWh is not abstract. It is close to the annual electricity consumption of a mid-sized industrial country. And it is only the AI-related increment in one forecast.

    The backlash is already here

    Chart comparing global data center electricity share with US, EU, Ireland and local grid impacts
    Global averages hide local pressure: data centers can reach double-digit shares of electricity demand in specific regions.

    This is no longer theoretical.

    In May, several local flashpoints showed the political side of the bottleneck. Seattle was weighing a pause on large data centers. Durham, North Carolina passed a 60-day moratorium on data-center development. A Texas county paused data-center construction in rural areas for a year. Utah approved a data-center project described as twice the size of Manhattan, triggering backlash. Tennessee was considering legislation that would let data centers self-power with limited regulation.

    Different places, same pattern.

    AI infrastructure is colliding with local politics. Communities are asking who gets the jobs, who pays for grid upgrades, who carries water risk, who absorbs noise and land-use impact, and who benefits from the compute.

    This is the part of the AI story many executives still underestimate. It is not enough to have GPUs. You need permission. You need interconnection. You need credible energy sourcing. You need community acceptance.

    The future of AI may be decided as much in planning boards and utility queues as in model labs.

    Why energy is now part of AI leadership

    Executive checklist for AI energy strategy and infrastructure planning
    AI energy strategy is now an executive checklist: economics, thresholds, model allocation, partnerships, and efficiency.

    For a long time, digital leaders could assume infrastructure would scale behind the scenes. Cloud abstracted away servers. SaaS abstracted away operations. Developers increasingly acted as if compute was infinite, elastic, and mostly someone else's problem.

    AI breaks that illusion.

    Training frontier models is energy-intensive. Inference at scale may matter even more because successful AI products are used continuously. Agents add another multiplier: they do not just answer one prompt. They plan, call tools, retry, search, generate, check, and act. A single user request can become dozens or hundreds of model calls behind the scenes.

    That makes energy not just an engineering issue but a leadership issue.

    If AI becomes a core production layer, power becomes part of product economics. Latency becomes part of geography. Energy procurement becomes part of risk management. Infrastructure partnerships become part of market entry. Sustainability claims become harder to defend if absolute consumption rises faster than efficiency improves.

    The better question is not whether AI uses "too much" energy.

    The better question is: are we using scarce energy for high-value intelligence, or are we wasting it on low-value automation theatre?

    The opportunity

    The upside is enormous.

    AI can help design better grids, forecast demand, optimize industrial processes, improve cooling, accelerate materials science, reduce waste, and make energy systems more flexible. The same technology that increases electricity demand can also improve how electricity is produced, routed, stored, and consumed.

    There is also a market opportunity.

    Companies that solve the infrastructure layer will not just be suppliers to AI. They will become strategic gatekeepers. Power developers, grid operators, data-center builders, cooling specialists, chip designers, construction firms, nuclear developers, storage providers, and energy software companies are moving closer to the center of the AI economy.

    This is especially relevant for Europe.

    Europe often frames AI competitiveness around regulation, foundation models, sovereignty, and talent. All matter. But infrastructure sovereignty may become just as important. If compute depends on power availability, grid speed, and data-center capacity, then AI sovereignty is partly electricity sovereignty.

    A European AI strategy without an energy strategy is incomplete.

    The space question

    Conceptual space-based AI data center with solar arrays orbiting above Earth
    Space-based data centers are not a near-term replacement for terrestrial infrastructure. They are a signal that the AI compute curve is pushing beyond the grid.

    The more provocative version of this debate is space.

    A few years ago, data centers in orbit sounded like science fiction. Now Bloomberg is writing about how to build them. McKinsey has made the case for space-based data centers. University researchers are exploring the idea because AI energy demand is rising. Google and SpaceX have been linked in recent coverage to the broader possibility of AI data centers in space.

    The attraction is obvious: continuous solar power, less terrestrial land pressure, potentially easier cooling through radiative systems, and the strategic appeal of moving part of the compute layer off Earth.

    The problems are just as obvious: launch cost, maintenance, radiation, latency, orbital debris, security, regulation, and basic economics.

    But the fact that serious people are asking the question matters. Space data centers are not a near-term replacement for terrestrial infrastructure. They are a signal. The AI compute curve is steep enough that people are looking beyond the grid.

    When a technology forces executives to ask whether the data center belongs in orbit, something fundamental has changed.

    What leaders should do now

    The call to action is practical.

    First: put energy into the AI business case. Every serious AI initiative should have a compute and energy view, not just a model and vendor view. If the project scales 10x or 100x, what happens to cost, latency, emissions, and capacity?

    Second: use real thresholds. A 10 MW workload is a large facility. A 100 MW workload is industrial infrastructure. A 1 GW workload is a regional energy strategy. Treat them differently.

    Third: separate high-value intelligence from low-value automation. Not every workflow deserves heavy AI. Use frontier models where judgment, ambiguity, and leverage justify the cost. Use smaller models, retrieval, caching, rules, and process redesign where they are enough.

    Fourth: make infrastructure a board-level topic. If AI is strategic, then power supply, data-center capacity, cloud concentration, and sustainability are strategic. CIOs, CTOs, CFOs, COOs, and sustainability leaders need one shared view.

    Fifth: build partnerships beyond software. The AI stack now reaches into energy markets, utilities, real estate, cooling, semiconductors, construction, public policy, and eventually maybe space.

    The leadership shift

    The first AI leadership question was: "What can this technology do?"

    The second was: "How does it change work?"

    The third is now emerging: "What does it require from the physical world?"

    This is where the debate becomes more serious.

    AI is not just a software wave. It is a capital investment wave, an energy demand wave, and an infrastructure coordination problem. The limiting factor may not be imagination. It may be megawatts.

    Executives should not panic about that. But they should stop treating it as somebody else's problem.

    Models matter.

    But electricity decides where the models can run. And if the curve continues, the strategic question may become even stranger:

    How much intelligence can Earth afford to host?

    Sources and further reading

  • Hyperscaler 2026 capex hits ~$700B. Free cash flow is the variable that breaks.

    Hyperscaler 2026 capex hits ~$700B. Free cash flow is the variable that breaks.

    What was announced

    On February 6, CNBC reported that combined 2026 AI capex commitments across Amazon, Google, Microsoft, and Meta now approach $700 billion. Amazon: roughly $200 billion. Alphabet: up to $185 billion. Microsoft: increase from prior 2025 levels (analyst consensus near $99 billion FY26, ending June). Meta: budgeted $115–135 billion. Approximately 75% of the spend is AI-related — call it $450 billion of AI infrastructure in a single year, up about 36% versus 2025. Free cash flow projections for the same set of companies show meaningful compression; Amazon is forecast to turn negative, with analyst projections of negative free cash flow between $17 billion and $28 billion in 2026.

    What it means

    Capex of this magnitude rewrites the financial model for the entire frontier compute stack. The hyperscalers are no longer building toward a near-term revenue profile — they are building toward a 5-to-7-year usage curve they believe is coming. That is a different posture than the 2018–2022 capex cycle, which was largely demand-led. This one is conviction-led, and the conviction is asymmetric: if AI compute demand materializes at the projected rate, today’s capex looks conservative; if it lags by even 18 months, the depreciation schedule eats free cash flow at a rate the public markets have not yet priced.

    A second-order effect matters more for non-hyperscalers: every CIO planning AI infrastructure in 2026 is now negotiating against a supplier base whose capacity is partially already absorbed by internal hyperscaler workloads. Pricing power for capacity is structurally higher, lead times for premium GPU instances are longer, and the cost-per-token of frontier inference will move on hyperscaler margin compression rather than competition.

    Andreas’s view

    My read on this: $700 billion is not a number that resolves itself by spreadsheet logic. It resolves itself by which hyperscaler is willing to absorb the cash-flow hit longest. The strategic question inside each company is no longer “should we build” but “which competitor blinks first when the free-cash-flow line turns red on quarterly reporting.” Amazon is closest to that line. Microsoft has the strongest cash position to absorb it. Google sits in between. Meta has the most flexibility because its core ad business is funding the AI infrastructure with the lightest accounting drag.

    I don’t think the capex commitment will be revised down materially in 2026. The competitive cost of unilaterally easing off — handing GPU capacity, customer relationships, and the model-training cadence to a competitor — is too high. What will happen instead is creative financing: more debt, more partnerships with sovereign wealth and infrastructure funds, more long-term capacity contracts that move spend off the balance sheet. The capex will continue. The accounting around it will get more interesting.

    The way I see it, adjacent businesses should not assume the capacity they need will be available at the price they modeled. My expectation is that premium-tier inference and training capacity will be priced as a scarce resource for the rest of 2026 and most of 2027. Any AI roadmap that depends on flat or declining unit costs over that window has a hidden assumption built in that I think is unlikely to hold.

    Three things I’m watching

    1. I’ll be watching whether companies move to lock multi-year capacity contracts for premium inference and training now, or wait — because negotiating against scarcity in 2027 will be more expensive than over-committing modestly in 2026.
    2. The companies that preserve optionality will be the ones that have stress-tested their AI cost models against a scenario where frontier-tier compute prices are flat or rising for 18 months — and redesigned the workflow, not the budget, when the unit economics broke.
    3. Hyperscaler free-cash-flow disclosures over the next four quarters are the leading indicator I’m focused on — they will show whether the capex commitments hold or quietly compress.

    References and related signals