Tag: AI strategy

  • AI’s next bottleneck may not be intelligence. It may be Earth.

    AI’s next bottleneck may not be intelligence. It may be Earth.

    For the last two years, the AI debate has been mostly about intelligence.

    Which model is ahead? How fast are capabilities improving? Will agents replace tasks, jobs, or whole workflows? Can Europe regulate the technology fast enough?

    All valid questions.

    But the next constraint may be less abstract. It may be physical.

    Power. Grid capacity. Land. Cooling. Permits. Transmission lines. Water. Construction time. Capital allocation.

    The AI race is turning into a gigawatt race. And if the space-data-center discussion is any signal, the next frontier may not just be cloud regions. It may be orbit.

    My read: the executive conversation has to move from "Which AI model should we use?" to "What physical infrastructure does our AI strategy depend on?"

    The scale shift

    Chart showing typical data center power use from 5-10 MW to 100 MW and 1 GW
    The scale jump matters: 10 MW is a facility, 100 MW is industrial infrastructure, and 1 GW becomes a regional energy strategy.

    A modern hyperscale data center is not a large office building with servers. It is an industrial energy asset.

    The International Energy Agency says average data centers draw around 5-10 megawatts. Large hyperscale facilities increasingly require 100 megawatts or more. That number sounds technical, so translate it.

    One megawatt running continuously for a year equals 8.76 gigawatt-hours. A 100 MW data center therefore consumes 876 GWh per year, or 0.876 TWh. At 90% utilization, still roughly 0.8 TWh per year. The IEA compares this to the annual electricity demand of about 350,000 to 400,000 electric cars.

    A 1 GW AI campus is ten 100 MW hyperscale data centers. Running continuously, it consumes 8.76 TWh per year.

    For comparison, Germany's annual electricity consumption is roughly 500 TWh. The EU is around 2,700 TWh. The US is around 4,000 TWh. So one 1 GW AI campus would be small at continental scale – about 0.3% of EU electricity consumption or 0.2% of US consumption – but huge at local grid scale.

    That local point matters.

    Put a 1 GW load in the wrong county, with weak transmission and slow permitting, and it is not "0.2% of America." It is a grid emergency, a political fight, and a capital allocation problem.

    Now consider the language around terawatts. Elon Musk's recent "Terafab" discussion was about chip manufacturing, not a conventional data center, but the vocabulary matters. AI infrastructure ambition is moving from mega to giga to tera. A theoretical 1 TW compute or manufacturing footprint running continuously would consume 8,760 TWh per year. That is more electricity than the US and EU combined.

    That does not mean a 1 TW data center is around the corner. It means the ambition curve is now colliding with the energy system.

    The current footprint

    The IEA estimates global data center electricity consumption at 240-340 TWh in 2022, excluding crypto mining. That was around 1-1.3% of global final electricity demand.

    In large economies such as the United States, China and the European Union, data centers already account for around 2-4% of total electricity consumption. That is the average.

    The local reality is more extreme.

    The IEA notes that data centers have already surpassed 10% of electricity consumption in at least five US states. In Ireland, data centers account for more than 20% of electricity consumption. Denmark projects data center electricity use could rise sixfold by 2030 and approach 15% of national electricity consumption.

    This is the important distinction: globally, data centers are still a manageable share of electricity. Locally, they can become one of the dominant loads on the system.

    Goldman Sachs Research estimates data center power demand could grow 160% by 2030, with global data centers rising from roughly 1-2% of power consumption today to 3-4% by the end of the decade. It also estimates AI could add around 200 TWh per year of data center power demand between 2023 and 2030.

    Two hundred TWh is not abstract. It is close to the annual electricity consumption of a mid-sized industrial country. And it is only the AI-related increment in one forecast.

    The backlash is already here

    Chart comparing global data center electricity share with US, EU, Ireland and local grid impacts
    Global averages hide local pressure: data centers can reach double-digit shares of electricity demand in specific regions.

    This is no longer theoretical.

    In May, several local flashpoints showed the political side of the bottleneck. Seattle was weighing a pause on large data centers. Durham, North Carolina passed a 60-day moratorium on data-center development. A Texas county paused data-center construction in rural areas for a year. Utah approved a data-center project described as twice the size of Manhattan, triggering backlash. Tennessee was considering legislation that would let data centers self-power with limited regulation.

    Different places, same pattern.

    AI infrastructure is colliding with local politics. Communities are asking who gets the jobs, who pays for grid upgrades, who carries water risk, who absorbs noise and land-use impact, and who benefits from the compute.

    This is the part of the AI story many executives still underestimate. It is not enough to have GPUs. You need permission. You need interconnection. You need credible energy sourcing. You need community acceptance.

    The future of AI may be decided as much in planning boards and utility queues as in model labs.

    Why energy is now part of AI leadership

    Executive checklist for AI energy strategy and infrastructure planning
    AI energy strategy is now an executive checklist: economics, thresholds, model allocation, partnerships, and efficiency.

    For a long time, digital leaders could assume infrastructure would scale behind the scenes. Cloud abstracted away servers. SaaS abstracted away operations. Developers increasingly acted as if compute was infinite, elastic, and mostly someone else's problem.

    AI breaks that illusion.

    Training frontier models is energy-intensive. Inference at scale may matter even more because successful AI products are used continuously. Agents add another multiplier: they do not just answer one prompt. They plan, call tools, retry, search, generate, check, and act. A single user request can become dozens or hundreds of model calls behind the scenes.

    That makes energy not just an engineering issue but a leadership issue.

    If AI becomes a core production layer, power becomes part of product economics. Latency becomes part of geography. Energy procurement becomes part of risk management. Infrastructure partnerships become part of market entry. Sustainability claims become harder to defend if absolute consumption rises faster than efficiency improves.

    The better question is not whether AI uses "too much" energy.

    The better question is: are we using scarce energy for high-value intelligence, or are we wasting it on low-value automation theatre?

    The opportunity

    The upside is enormous.

    AI can help design better grids, forecast demand, optimize industrial processes, improve cooling, accelerate materials science, reduce waste, and make energy systems more flexible. The same technology that increases electricity demand can also improve how electricity is produced, routed, stored, and consumed.

    There is also a market opportunity.

    Companies that solve the infrastructure layer will not just be suppliers to AI. They will become strategic gatekeepers. Power developers, grid operators, data-center builders, cooling specialists, chip designers, construction firms, nuclear developers, storage providers, and energy software companies are moving closer to the center of the AI economy.

    This is especially relevant for Europe.

    Europe often frames AI competitiveness around regulation, foundation models, sovereignty, and talent. All matter. But infrastructure sovereignty may become just as important. If compute depends on power availability, grid speed, and data-center capacity, then AI sovereignty is partly electricity sovereignty.

    A European AI strategy without an energy strategy is incomplete.

    The space question

    Conceptual space-based AI data center with solar arrays orbiting above Earth
    Space-based data centers are not a near-term replacement for terrestrial infrastructure. They are a signal that the AI compute curve is pushing beyond the grid.

    The more provocative version of this debate is space.

    A few years ago, data centers in orbit sounded like science fiction. Now Bloomberg is writing about how to build them. McKinsey has made the case for space-based data centers. University researchers are exploring the idea because AI energy demand is rising. Google and SpaceX have been linked in recent coverage to the broader possibility of AI data centers in space.

    The attraction is obvious: continuous solar power, less terrestrial land pressure, potentially easier cooling through radiative systems, and the strategic appeal of moving part of the compute layer off Earth.

    The problems are just as obvious: launch cost, maintenance, radiation, latency, orbital debris, security, regulation, and basic economics.

    But the fact that serious people are asking the question matters. Space data centers are not a near-term replacement for terrestrial infrastructure. They are a signal. The AI compute curve is steep enough that people are looking beyond the grid.

    When a technology forces executives to ask whether the data center belongs in orbit, something fundamental has changed.

    What leaders should do now

    The call to action is practical.

    First: put energy into the AI business case. Every serious AI initiative should have a compute and energy view, not just a model and vendor view. If the project scales 10x or 100x, what happens to cost, latency, emissions, and capacity?

    Second: use real thresholds. A 10 MW workload is a large facility. A 100 MW workload is industrial infrastructure. A 1 GW workload is a regional energy strategy. Treat them differently.

    Third: separate high-value intelligence from low-value automation. Not every workflow deserves heavy AI. Use frontier models where judgment, ambiguity, and leverage justify the cost. Use smaller models, retrieval, caching, rules, and process redesign where they are enough.

    Fourth: make infrastructure a board-level topic. If AI is strategic, then power supply, data-center capacity, cloud concentration, and sustainability are strategic. CIOs, CTOs, CFOs, COOs, and sustainability leaders need one shared view.

    Fifth: build partnerships beyond software. The AI stack now reaches into energy markets, utilities, real estate, cooling, semiconductors, construction, public policy, and eventually maybe space.

    The leadership shift

    The first AI leadership question was: "What can this technology do?"

    The second was: "How does it change work?"

    The third is now emerging: "What does it require from the physical world?"

    This is where the debate becomes more serious.

    AI is not just a software wave. It is a capital investment wave, an energy demand wave, and an infrastructure coordination problem. The limiting factor may not be imagination. It may be megawatts.

    Executives should not panic about that. But they should stop treating it as somebody else's problem.

    Models matter.

    But electricity decides where the models can run. And if the curve continues, the strategic question may become even stranger:

    How much intelligence can Earth afford to host?

    Sources and further reading

  • AI: creating or destroying jobs?

    AI: creating or destroying jobs?

    The AI-jobs argument has split into two camps that aren’t actually arguing about the same thing.

    Jensen Huang told CEOs at GTC that firing people for AI shows “no imagination” — radiologists, he points out, are more numerous now than before AI entered radiology. Marc Andreessen calls the displacement narrative “completely fabricated” and points to Jevons Paradox: cheaper labor produces more demand, not less. The WEF Future of Jobs Report still projects net +78 million jobs globally by 2030. Challenger’s Hiring Plans index was up 157% year-over-year in March.

    A week later, Block laid off 40% of its workforce. Jack Dorsey said engineering work that needed weeks now happens in a fraction of the time. Block is still hiring AI engineers.

    So which is it?

    My read: both sides are right. They’re answering different questions about different decades. Most of the public argument is two conversations pretending to be one.

    The optimist case

    Three pieces hold it up.

    The historical record is strong. Keynes wrote in 1930 that his grandchildren would work fifteen-hour weeks. Reality 2025: OECD average is thirty-seven hours, Americans clock 1,976 hours a year. Mechanization, electrification, the computer, the internet — every general-purpose technology was forecast to end work, and every one produced more jobs than it eliminated. In 1900, 41% of Americans worked in agriculture; today it’s 2%. The jobs went somewhere.

    Jevons Paradox is real. When something useful gets cheaper, demand rises. If AI makes cognitive work twenty times cheaper, you don’t end up with one-twentieth the cognitive work. You end up with twenty times the cognitive work, deployed against far more problems. Andreessen’s “Super-PhD in every field” captures it.

    A big chunk of the labor market is hard to displace. Licensed jobs (medicine, law, accounting), unionized jobs (skilled trades, transit, public safety), and public-sector roles add up to a large fraction of US employment. Not protected because they’re irreplaceable in some technical sense — protected by institutions that move slowly.

    Each piece is correct. The question is whether they’re enough.

    Where the optimist case breaks

    Radar chart of AI capability versus observed usage across eight occupations from the Anthropic Economic Index, showing the deployment gap.
    The deployment gap: theoretical AI capability dwarfs observed usage by occupation. Source: Anthropic Economic Index.

    The Anthropic Economic Index plots theoretical AI capability against observed AI usage by occupation. The two lines look almost nothing alike — capability is broad and high; usage is narrow and concentrated. There’s a gap between what AI can do and what it’s actually doing.

    Read that gap two ways. The optimist reading: deployment is slow, friction is real, the labor market reabsorbs shocks like it always has. The harder reading: the gap is the queue — it’s where displacement comes from over the next five to ten years, not from new capability but from deployment catching up to capability that already exists.

    94% of cognitive job tasks are theoretically automatable today; 33% actually are. The space between is the transition zone. It’s not science fiction. It’s not contested. Most of it will close. Block’s layoffs sit on the second reading.

    The historical-record argument also has a footnote that doesn’t get enough weight. AI is the first general-purpose technology to automate cognitive labor at scale. Every prior wave automated muscle, then narrow categories of cognitive work — but never the universal category of “thinking and writing and analyzing and deciding.” The tractor displaced farm hands; they moved into office work. The PC displaced typewriters and clerks; they moved into knowledge work. AI doesn’t have an obvious “moved into” destination, because the destination of every prior wave is the category AI now automates.

    The TIME / Contextual AI benchmark chart makes the universality vivid. AI surpassed human-level performance on handwriting recognition around 2015, then speech, then images, reading, language, common sense, math, code generation. The rate at which new tasks fall is increasing.

    The trades-and-physical-work counterargument is weaker than it looks. Yes, 57% of jobs depend on physical presence or craft work AI can’t currently replicate. But 70% of positions inside blue-collar companies — the dispatcher, the accountant, the customer-service rep — are white-collar-adjacent and fully exposed. And if displaced knowledge workers all migrate into trades, wages collapse from saturation. Bank of America projects billions of humanoid robots by mid-century with hardware costs falling from $35,000 to under $15,000; one analyst projects robot-hours at four to six euros. Even physical work has an expiration date.

    So the optimist case is strong for a long-run answer. It’s much weaker for the next ten years.

    The displacement case

    Not “AI replaces all jobs.” That’s the optimists’ caricature, and once you reach for it the displacement case looks weak. The serious version is more specific.

    Three vertical bars on dark navy: high-skill rising, middle-skill shrinking with downward arrow, low-skill stable — the AI barbell economy.
    The barbell economy: high-skill productivity rises, low-skill stable, the middle hollows out.

    It’s structural: the middle is being squeezed. The labor market is shifting from a K-shape into a barbell. High-skill technical roles are more productive — the same Anthropic data shows code, analysis, and research at the top of the productivity-gain distribution, with usage approaching 60% of theoretical capacity. Low-skill physical roles in care, hospitality, manual handling, and trades are stable for now. The middle is shrinking: bookkeeping and paralegal work, content writing and copywriting, junior finance and analyst roles, customer service, entry-level coding, marketing copy, translation, project coordination, junior tax preparation.

    Germany has already seen roughly 90,000 AI-related job losses in the first months of 2026. The risk is not mass unemployment in aggregate. Aggregate unemployment can stay low for years while the middle hollows out. The risk is a split labor market — and a split society — in which the people who staffed the middle no longer have a clear path up or sideways.

    The Anthropic Economic Index BLS panel makes this concrete: hiring of younger workers in AI-exposed occupations has slowed, even as overall employment numbers haven’t moved much. That’s what early-stage hollowing looks like — the entry-level rung disappears first, before the established middle does.

    Five years of that compounds into something the historical record didn’t have to absorb.

    My read — the three-phase shape

    Horizontal timeline 2025 to 2040+ split into three colored zones: red displacement, amber strain, cyan abundance — AI jobs transition phases.
    Three phases of the AI jobs transition: displacement (2025-2030), strain (2030-2035), abundance (2035+).

    The clearest three-phase framing is German — chronological, not parallel.

    Phase one — displacement (~2025-2030). AI displaces knowledge work faster than the labor market rebuilds. The middle hollows. Aggregate unemployment may not move much; entry-level paths in white-collar roles narrow sharply. The optimists are right that the technology eventually creates new categories. They’re wrong about the timing.

    Phase two — strain (~2030-2035). Strain shows up in places that aren’t unemployment: tax-base erosion, weakened consumer demand, capital returns rising while labor’s share of national income falls to historic lows. Public-sector and licensed-job cushions hold initially but come under fiscal pressure. The political consequences sharpen.

    Phase three — abundance (after ~2035). The deflation the optimists describe arrives. Costs collapse across categories. What costs $100 today costs a few cents. The median 2040 lifestyle, on a flow-of-services basis, looks something like today’s high-net-worth lifestyle on every dimension except positional goods. Both Andreessen and Huang are right about the destination.

    That’s the timeframe trap. Both sides are correct on their respective horizons. The honest version of the optimist case includes the transition pain. The honest version of the displacement case includes the recovery.

    What this means for how leaders think about the next ten years: the question isn’t “do we believe in AI displacement, yes or no.” That question is roughly answered. The task is to assume real displacement in the middle, plan for it, and carry the organization through to the recovery in a way that keeps the institution and its people whole.

    Three things I’m watching

    1. Whether the entry-level signal becomes a leading indicator. The slowing of hiring for younger workers in AI-exposed occupations is, in my read, the most important early signal. Aggregate employment numbers lag; entry-level absorption leads. If the slowdown becomes a structural break, phase one stops being a forecast and becomes a measurement.
    2. Whether the licensed and public-sector cushion holds when fiscal space tightens. The structural-protection argument is strong only as long as the institutions that protect those jobs don’t themselves come under fiscal pressure. Phase two erodes the tax base. The question is whether legislatures and regulators are protecting genuinely-essential public-sector employment or post-hoc subsidizing the share of the workforce the private sector can no longer place.
    3. Whether the recovery looks like restored employment or restored income. Phase three is consistent with both. Jobs come back in new categories — the historical track record. Or they don’t come back at scale and the recovery is income-shaped: UBI-like distribution of the deflation surplus rather than wage-based participation. These look very different politically. The shape of phase two is what determines which one we get.

    No one has confidence on these three questions yet. I’m watching them because the answers will tell us, in roughly the next five years, what the transition phase actually costs.

    The destination is not in serious doubt. The road is.

  • Model deprecation is the new continuity risk

    Model deprecation is the new continuity risk

    Four rectangles in a row with the leftmost ghosted, simple connecting arrows
    A — model lifecycle row.

    OpenAI announced the discontinuation of the Sora web and app experiences on April 26, with the Sora API following on September 24. The first deprecation triggers in two weeks. Enterprises that built workflows on Sora since launch are not facing a model upgrade — they are facing a workflow rebuild on a four-month timeline. This is the first prominent enterprise-facing AI deprecation event of the cycle, and the precedent it sets matters more than the specific product involved.

    Model deprecation is no longer a developer-tier concern. It is an enterprise governance question that deserves a place on the risk committee agenda. The real shift is happening here: AI dependency without continuity is becoming a board-level risk in 2026.

    The shift: dependency without continuity guarantees

    The pattern of the past two years has been to build agent workflows on whichever foundation model was demonstrably best at the time, with little contractual commitment from the model provider about how long that model would remain available. Provider terms have improved — Azure OpenAI’s twelve-plus-six-month commitment for generally available models is the strongest standard in market — but most enterprises have not negotiated equivalent terms with their chosen providers. They built on capability, not on continuity.

    When the provider sunsets the model, the enterprise’s options are bad. Migrate to a successor model that may behave differently in subtle ways — requiring re-validation of every governed use case. Renegotiate at the eleventh hour for extended access at unfavorable terms. Or absorb the operational disruption of the workflow simply not working until rebuilt.

    The Sora event is small in dollar terms but large in precedent. The next deprecation will involve a more enterprise-critical model, and the enterprises that did not see this one coming are not going to see that one coming either.

    A single thread connecting a workflow box to a model box, the thread visibly fraying near the model with a clock above
    Built on capability. Not on continuity.

    The role change is the addition of an AI continuity discipline

    Inside enterprises that take this seriously, a discipline is emerging that did not exist in 2024 — AI continuity management. The work overlaps with vendor management, with disaster recovery, with model risk management, and with regulatory compliance, but it is structurally distinct from all of them. The discipline involves maintaining an inventory of model dependencies by workflow, negotiating continuity commitments at procurement, running successor-model regression tests on a regular cadence, and ensuring that the documentation chain meets the rebuild-readiness standard.

    Most enterprises have not staffed this discipline. The accountabilities are scattered across teams that do not coordinate. The procurement team negotiated the model contract a year ago without a continuity clause. The deployment team is building production dependencies on the model without thinking about migration cost. The risk team has not flagged model deprecation as a category. When the deprecation announcement lands, the company finds out it has no plan.

    The fix is straightforward in concept and slow in practice. Add continuity commitments to the procurement template. Build a model-dependency inventory. Designate an owner for AI continuity at the executive level. Run quarterly successor-model tests. None of this is hard. It is just unglamorous work that does not get done unless someone owns it.

    The strategic consequence is renewed buy-versus-build math

    Continuity risk changes the calculus of where to deploy AI capability. For workflows where the cost of unplanned migration is high — regulated workflows, mission-critical operations, customer-facing experiences with high switching costs — the case for either fine-tuning a frontier model into a controlled deployment, partnering with a vendor offering enterprise-grade continuity commitments, or building on open-weight models the enterprise can host indefinitely is stronger than it was in 2024. The case for relying on whichever model is best on a benchmark this quarter is weaker.

    The math is not simple. Open-weight models lag the frontier, sometimes meaningfully. Self-hosting carries operational cost that the proprietary providers absorb. The vendor lock-in to a single proprietary provider, even with the best continuity terms, is a different kind of risk than open-weight self-hosting carries. Each enterprise has to make this trade-off based on the workflow’s tolerance for capability lag versus its tolerance for continuity disruption.

    What is no longer defensible in 2026 is treating model continuity as someone else’s problem. The Sora sunset is small. The next one will not be.

    So what boards should do this quarter

    Add model deprecation to the risk committee agenda. The first deprecation event lands in two weeks. The board should at minimum understand which workflows are exposed and what the migration plans are.

    Demand a model-dependency inventory. Which workflows depend on which models from which providers, with which contractual continuity commitments. If this inventory does not exist, building it is the priority.

    Reconsider the buy-versus-build posture for mission-critical AI workflows. The 2024 default — use whichever proprietary model is best — was rational at the time. In 2026, with the deprecation precedent now visible, that default deserves an explicit reconsideration. Continuity is becoming a form of resilience. The boards that price it in this quarter will not be the ones rebuilding workflows under deadline.

    References and links

  • The agentic year begins underprepared

    The agentic year begins underprepared

    The year opens with a measurable gap. McKinsey’s 2026 trust maturity survey, fielded in December and January, puts twenty-three percent of organizations into the scaling phase for agentic systems and thirty-nine percent into experimentation. The remaining majority — nearly two thirds — has not yet begun scaling AI across the enterprise. The capability frontier moved twelve to eighteen months faster than the operating models around it. That gap is no longer an experimentation question. It is the year’s defining strategic risk.

    The boards that close this gap first will not be using better models than their competitors. They will be running organizations that can metabolize what the models already do. The constraint is no longer technology. It is adoption — and adoption is a leadership problem.

    The shift is structural, not cyclical

    Agentic systems are not a new feature inside a familiar product. They are a new class of worker. They take a goal, decompose it into steps, hold state across those steps, call other tools, recover from errors, and return a completed unit of work. That changes what a job is, not how a job is done.

    The 2025 narrative — copilots, productivity boosts, ten percent uplift — is over. The 2026 question is harder. What units of work no longer require a human originator? What units of work now require a human reviewer instead of a human executor? Which decisions can be delegated to a system that explains its reasoning? The companies asking these questions on a Monday morning are reorganizing. The companies still benchmarking model accuracy are stalling.

    The shift is one-way. No board will vote in 2027 to remove agentic systems from a workflow they reduced from forty hours to four. The architectural choices made this year will compound.

    Diagram of one human silhouette passing a goal to a central node that branches into multiple task arrows
    Goal in, decomposition out, no human in the loop between.

    The role change has already happened on the ground

    Inside organizations that have actually shipped agentic systems, the role redefinition is happening informally, by individual contributors, ahead of any HR process. A senior analyst who used to write three reports a week now reviews twelve agent-drafted reports a week and signs off on the analysis. A staff engineer who used to write three pull requests a day now reviews fifteen agent-generated pull requests a day. An account manager who used to draft proposals now edits proposals the agent has built from CRM context.

    The work that survives is judgment, taste, accountability, and relationship. The work that does not survive is execution under specification. Job titles still describe the second category. Job content has already shifted to the first.

    First-line managers feel this most acutely. They were trained to manage humans doing execution work. They are now managing humans doing review work, who in turn are managing systems doing execution work. That is a different management discipline — closer to portfolio management of automated processes than to people management of execution teams.

    A figure at a desk with twelve document icons floating above, marking one of them
    Three reports a week became twelve reviews a week.

    The organizational consequence is delayering

    Span of control widens when the work below each manager becomes more automated and more reviewable. McKinsey’s parallel work on the state of organizations points in the same direction: companies that scale agentic systems also flatten by removing one to two layers of middle management. The economic logic is direct. Middle layers existed to translate strategy into execution and to coordinate the humans doing that execution. When the execution is increasingly handled by systems and the translation is increasingly handled by models, the layer is doing less.

    This is not the 2024 layoff cycle that hit individual contributors. This is a 2026 reorganization that compresses the manager-of-managers layer. It is structurally different and politically harder. The people most threatened by it are the people running the budget meetings about it.

    Organizations that resist the delayering will have a temporary cost advantage and a permanent decision-velocity disadvantage. Decision cycles compress when fewer humans need to be in the loop. The competitor who removed two layers will commit to a market move three weeks faster. Over a year, that compounds into a different market position.

    Two org-chart pyramids side by side, the right one flatter, with an arrow indicating compression
    The middle layer compresses, span of control widens.

    So what boards should do this quarter

    Two actions belong on the Q1 agenda. First, demand a workforce plan that names the units of work moving from human execution to human review, with a twelve-month horizon. Vague AI strategies are no longer acceptable as deliverables; the question is which jobs, which tasks, which review cadences, which accountability lines.

    Second, name an executive owner for the operating-model redesign — not for AI strategy as a separate track, but for the way the company will be organized around the systems it has already deployed. The CHRO and the COO are the natural owners. The CTO is not. The technology decision is downstream of the operating-model decision, and treating it as upstream is how organizations end up with sophisticated tools and a 2023 org chart.

    The year that just started will be measured by the gap between capability and operating model. The companies that close it first set the pace for the rest of the decade. The risk is not moving too fast. The risk is moving too late. Execution speed will separate leaders from followers.