Tag: leadership

  • AI: creating or destroying jobs?

    AI: creating or destroying jobs?

    The AI-jobs argument has split into two camps that aren’t actually arguing about the same thing.

    Jensen Huang told CEOs at GTC that firing people for AI shows “no imagination” — radiologists, he points out, are more numerous now than before AI entered radiology. Marc Andreessen calls the displacement narrative “completely fabricated” and points to Jevons Paradox: cheaper labor produces more demand, not less. The WEF Future of Jobs Report still projects net +78 million jobs globally by 2030. Challenger’s Hiring Plans index was up 157% year-over-year in March.

    A week later, Block laid off 40% of its workforce. Jack Dorsey said engineering work that needed weeks now happens in a fraction of the time. Block is still hiring AI engineers.

    So which is it?

    My read: both sides are right. They’re answering different questions about different decades. Most of the public argument is two conversations pretending to be one.

    The optimist case

    Three pieces hold it up.

    The historical record is strong. Keynes wrote in 1930 that his grandchildren would work fifteen-hour weeks. Reality 2025: OECD average is thirty-seven hours, Americans clock 1,976 hours a year. Mechanization, electrification, the computer, the internet — every general-purpose technology was forecast to end work, and every one produced more jobs than it eliminated. In 1900, 41% of Americans worked in agriculture; today it’s 2%. The jobs went somewhere.

    Jevons Paradox is real. When something useful gets cheaper, demand rises. If AI makes cognitive work twenty times cheaper, you don’t end up with one-twentieth the cognitive work. You end up with twenty times the cognitive work, deployed against far more problems. Andreessen’s “Super-PhD in every field” captures it.

    A big chunk of the labor market is hard to displace. Licensed jobs (medicine, law, accounting), unionized jobs (skilled trades, transit, public safety), and public-sector roles add up to a large fraction of US employment. Not protected because they’re irreplaceable in some technical sense — protected by institutions that move slowly.

    Each piece is correct. The question is whether they’re enough.

    Where the optimist case breaks

    Radar chart of AI capability versus observed usage across eight occupations from the Anthropic Economic Index, showing the deployment gap.
    The deployment gap: theoretical AI capability dwarfs observed usage by occupation. Source: Anthropic Economic Index.

    The Anthropic Economic Index plots theoretical AI capability against observed AI usage by occupation. The two lines look almost nothing alike — capability is broad and high; usage is narrow and concentrated. There’s a gap between what AI can do and what it’s actually doing.

    Read that gap two ways. The optimist reading: deployment is slow, friction is real, the labor market reabsorbs shocks like it always has. The harder reading: the gap is the queue — it’s where displacement comes from over the next five to ten years, not from new capability but from deployment catching up to capability that already exists.

    94% of cognitive job tasks are theoretically automatable today; 33% actually are. The space between is the transition zone. It’s not science fiction. It’s not contested. Most of it will close. Block’s layoffs sit on the second reading.

    The historical-record argument also has a footnote that doesn’t get enough weight. AI is the first general-purpose technology to automate cognitive labor at scale. Every prior wave automated muscle, then narrow categories of cognitive work — but never the universal category of “thinking and writing and analyzing and deciding.” The tractor displaced farm hands; they moved into office work. The PC displaced typewriters and clerks; they moved into knowledge work. AI doesn’t have an obvious “moved into” destination, because the destination of every prior wave is the category AI now automates.

    The TIME / Contextual AI benchmark chart makes the universality vivid. AI surpassed human-level performance on handwriting recognition around 2015, then speech, then images, reading, language, common sense, math, code generation. The rate at which new tasks fall is increasing.

    The trades-and-physical-work counterargument is weaker than it looks. Yes, 57% of jobs depend on physical presence or craft work AI can’t currently replicate. But 70% of positions inside blue-collar companies — the dispatcher, the accountant, the customer-service rep — are white-collar-adjacent and fully exposed. And if displaced knowledge workers all migrate into trades, wages collapse from saturation. Bank of America projects billions of humanoid robots by mid-century with hardware costs falling from $35,000 to under $15,000; one analyst projects robot-hours at four to six euros. Even physical work has an expiration date.

    So the optimist case is strong for a long-run answer. It’s much weaker for the next ten years.

    The displacement case

    Not “AI replaces all jobs.” That’s the optimists’ caricature, and once you reach for it the displacement case looks weak. The serious version is more specific.

    Three vertical bars on dark navy: high-skill rising, middle-skill shrinking with downward arrow, low-skill stable — the AI barbell economy.
    The barbell economy: high-skill productivity rises, low-skill stable, the middle hollows out.

    It’s structural: the middle is being squeezed. The labor market is shifting from a K-shape into a barbell. High-skill technical roles are more productive — the same Anthropic data shows code, analysis, and research at the top of the productivity-gain distribution, with usage approaching 60% of theoretical capacity. Low-skill physical roles in care, hospitality, manual handling, and trades are stable for now. The middle is shrinking: bookkeeping and paralegal work, content writing and copywriting, junior finance and analyst roles, customer service, entry-level coding, marketing copy, translation, project coordination, junior tax preparation.

    Germany has already seen roughly 90,000 AI-related job losses in the first months of 2026. The risk is not mass unemployment in aggregate. Aggregate unemployment can stay low for years while the middle hollows out. The risk is a split labor market — and a split society — in which the people who staffed the middle no longer have a clear path up or sideways.

    The Anthropic Economic Index BLS panel makes this concrete: hiring of younger workers in AI-exposed occupations has slowed, even as overall employment numbers haven’t moved much. That’s what early-stage hollowing looks like — the entry-level rung disappears first, before the established middle does.

    Five years of that compounds into something the historical record didn’t have to absorb.

    My read — the three-phase shape

    Horizontal timeline 2025 to 2040+ split into three colored zones: red displacement, amber strain, cyan abundance — AI jobs transition phases.
    Three phases of the AI jobs transition: displacement (2025-2030), strain (2030-2035), abundance (2035+).

    The clearest three-phase framing is German — chronological, not parallel.

    Phase one — displacement (~2025-2030). AI displaces knowledge work faster than the labor market rebuilds. The middle hollows. Aggregate unemployment may not move much; entry-level paths in white-collar roles narrow sharply. The optimists are right that the technology eventually creates new categories. They’re wrong about the timing.

    Phase two — strain (~2030-2035). Strain shows up in places that aren’t unemployment: tax-base erosion, weakened consumer demand, capital returns rising while labor’s share of national income falls to historic lows. Public-sector and licensed-job cushions hold initially but come under fiscal pressure. The political consequences sharpen.

    Phase three — abundance (after ~2035). The deflation the optimists describe arrives. Costs collapse across categories. What costs $100 today costs a few cents. The median 2040 lifestyle, on a flow-of-services basis, looks something like today’s high-net-worth lifestyle on every dimension except positional goods. Both Andreessen and Huang are right about the destination.

    That’s the timeframe trap. Both sides are correct on their respective horizons. The honest version of the optimist case includes the transition pain. The honest version of the displacement case includes the recovery.

    What this means for how leaders think about the next ten years: the question isn’t “do we believe in AI displacement, yes or no.” That question is roughly answered. The task is to assume real displacement in the middle, plan for it, and carry the organization through to the recovery in a way that keeps the institution and its people whole.

    Three things I’m watching

    1. Whether the entry-level signal becomes a leading indicator. The slowing of hiring for younger workers in AI-exposed occupations is, in my read, the most important early signal. Aggregate employment numbers lag; entry-level absorption leads. If the slowdown becomes a structural break, phase one stops being a forecast and becomes a measurement.
    2. Whether the licensed and public-sector cushion holds when fiscal space tightens. The structural-protection argument is strong only as long as the institutions that protect those jobs don’t themselves come under fiscal pressure. Phase two erodes the tax base. The question is whether legislatures and regulators are protecting genuinely-essential public-sector employment or post-hoc subsidizing the share of the workforce the private sector can no longer place.
    3. Whether the recovery looks like restored employment or restored income. Phase three is consistent with both. Jobs come back in new categories — the historical track record. Or they don’t come back at scale and the recovery is income-shaped: UBI-like distribution of the deflation surplus rather than wage-based participation. These look very different politically. The shape of phase two is what determines which one we get.

    No one has confidence on these three questions yet. I’m watching them because the answers will tell us, in roughly the next five years, what the transition phase actually costs.

    The destination is not in serious doubt. The road is.

  • MIT Called It a Disenchanted Intern. METR Says Check the Growth Rate.

    MIT Called It a Disenchanted Intern. METR Says Check the Growth Rate.

    Something happened this week that I keep turning over.

    MIT published findings this month showing that when 41 AI models were tested across more than 11,000 real workplace tasks, the result was, in their words, like a “disenchanted intern” — hitting minimum benchmarks about 65% of the time, but never exceeding 50% success on tasks requiring genuinely superior-quality output. If you work in software, marketing, legal services, or knowledge work of any kind, that’s the snapshot.

    METR — a nonprofit focused on measuring AI capabilities — published a different kind of snapshot. Their metric is the “time horizon”: the maximum length of autonomous task a frontier AI can reliably complete. In 2019, the best AI could handle roughly a two-minute task without human intervention. By the end of 2025, that had grown to roughly an hour. The doubling time across that whole period: around seven months.

    METR’s January 2026 update tightened that number further. Post-2023, the best estimate for the doubling period is now 130 days — closer to four months.

    My read on this:

    The MIT study and the METR data aren’t in conflict. They’re measuring different things at different timescales. MIT is taking a photograph. METR is measuring the shutter speed. And the shutter speed is getting faster.

    I don’t think the “disenchanted intern” framing is wrong — it describes today accurately. What I’m less sure about is the assumption, implicit in most of the coverage I’ve read this week, that “today” is a stable state. An intern who gets twice as capable every four months is not the same resource at the end of the year as they are today.

    What I keep returning to is the gap between the current snapshot and the trajectory — and the opportunity that opens up in that gap. The MIT data is a photograph of now. The METR data is the shutter speed. Anyone building workflows, designing teams, or structuring how they work around AI capability today is working from a reference point that will be measurably out of date within a single planning cycle. That’s an opportunity signal at a scale and pace most planning assumptions don’t account for.

    Three things I’m watching:

    1. Where the doubling curve hits friction. Every exponential eventually meets a wall — physical limits, data constraints, regulatory friction. METR’s time-horizon metric is useful precisely because it measures real-world task completion, not synthetic benchmark scores. When the doubling cadence breaks, that will be the signal that the curve has met something real. I expect that to happen. I just don’t know when.

    2. Whether “minimally sufficient” matters or not. MIT’s 65% minimally sufficient rate sounds modest. But most enterprise workflows run on people who are minimally sufficient most of the time. The threshold isn’t excellence — it’s “acceptable at scale, around the clock, at near-zero marginal cost.” That bar is lower than it sounds, and closer than the headline number implies.

    3. The infrastructure spend as an access unlock. Alphabet, Meta, Microsoft, and Amazon are projected to spend nearly $700 billion combined on AI infrastructure in 2026 — roughly double what they spent last year. That capital isn’t just building capacity for the current snapshot. It’s funding the cost compression that makes the next several capability doublings broadly accessible. When the infrastructure matures, the cost floor drops — and the surface area for building on top of it expands with it.

    The disenchanted intern framing is apt today. My expectation is that it’s a better description of 2025 than it is of 2027.

    References