Market Pulse
The compute market is bifurcating along a fault line between managed scarcity and engineered abundance. In two-provider H100 spot markets — Montreal, Stockholm, Madrid — prices are rising sharply (Montreal +96% over 30 days to $4.17/hr), driven not by a continental capacity crunch but by oligopoly repricing dynamics where the marginal low-cost provider is lifting its floor off an April trough. Meanwhile, single-provider and over-capacitated H100 corridors (Dublin at $1.27/hr, Kuala Lumpur at $0.94/hr) are drifting lower, confirming that global H100 supply is not tight — distribution of that supply is. Layered over this, AWS is visibly cycling Inferentia spot inventory in South Asia and Tokyo with a synchronized flush-and-refill pattern that has narrowed the spot discount to just 47–51% of ON_DEMAND in Mumbai and Tokyo — while Ohio Inferentia spot drifts to near-zero, revealing this as regional fleet management, not a global monetization shift. The structural story remains unchanged: power, not silicon, is the binding constraint on AI compute expansion, and operators who already hold grid capacity are compounding a moat that interconnection queues now measure in decades, not years.
Key Movers
| Component | Region | Type | Price ($/hr) | 24h Δ | 7d Δ | 30d Δ | Flag |
|---|---|---|---|---|---|---|---|
| Inferentia | Mumbai | SPOT | $0.103 | — | — | +398% | AWS fleet flush→refill; spot discount narrowed to 51% of OD |
| Inferentia | Tokyo | SPOT | $0.224 | — | — | +327% | Same flush pattern; now above Jan baseline; 47% OD discount |
| Trainium | Ohio | SPOT | $0.099 | — | — | +300% | Corroborates ASIC rebalancing; not a demand surge |
| H100 | Montreal | SPOT | $4.17 | — | +22.5% | +96% | Neocloud repricing off Apr-7 trough; spread 133% vs. 44% floor |
| V100_32GB | Tokyo | SPOT | $0.684 | — | — | +145% | Legacy GPU demand spike; Japan AI investment absorbing old capacity |
| H100 | Stockholm | SPOT | $5.20 | — | — | +60% | EU corridor tightening; 2-provider dynamic similar to Montreal |
| T4 | Iowa | ON_DEMAND | est. +72% | — | +72% | — | Mid-tier US Midwest capacity event; 2-provider spread ~137–145% |
| SSD IOPS | 8 regions | ON_DEMAND | ↓ ~92–94% | 0% | 0% | −92% | Policy change, not market drift — single provider restructured IOPS billing; ignore for GPU cost models |
| Inferentia | Hong Kong | SPOT | ~$0.009 | — | — | −93% | Opposite leg of Mumbai/Tokyo rebalance; AWS redirecting capacity |
| H100 | Dublin | SPOT | $1.27 | — | — | −50% | Single-provider oversupply; not a signal — noise from thin market |
Thin-market caveat: H100 SPOT in KL ($0.94/hr) and Dublin ($1.27/hr) have low observation counts and single-provider coverage. Moves in these markets should be treated as provider-specific decisions, not price discovery signals.
Investable Insights
H1 — Montreal H100 Is a Neocloud Repricing Story, Not a Continental Tightening Signal — But the Microstructure Trade Is Real Confidence: 3 / 5
Thesis: The 90-day Montreal H100 SPOT history exposes a market structure more interesting than the headline number suggests. From January 12 through April 8 — 95 consecutive days — the spread between two providers locked at exactly ~196%: one holding near $7.46/hr, the other at ~$2.52/hr, both immobile. This is textbook oligopoly standoff — neither provider has incentive to blink first. On April 9, the low-cost provider capitulated downward, reaching a nadir of $2.13/hr on May 7. Since then, a 28-day uninterrupted ascent to $4.17/hr has re-expanded the spread to 133%. The "cheap leg" has risen from $1.75/hr to ~$2.51/hr; the "expensive leg" (likely AWS at near-ON_DEMAND rates) sits at ~$5.84/hr. Critically, the US-Virginia H100 SPOT is essentially flat at $2.51/hr with only a +5.4% 30-day move and a narrowing spread of 26.8% — the continental tightening narrative doesn't hold where AWS capacity actually lives. What is real: the second provider in any two-provider H100 market sets the marginal spot price, and identifying who controls the cheap leg in Montreal (most likely CoreWeave or Lambda Labs) is the highest-value piece of intelligence for this trade.
Key Evidence:
- H100 SPOT Montreal at $4.17/hr, +96% over 30 days, +22.5% over 7 days — momentum unbroken since May 7 trough (live ticker data, June 4, 2026)
- Provider spread: 43.8% at the May 7 trough → 133% today, reflecting the expensive provider anchoring at ~$5.84/hr while the cheap provider re-prices upward (live ticker, June 4)
- H100 SPOT US-Virginia at $2.51/hr, +5.4% over 30 days, spread narrowing to 26.8% — the real US capacity bellwether is quiet (live ticker, June 4)
- Montreal ON_DEMAND holds flat at $7.68/hr, 0.0% delta across all horizons — hyperscaler capacity is not being tightened; this is neocloud-driven (live ticker, June 4)
- H100 SPOT Stockholm +60% (30d), Madrid +39% (30d) mirror the Montreal pattern — suggesting this is a multi-region neocloud repricing dynamic, not isolated noise
- H100 SPOT Dublin −50% (30d, $1.27/hr) and KL at $0.94/hr confirm that single-provider markets move in the opposite direction — supply is not globally tight
- CoreWeave (CRWV) at $110.93, −11.6% over 1 month, −7% today — the market is not yet rewarding H100 operators for this repricing (equity data, June 4)
Implied Action: This is a pricing intelligence trade more than an equity trade. Monitor the Montreal provider spread as the primary trigger: if the spread compresses below 80% while the median continues rising (i.e., the cheap provider approaches the expensive provider's level), that confirms genuine demand absorption — upgrade to 4-star and consider CRWV long on next earnings (August 12). If the spread widens further with the cheap provider plateauing, the repricing is a single-operator decision and the thesis reverts. Avoid reading Montreal as a continental signal: the Virginia flat-line is the veto. For cloud buyers, the actionable implication is simpler — if you're procuring H100 spot in Montreal, you are currently buying the repricing cycle; migrate to Dublin ($1.27/hr) or KL ($0.94/hr) where single-provider oversupply hasn't cleared yet. The 4x regional spread for identical hardware is an arbitrage that will close — direction TBD.
H2 — AWS Is Actively Cycling Inferentia Spot Inventory to Reset the Price Floor — Regional Fleet Management, Not Global Monetization Confidence: 3 / 5
Thesis: The AWS Inferentia spot data contains the most forensically rich pricing signal in the current dataset. The Mumbai series shows three distinct regimes in 90 days: a stable $0.088–0.098/hr baseline through February 20, a sudden overnight +48% step-change to $0.137/hr on February 21 (sustained for ~10 weeks), then a synchronized near-zero flush ($0.010–0.021/hr) across both Mumbai and Tokyo simultaneously from May 3–11, followed by explosive recovery to current levels ($0.103/hr Mumbai, $0.224/hr Tokyo). That May 3–11 dual-market flush is the smoking gun: it is operationally impossible for two geographically separate markets to simultaneously drop 90%+ on organic demand collapse — this is AWS pulling spot inventory from both pools simultaneously, then re-releasing it at a higher floor. The mechanism is regional capacity rebalancing within AWS's global Inferentia fleet, almost certainly to direct capacity toward higher-monetization use cases or to maintenance-window fleet blocks. The critical qualifier — and what prevents this from being a 4-star thesis — is the Ohio signal: Inferentia SPOT in Ohio, AWS's most important US inference home region, sits at $0.013/hr and is declining at −34% over 7 days. AWS does not rebalance away from its home region without reason; either Ohio has structural oversupply or AWS is using Ohio as a pressure-relief valve while managing prices in international markets.
Key Evidence:
- Inferentia SPOT Mumbai: $0.103/hr today, +398% over 30 days — from a May 7 trough of $0.010/hr, a 10x recovery in 28 days (live ticker, June 4)
- Inferentia SPOT Tokyo: $0.224/hr today, +327% over 30 days — from a May 7 trough of $0.038/hr; today's price exceeds the January baseline ($0.120/hr) by 87% (live ticker, June 4)
- Synchronized dual-market flush: Mumbai and Tokyo both collapsed to near-zero simultaneously in the May 3–11 window — statistically incompatible with organic demand dynamics; confirms deliberate inventory management (90-day history, June 4)
- Inferentia ON_DEMAND: $0.209/hr in Mumbai, $0.425/hr in Tokyo — zero delta across 24h, 7d, and 30d in every region. This artificial flatness over 30+ days is a managed price, not a market-clearing rate (live ticker, June 4)
- Hong Kong Inferentia SPOT −93% over 30 days (~$0.009/hr) — the mirror image of Mumbai/Tokyo; AWS is draining HK inventory while inflating AP-South and AP-Northeast (live ticker, June 4)
- Inferentia SPOT Ohio: $0.013/hr, −34% over 7 days, −10% over 24h — declining in the core US home region, which falsifies a global repricing thesis but is consistent with regional rebalancing (live ticker, June 4)
- Trainium SPOT Ohio: $0.099/hr, +300% over 30 days — corroborates ASIC inventory tightening, but note Trainium is a different workload profile (training, not inference) (live ticker, June 4)
- Marvell Technology 10-Q filed May 28 — data center as key segment with geographic revenue concentration in US, China, Taiwan; custom ASIC supply chain remains the structural beneficiary of this trend (SEC filing, May 28, 2026)
Implied Action: For cloud buyers running inference on AWS Inferentia: the May 3–11 near-zero window pattern is a recurring operational event — implement automated fallback to GPU-based inference (T4, A10G, or G4dn) triggered by Inferentia spot price rising above 60% of ON_DEMAND. The Mumbai and Tokyo windows currently at 47–51% OD discount are approaching that threshold. For investors: the Inferentia rebalancing pattern is most cleanly expressed via long Marvell (MRVL) as the custom ASIC silicon design beneficiary — not as an Inferentia-specific play, but as the broader hyperscaler trend toward proprietary silicon at inference scale. The Trainium/Inferentia dynamic is AWS-internal and not directly investable; the spillover into merchant silicon is. Avoid short Cerebras (CBRS) here — CBRS's −19% over 5 days likely reflects its own business challenges, not an AWS pricing decision.
H3 — Power-Secured Incumbents Hold an Irreplaceable Moat; The Greenfield Supply Pipeline Is Largely Fictional on a Relevant Timescale Confidence: 5 / 5
Thesis: This is the single highest-conviction structural call in the current data stack — and unlike the other hypotheses, it has no credible falsification scenario on a 6–24 month horizon. The regulatory and grid intelligence is now so convergent it constitutes an operating environment rather than a risk factor: PJM capacity market costs have risen $23 billion since 2025, the 2027/2028 auction faces a 6.5 GW projected shortfall (up from 208 MW just one auction cycle earlier), Northern Virginia grid connections now carry a 14-year estimated queue timeline, and NERC's newly issued "Reliability Guideline on Risk Mitigation for Emerging Large Loads" formally categorizes hyperscale datacenter loads as grid-critical with new compliance requirements for frequency and voltage settings. The 3,300+ project CAISO queue and Pennsylvania's Governor filing a FERC complaint specifically naming generative AI as a driver remove any ambiguity — the US East interconnection system is functionally broken for new entrants. Meanwhile, 14 states are considering legislation to ban or pause new datacenter construction, and Texas's 82 datacenter campuses — including OpenAI's Stargate and Google's $40B commitment — face rising political opposition from the 65% of Americans who poll as opposing datacenter siting in their communities. The equity data confirms the thesis is playing out in real time: Equinix (EQIX) at $1,077 holds near its 52-week high ($1,128) and has outperformed Digital Realty (DLR) by approximately 11 points over 3 months — exactly the divergence predicted by a moat that favors multi-tenant colocation incumbents over hyperscale-growth-dependent operators.
Key Evidence:
- PJM capacity market costs +$23B since 2025; 2027/28 auction shortfall projected at 6.5 GW (vs. 208 MW prior cycle) — sourced from Data Center Knowledge, June 3, 2026
- Northern Virginia datacenter grid connections: estimated 14-year queue for new interconnection (Forbes, May 31, 2026)
- NERC "Reliability Guideline on Risk Mitigation for Emerging Large Loads" — formalizes datacenter operators as grid-critical entities; new compliance cost layer for all new builds (Intel Feed, NERC filing)
- CAISO public interconnection queue: 3,300+ projects awaiting approval; median delay stretching beyond economic equipment life (Intel Feed, May 7, 2026)
- 14 states considering datacenter legislation; Maine passed a ban (governor vetoed); Florida passed SB 484 prohibiting utility cost pass-through to residential ratepayers (Federal Regulatory Intel Feed)
- ComEd (Northern Illinois) implementing 12% consumer bill increases directly attributed to AI datacenter load — politically combustible in Midwest markets where most greenfield capacity is planned (AOL, May 31, 2026)
- IEA: global datacenter electricity consumption projected to double to 945 TWh by 2030 — exceeding Japan's entire national consumption; University of Michigan confirms renewables cannot meet uptime requirements at hyperscale
- EQIX at $1,077: +0.49% today, +11.1% over 3 months, near 52-week high — analyst target $1,077 (consensus inline); 74x trailing P/E reflects power moat premium (equity data, June 4)
- DLR at $183.50: −7.6% over 1 month, −4.3% over 5 days, −2.0% today; approaching the lower half of its 52-week range ($146–$208); analyst target $218 implies 19% upside but market is pricing execution risk (equity data, June 4)
- Applied Digital (APLD): −6.5% single session, −8.7% over 5 days; beta 5.7; greenfield Texas exposure; exactly the risk profile this thesis predicts will derate (equity data, June 4)
Implied Action: The structural long is well-known but still has legs on duration: EQIX at 74x trailing P/E is expensive but defensible if the 14-year Virginia interconnection timeline proves accurate rather than the market's implicit 5–7 year assumption — the incremental bull case is that the bottleneck is worse than consensus, which is a high-probability outcome given PJM queue dynamics. DLR at $183 vs. $218 analyst target is the more interesting entry — it has sold off on execution concern, but if the power moat thesis is right, DLR's hyperscale-growth posture actually amplifies the incumbency advantage. The asymmetric short is APLD: beta 5.7, greenfield Texas exposure, and a business model requiring continuous capacity addition that now faces 14-year queue timelines. The compute pricing implication: H100 and A100 ON_DEMAND prices in US-East and EU-West have a structural floor — even if AMD Helios ships on time, you cannot deploy new AMD capacity without first solving the power problem. Any greenfield supply announcements should be discounted by at least 50% on timeline until power contracts are signed and grid connections are confirmed.
Get the Weekly Pulse
This analysis is part of Signwl's weekly research. Subscribe free — Tuesday delivery, no spam.
Risk Flags
Immediate (0–30 Days)
R1 — Montreal H100 SPOT Spread May Widen Further Without Demand Confirmation The current 133% spread between Montreal providers masks a fragile equilibrium. If the cheap provider (likely a neocloud) hits its own utilization ceiling and pauses repricing, the median will stall while the spread appears to widen — creating a false "tightening" signal for anyone watching the median alone. Trigger to watch: any 3-consecutive-day flat-or-down close in Montreal median while the cheap provider's estimated floor ($2.51/hr) stops rising. That's the sign the repricing is exhausted, not confirming.
R2 — AWS Inferentia Spot Floor Is Not Permanent — Ohio Is the Warning Sign The May trough event in Mumbai and Tokyo lasted only 8 days before the floor was re-established at a higher level. But Ohio's steady decline to $0.013/hr and −34% 7d move is a live counter-signal. If Mumbai and Tokyo begin declining from their current levels (Mumbai $0.103, Tokyo $0.224) before reaching the prior February highs ($0.137 and $0.209 respectively), the repricing cycle has likely peaked and the next flush could take prices back to near-zero. Cloud buyers: don't anchor to current Inferentia spot rates for budget modeling.
R3 — SSD IOPS 92% Collapse Requires Cost Model Recalibration The synchronized ~92–94% drop in provisioned IOPS pricing across 8 regions (Virginia, London, Paris, Tokyo, Stockholm, São Paulo, Cape Town, Dublin) is a policy change, not market drift — zero 24h and 7d deltas confirm the reset has already occurred. Any GPU instance cost models that include attached high-IOPS EBS/Persistent Disk pricing need to be recalculated. The GPU-per-hour cost is unchanged; total instance cost of ownership for IO-intensive inference workloads has dropped materially and models that haven't updated will be quoting inflated figures.
Near-Term (30–90 Days)
R4 — AMD Helios Launch Timing Risk Cuts Both Ways AMD is trading at $542 today, +168% over 3 months, within 0.7% of its 52-week high of $546.15 — the stock is pricing near-flawless Helios execution. Sell-side consensus at $479 is already 13% below spot, meaning analyst models haven't caught up with the momentum. The MI450/Helios commercial launch is guided for Q3–Q4 2026 with Q2 earnings on August 4. Any commentary suggesting delays into Q1 2027 (TSMC capacity constraints, customer qualification cycles) would be a −20–30% event from current levels given the multiple expansion entirely attributable to Helios timing expectations. Conversely, H100 SPOT pricing data provides no corroboration that demand substitution is already occurring — Montreal, Stockholm, and Madrid are all rising, the exact opposite of AMD demand displacement. The H100 ON_DEMAND depreciation model shows zero computed annual decay rates for current-gen GPUs, consistent with no observable price erosion since H100 launch. The AMD equity long survives as a fundamental/momentum trade into Q2 earnings; the H100 short does not.
R5 — Neocloud Margin Squeeze Emerging as Spot-to-Reserved Spread Compresses CoreWeave (CRWV) at $110.93 is −11.6% over 1 month and −7% today, trading at a negative forward P/E (−159x) that requires sustained high utilization and stable spot pricing to reach profitability. The Montreal H100 SPOT repricing toward the 1YR Reserved floor ($7.02/hr) is a margin compression signal for neoclouds that sell short-term GPU capacity above their reserved cost basis — the premium they can charge over their own cost is narrowing. If spot prices converge to within 10% of the 1YR Reserved rate in key markets, neocloud economics begin to mirror hyperscaler commodity margins. CRWV's August 12 earnings are the next hard catalyst; current trajectory suggests below-consensus utilization metrics.
Structural (6–24 Months)
R6 — Power Grid Paralysis Will Create Regional Pricing Floors That Cannot Be Arbitraged Away The PJM shortfall, CAISO queue paralysis, and NERC compliance requirements are not temporary market frictions — they are multi-year infrastructure constraints that will create permanent regional price stratification in GPU compute. US-East regions (Virginia, Ohio) with existing hyperscaler campuses will structurally command pricing premiums over EU and APAC corridors where power infrastructure exists or is being built. The current 4x spread between Dublin H100 SPOT ($1.27/hr) and Montreal ($4.17/hr) is already a preview of this stratification. Over 6–24 months, this will make cross-region GPU arbitrage increasingly irrelevant for cloud buyers who need low-latency proximity to US-East data — they will pay the premium or build their own capacity, the latter of which faces the same queue timelines.
R7 — Intel HBM-Free AI Chip Could Disrupt Memory Supply Chain Assumptions Intel's reported development of an AI chip that skips HBM memory entirely (TheStreet, June 3) is an 18-month+ risk, but if technically viable it restructures the memory economics underpinning current NVIDIA GPU pricing. SK Hynix and Micron have built their datacenter memory businesses around the assumption that HBM3E demand scales with H100/B200/H200 adoption — a breakout HBM-free inference chip (from Intel or any credible vendor) would compress server DRAM demand back toward commodity pricing and reduce NVIDIA's component cost moat. This remains highly speculative; the more likely near-term memory story is continued HBM tightness as B200/H200 ramps absorb SK Hynix and Micron's advanced packaging capacity through 2026.
R8 — AWS Custom Silicon Ecosystem Lock-In Amplifies Enterprise Cost Risk The Inferentia/Trainium ON_DEMAND pricing flatness is not a guarantee of future stability — it is the current managed rate. AWS's Neuron SDK compilation pipeline creates significant migration friction: enterprises that have compiled and optimized models specifically for Inferentia hardware cannot trivially redeploy on NVIDIA or AMD silicon without recompilation, re-testing, and potentially retraining. As AWS tests higher spot floors in international markets, the revealed elasticity of this locked-in customer base becomes a pricing power data point. If AWS observes low price sensitivity (i.e., Mumbai/Tokyo customers don't flee to GPU alternatives despite 47–51% spot discounts narrowing), the structural case for an ON_DEMAND rate increase strengthens. Enterprises with meaningful Inferentia inference spend should proactively qualify a GPU-based fallback path before the lock-in is monetized.
All prices and deltas as of June 4, 2026. Ticker data from cross-provider aggregation model (AWS, Azure, GCP, Oracle). News sources cited inline. This brief is for analytical and informational purposes only.