AI Startup Due Diligence: Beyond the Hype
Artificial intelligence is simultaneously the most exciting and most overhyped sector in venture capital. The challenge for investors: separating genuine technical moats from "GPT wrappers" that can be replicated in a weekend.
The AI Investment Landscape in 2026
The AI sector has stratified into clear tiers:
- Foundation Models — Massive capital requirements ($100M+), winner-take-most dynamics
- AI Infrastructure — Tooling, MLOps, and compute optimization
- Vertical AI Applications — Domain-specific solutions (legal, medical, financial)
- AI-Native Workflows — Products that couldn't exist without AI (not just AI-enhanced)
- Edge AI — On-device inference for privacy-sensitive and latency-critical applications
What PV1 Evaluates in AI Startups
- Data Moat Assessment — Proprietary training data that improves with usage vs. commodity data
- Model Defensibility — Can the core model be replicated with open-source alternatives?
- Compute Economics — Inference cost per query and trajectory (must be declining)
- Workflow Integration Depth — Is the AI embedded in critical workflows or a nice-to-have overlay?
The "Wrapper" Problem
Our data shows that 78% of AI startups funded in 2024 were thin application layers on top of foundation models. These companies face existential risk when the underlying model provider (OpenAI, Anthropic, Google) adds their feature natively. PV1 specifically tests for "Platform Dependency Risk" in every AI evaluation.
Historical AI Exit Patterns
- AI acquisitions command 10-20x revenue premiums when proprietary data is involved
- 85% of AI acqui-hires are for the team, not the product — a red flag for investors
- Vertical AI companies exit at higher multiples than horizontal AI plays
Key Metrics to Evaluate
- Data Flywheel Strength: Does product usage generate proprietary training data?
- Inference Cost Trajectory: Cost per query must be declining >30% annually
- Model Accuracy vs. Baseline: Measurable improvement over open-source alternatives
- Gross Margin: >60% (compute-heavy AI companies often have SaaS-like pricing but hardware-like margins)
- Retention by Cohort: AI products often see high initial adoption but steep drop-off
Risk Factors
- Foundation Model Risk: OpenAI/Anthropic/Google adding your feature natively
- Commoditization Speed: Open-source models closing the gap rapidly
- Compute Cost Dependency: GPU shortage or price spikes can destroy unit economics
- Hallucination Liability: Enterprise customers increasingly demanding accuracy guarantees
- Regulatory Uncertainty: EU AI Act and emerging global AI regulations
Related Reading