Deep AI Due Diligence
For investors backing companies that buildAI. When the thesis rests on the model itself, the question is simple: is the AI a genuine, defensible advantage - or a thin wrapper around someone else's? We answer that with evidence, across the full model lifecycle.
~40%
of European 'AI' startups showed no evidence of material AI use (MMC Ventures, State of AI, 2019)
>80%
of AI projects fail - about twice the rate of non-AI IT projects (RAND Corporation, 2024)
>280x
fall in the cost to run a GPT-3.5-quality query in ~18 months - model access is commoditizing (Stanford AI Index, 2025)
50-60%
typical AI gross margins, vs. 70-90% for classic SaaS (a16z, 2020)
Is the AI defensible?
Defensibility - moat or wrapper?
- Whether core value depends on proprietary models, proprietary data, or just a public API
- Strength of any proprietary data advantage and how it compounds with usage
- Real switching cost for a competitor to replicate the capability
- Model-provider concentration and lock-in - exposure to their pricing, terms, and rate limits
IP & data provenance - who owns it?
- Ownership of models, weights, prompts, agents, and fine-tuning datasets
- Provenance and licensing of training data, including open-weight licence obligations
- Copyright and IP exposure arising from training-data sources
- Rights to use customer data for training, and the consent and privacy basis behind it
Is the ML real?
Datasets & data pipeline
- Sourcing, curation, labeling quality, and deduplication
- Train/eval contamination and leakage checks
- Coverage, representativeness, and known gaps
- PII/safety filtering and reproducibility of the data pipeline
Training & alignment
- Base-model strategy: pre-train from scratch vs. adapt an existing model - and whether the spend/claims are credible
- Supervised fine-tuning (SFT) data and method; parameter-efficient tuning (LoRA/QLoRA)
- Preference optimization & alignment: RLHF (reward model + PPO), RLAIF, DPO
- Whether the team has the preference data and infra to sustain a custom-model story
Evaluation & metrics
- Offline benchmarks and task-specific metrics
- Hallucination/factuality and safety evaluations
- Human eval rigor and inter-rater agreement
- Regression testing / CI for models, plus online metrics and A/B testing
Inference, cost & MLOps
- Serving stack, latency, and quantization/optimization
- Cost per request/token and how it scales - direct margin impact
- Experiment tracking, data/model versioning, and retraining pipelines
- Drift and production monitoring, and the data-flywheel/feedback loop
Deep AI Due Diligence is a specialist engagement, scoped to the target's stack and stage. Findings land in a board-ready report and red-flag list. Book a call and we'll scope it with you.
Investing in a company that simply uses AI? See SaaS Due Diligence.