PM OS
Module 2 · Foundation · 110 min

Empathy & Discovery

Earn the right to build by separating real demand from articulated wishes through continuous, evidence-led discovery.

Jobs-to-be-Done (JTBD) · Outcome-Driven Innovation · Continuous discovery · Opportunity Solution Tree · Switch interview / 4 forces · User interviewing fundamentals · Journey & service blueprints · Survey design pitfalls · Risk-Assumption mapping · Triangulation

Explainer

Discovery is the work of earning the right to build. Most product failures are not execution failures; they are demand failures dressed up in shipped features. Strong discovery rejects the comfortable shortcuts — surveys without behavior, focus groups, road-show demos for validation — and replaces them with a tight loop: interview, observe, analyze, and triangulate qualitative evidence with quantitative signal until the team is honestly less wrong than it was last week.

1. Stated Preference vs Revealed Behavior

Users are unreliable forecasters of their own behavior. They will tell you they want a feature they will never use, that they would pay for something they never will, and that they hate a UI they keep returning to. Discovery is the discipline of trusting what users do over what they say. The interview is a tool to reconstruct past behavior, not to predict future behavior.

  • Past behavior > stated intent > hypothetical preference, in that order.
  • When you must ask hypotheticals, ask about a specific recent moment, not a general future.
  • If a user says 'I would definitely pay for this', ask 'how much have you paid for the workaround you use today?'

2. Jobs-to-be-Done — Three Schools

JTBD is not one thing; there are three overlapping schools. Conflating them is the most common error PMs make. Pick the school that matches your decision and stick with its vocabulary.

  • Christensen / Switch (Bob Moesta, Chris Spiek): focuses on the moment of switching from an old solution to a new one. Vocabulary: anxieties, habits, push, pull, the four forces.
  • Outcome-Driven Innovation (Tony Ulwick): functional jobs decomposed into measurable desired outcomes scored on importance and current satisfaction. Vocabulary: opportunity score = importance + max(importance - satisfaction, 0).
  • Strategyzer (Alex Osterwalder): jobs as the input to value proposition design — functional, emotional, and social jobs paired with pains and gains.
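
Worked with hypothetical numbers, Ulwick's formula behaves like this: an outcome rated importance 8 but satisfaction 3 scores 8 + max(8 − 3, 0) = 13 (under-served), while one rated importance 6 and satisfaction 9 scores 6 + max(6 − 9, 0) = 6, because the max(x, 0) clamp keeps over-served outcomes from scoring below their importance.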

3. The Switch Interview & The Four Forces

The Switch interview reconstructs the moment a user moved from one solution to another. The framework: push of the current situation, pull of the new solution, anxieties about switching, and habits of the present. Demand exists when push + pull > anxieties + habits; the sketch after the list below makes this inequality concrete. Strong PMs probe each force explicitly.

  • Push: what specifically broke about the prior workflow? When? With whom?
  • Pull: what about the new option crossed the threshold of 'good enough to try'?
  • Anxiety: what worried you about adopting? Cost, risk, social risk, learning curve?
  • Habit: what kept you on the old way for so long even after the push appeared?
  • If anxieties or habits are larger than push + pull, demand is theoretical, not real.
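
A minimal sketch of the inequality, assuming hypothetical 1–5 force ratings distilled from interview notes; the scale and the example values are invented for illustration, since real synthesis stays qualitative:

```python
# Hypothetical 1-5 force ratings distilled from switch-interview notes.
forces = {"push": 4, "pull": 3, "anxiety": 3, "habit": 4}

progress = forces["push"] + forces["pull"]        # forces driving the switch
resistance = forces["anxiety"] + forces["habit"]  # forces blocking it

if progress > resistance:
    print(f"Demand looks real: {progress} > {resistance}")
else:
    print(f"Demand is theoretical: {progress} <= {resistance} "
          "- reduce anxiety or break the habit before betting on adoption")
```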

4. User Interviewing Mechanics

Most PMs overestimate how good their interviews are. The biggest issues: leading questions, pitching the solution, asking users to design, and accepting articulated reasons at face value. A good interview is structured but not scripted — it reconstructs behavior in vivid detail.

  • Open with a recent specific moment: 'walk me through the last time this happened.'
  • Probe the timeline minute by minute; don't accept summary statements.
  • Listen for energy changes — sighs, accelerations, pauses — and follow them.
  • Ask about workarounds before pitching solutions; the workaround is the price of admission.
  • Never end an interview without asking 'what didn't I ask that I should have?'
  • Record (with consent) and tag transcripts; memory compresses everything within 48 hours.

5. Continuous Discovery as a Weekly Habit

Teresa Torres's continuous discovery practice replaces the 'discovery phase' with a weekly cadence. Goal: the product trio (PM, design, eng tech-lead) talks to at least one user per week, every week, and updates an Opportunity Solution Tree against a clear desired outcome.

  • Recruit a continuous panel; don't restart sourcing every project.
  • Hold a weekly trio sync to update the tree, not to ask permission.
  • Treat any week without an interview as a discovery debt — name it and pay it down.
  • Discovery and delivery interleave; one is not the gate to the other.

6. Opportunity Solution Tree

An OST visualizes the chain from desired outcome → opportunities (problems / unmet needs surfaced from research) → candidate solutions → assumption tests. Read top-down, you can see why each solution is being considered. Read bottom-up, you can see that every test ladders up to a real user opportunity tied to the team's outcome. A minimal data sketch follows the list below.

  • One outcome at the root; multiple opportunities; multiple solutions per opportunity; multiple tests per solution.
  • Resist solutions that don't ladder to a named opportunity.
  • Use the tree to argue with: 'is this opportunity bigger than that one?', 'which solution best targets this opportunity?'
  • Refresh weekly with new evidence; prune branches that lose evidence.
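
A minimal sketch of the tree's shape as nested data; the dataclass names, the outcome, and the example branch are invented for illustration:

```python
from dataclasses import dataclass, field

@dataclass
class AssumptionTest:
    assumption: str
    status: str = "planned"        # planned | running | passed | failed

@dataclass
class Solution:
    name: str
    tests: list[AssumptionTest] = field(default_factory=list)

@dataclass
class Opportunity:
    statement: str
    solutions: list[Solution] = field(default_factory=list)

@dataclass
class Tree:
    outcome: str                   # exactly one outcome at the root
    opportunities: list[Opportunity] = field(default_factory=list)

tree = Tree(
    outcome="Increase week-2 retention of newly created teams",
    opportunities=[
        Opportunity(
            statement="New admins can't tell whether teammates actually onboarded",
            solutions=[
                Solution(
                    name="Onboarding progress digest",
                    tests=[AssumptionTest("Admins will open a weekly digest email")],
                ),
            ],
        ),
    ],
)
```

Walking from any leaf test back up through its solution and opportunity to the root outcome recovers exactly the bottom-up reading described above.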

7. Risk-Assumption Mapping

Marty Cagan separates four risk types every product idea faces: Value (will users use it?), Usability (can they use it?), Feasibility (can we build it?), and Viability (does it work for the business — legal, sales, support, finance?). Discovery is the practice of identifying which of these risks dominates each idea, then designing the cheapest test that retires that risk.

  • Value risk dominates new categories and new audiences.
  • Usability risk dominates radical UI changes.
  • Feasibility risk dominates technically novel work (ML, real-time, scale).
  • Viability risk dominates regulated or pricing-sensitive areas.

8. User Journey Maps & Service Blueprints

Journey maps walk a single user persona through a real workflow over time, capturing actions, thoughts, emotions, and moments of friction. Service blueprints extend the journey to include backstage processes — what the company, systems, and policies have to do for the front-stage moment to work. Most B2B and ops-heavy products fail backstage, not at the screen. A sketch of one step record follows the list below.

  • Build the map from a real recorded session, not a workshop guess.
  • Mark each step with: action, thought, emotion, friction, time elapsed.
  • For service blueprints, add backstage actions, support systems, and SLAs.
  • Highlight 'moments of truth' where the experience succeeds or breaks.
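
One step of a journey map as a record, with the service-blueprint layer attached; the field names mirror the list above, and the record shape and example values are invented:

```python
from dataclasses import dataclass

@dataclass
class JourneyStep:
    action: str
    thought: str
    emotion: str              # e.g. "confident", "resigned", "frustrated"
    friction: str | None      # None when the step is smooth
    seconds_elapsed: int
    backstage: list[str]      # blueprint layer: backstage actions, systems, SLAs

step = JourneyStep(
    action="Exports the report to a spreadsheet",
    thought="Why isn't this already in the Monday email?",
    emotion="resigned",
    friction="Manual export repeated every week",
    seconds_elapsed=210,
    backstage=["reporting service renders the CSV", "SLA: export completes in < 60 s"],
)
```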

9. Survey Design and Why Most Surveys Lie

Surveys are popular because they scale; they are misused because they're easy. Most product surveys suffer from selection bias (only enthusiasts respond), social desirability bias (respondents tell you what they think you want to hear), and order effects (the question above leaks into the answer below). Surveys are useful as quantification of a hypothesis you've already developed qualitatively — almost never as discovery from scratch.

  • Use surveys to size, not to discover.
  • Avoid 'would you use' and 'how much would you pay' — both are unreliable.
  • Anchor questions to behavior in the last week, not preferences in the abstract.
  • Pilot with 10 users before sending; 80% of survey-design errors are caught in the first 5 responses.
  • Always include an open-ended question; the verbatims are usually more useful than the rating scales.

10. Triangulation: Qual + Quant + Market

Single-source insights are dangerous. Triangulate by combining qualitative interviews (why), quantitative analytics (how big), and market evidence (does the world support this trend?). When all three agree, conviction is high. When they disagree, follow the disagreement — that's where the most useful learning lives.

  • Qualitative tells you what's happening and why.
  • Quantitative tells you how big and how often.
  • Market evidence (competitors, adjacent industries, primary indicators) tells you whether the wind is at your back or in your face.
  • Document each source with its confidence level; don't average them, weight them.
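
One way to make 'weight, don't average' concrete; the signal range [−1, 1], the confidence weights, and the example values are invented conventions for illustration, not a standard method:

```python
sources = [
    # (source, directional signal in [-1, 1], confidence weight in [0, 1])
    ("interviews (why)",            +0.8, 0.7),
    ("product analytics (how big)", +0.3, 0.9),
    ("market evidence (tailwind?)", -0.4, 0.5),
]

conviction = (sum(signal * weight for _, signal, weight in sources)
              / sum(weight for _, _, weight in sources))
print(f"weighted conviction: {conviction:+.2f}")

# The disagreement itself (positive interviews vs a market headwind) is the
# most useful place to dig, not something to smooth over.
```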

11. Discovery Output: The Insight Library

Discovery without an artifact decays inside one PM's head. The team needs a shared insight library — tagged interview snippets, opportunities, evidence — that survives PM turnover and accumulates across projects. Even a simple shared doc with consistent tags beats brilliant siloed memory; a minimal record sketch follows the list below.

  • Tag every quote with: persona, job, opportunity, severity, confidence.
  • Make the library searchable; if you can't find a tagged quote in 30 seconds, the system has failed.
  • Review the top opportunities every quarter as the OST root candidates for the next quarter.
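
A minimal sketch of an insight record and the 30-second retrieval test; the schema and the example row are illustrative, not a prescribed format:

```python
from dataclasses import dataclass

@dataclass
class Insight:
    quote: str
    persona: str
    job: str
    opportunity: str
    severity: int        # e.g. 1 (annoyance) to 5 (blocker)
    confidence: str      # low | medium | high
    source: str          # interview id, support ticket, NPS verbatim

library = [
    Insight(
        quote="I export everything to a spreadsheet before the Monday review",
        persona="ops lead",
        job="prepare the weekly business review",
        opportunity="reduce the spreadsheet round-trip",
        severity=4,
        confidence="high",
        source="interview-0312-a",
    ),
]

# The retrieval test: filter by tag, not by memory.
hits = [i for i in library if i.opportunity == "reduce the spreadsheet round-trip"]
```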

Framework atlas

Reference cards for each method in this mission

Each card covers when to deploy the method, misuse patterns, sequencing guidance, and (where relevant) shorthand formulas.

Switch / Forces of Progress · Discovery interview · Bob Moesta & Chris Spiek (The Re-Wired Group)

A structured interview format that reconstructs a user's switch from an old solution to a new one, mapped to the four forces: push, pull, anxiety, habit.

When to use

  • Understanding why users adopted (or didn't adopt) a category.
  • Pricing and packaging research.
  • Churn and switching investigations.

When not to

  • Pure usability testing of a specific UI.
  • Discovery for an entirely new behavior with no prior solution to switch from.

How to apply

  1. Recruit users who switched within the last 90 days.
  2. Anchor the conversation to the day of the switch.
  3. Probe push, pull, anxiety, habit explicitly with timeline questions.
  4. Identify the moment of first thought, struggle, deciding event, and consumption event.
  5. Synthesize forces and map to demand strength.

Pitfalls / anti-patterns

  • Asking about preference instead of behavior.
  • Letting users summarize ('I just got tired of it') instead of reconstructing the timeline.

Opportunity Solution Tree (OST) · Discovery synthesis · Teresa Torres

Visual structure connecting a desired outcome to opportunities (problems), candidate solutions, and the experiments that test them.

When to use

  • Continuous discovery teams aligning on what to learn next.
  • Quarterly planning where the outcome is fixed but the path is not.

When not to

  • When the team has not yet defined a clear outcome — fix that first.

How to apply

  1. State the outcome at the root.
  2. Surface opportunities from research; cluster duplicates.
  3. Map candidate solutions under each opportunity.
  4. Define the riskiest assumption per solution and an experiment to test it.
  5. Update weekly with new evidence; prune unsupported branches.

Pitfalls / anti-patterns

  • Stuffing every feature request as a 'solution' regardless of opportunity fit.
  • Letting the tree become a static slide deck instead of a living doc.

Outcome-Driven Innovation (ODI) · Discovery quantification · Tony Ulwick

Decomposes a job into outcome statements (e.g. 'minimize the time it takes to identify duplicate contacts'), then surveys users on importance and current satisfaction. Opportunity score = importance + max(0, importance - satisfaction).

Opportunity = Importance + max(0, Importance − Satisfaction)

When to use

  • Mature markets where the job is well understood and you need to find under-served outcomes.
  • Roadmap arbitration when many possible directions look equally appealing.

When not to

  • Brand-new categories where the user can't articulate the job yet.

How to apply

  1. Identify the core functional job through interviews.
  2. Decompose into outcome statements (verb + object + clarifier, e.g. 'minimize time to ___').
  3. Survey 100+ users for importance (1-10) and satisfaction (1-10).
  4. Compute the opportunity score per outcome (see the sketch after these steps).
  5. Prioritize outcomes with score > 12 as under-served opportunities.
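
A sketch of steps 3–5 in code; the importance/satisfaction survey means are invented (back-solved so the totals line up with the worked ODI example later in this module), and a real study would compute them from 100+ responses:

```python
def opportunity_score(importance: float, satisfaction: float) -> float:
    # Opportunity = Importance + max(0, Importance - Satisfaction)
    return importance + max(0.0, importance - satisfaction)

outcomes = {  # outcome statement -> (mean importance, mean satisfaction), 1-10 scale
    "minimize time to share notes with absentees": (8.1, 1.0),
    "minimize chance of missing a decision":       (8.6, 2.4),
    "minimize time to identify action items":      (7.3, 4.5),
}

for statement, (imp, sat) in sorted(
        outcomes.items(), key=lambda kv: -opportunity_score(*kv[1])):
    score = opportunity_score(imp, sat)
    label = "under-served" if score > 12 else "adequately served"
    print(f"{score:5.1f}  {label:17}  {statement}")
```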

Pitfalls / anti-patterns

  • Writing outcome statements that contain solutions ('add an auto-detect button').
  • Surveying users who don't actually do the job.

Risk-Assumption Mapping · Discovery prioritization · Marty Cagan / SVPG

Categorizes the risks of a product idea into Value, Usability, Feasibility, and Viability and tests the largest risks first with the smallest possible experiment.

When to use

  • Before committing eng capacity to a non-trivial bet.
  • When the team feels unable to articulate why an idea might fail.

When not to

  • Tiny incremental changes where a single A/B test is enough.

How to apply

  1. List all assumptions the idea depends on.
  2. Tag each as Value / Usability / Feasibility / Viability risk.
  3. Score by impact-if-wrong and confidence-now (sketched in code after these steps).
  4. Design the cheapest experiment to retire the top 1-2 risks.
  5. Re-rank weekly as evidence accumulates.
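
A hedged sketch of step 3's scoring; the impact × (1 − confidence) priority, the scales, and the example rows (echoing the marketplace worked example later in this module) are invented, not a standard SVPG rubric:

```python
assumptions = [
    # (assumption, risk type, impact if wrong 1-5, confidence now 0-1)
    ("Legal can manage marketplace liability",      "Viability",   5, 0.2),
    ("Users want in-product integration discovery", "Value",       4, 0.4),
    ("We can review third-party submissions",       "Feasibility", 3, 0.5),
]

def priority(impact: int, confidence: float) -> float:
    # Big impact-if-wrong plus low confidence-now = test first.
    return impact * (1.0 - confidence)

for text, risk, impact, conf in sorted(assumptions,
                                       key=lambda a: -priority(a[2], a[3])):
    print(f"{priority(impact, conf):4.1f}  [{risk:11}] {text}")
```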

Five Whys · Root cause analysis · Sakichi Toyoda / Toyota Production System

Iteratively ask 'why' to walk from a symptom to a root cause. Useful in user interviews, post-mortems, and bug triage.

When to use

  • Surface-level user complaints whose real cause is unclear.
  • Post-incident reviews.

When not to

  • Multi-causal complex systems where a single 'why' chain oversimplifies; use a fishbone diagram instead.

How to apply

  1. Start with the observed symptom or complaint.
  2. Ask 'why' and capture the answer; don't accept generalities.
  3. Repeat 4–5 times until the answer is structural rather than personal.

Product Psychology

Cognitive biases that distort product decisions

Confirmation Bias in Interviews

Interviewers selectively probe and remember evidence that confirms the going hypothesis.

Product Risk

The team walks out of every research round 'validated', regardless of what users actually said.

Research Countermove

Pre-register the hypothesis and the falsification criterion. Have a second interviewer or transcriber tag quotes blind to the hypothesis.

Researcher / Demand Effect

Users mirror what they believe the interviewer wants to hear, especially when the interviewer comes from the company that might pay them or refer them.

Product Risk

Hypotheticals get inflated agreement; pricing surveys inflate willingness to pay; feature interest is overstated.

Research Countermove

Use neutral recruiters who don't disclose the hypothesis. Pay users a fixed incentive regardless of stated enthusiasm. Anchor questions in past behavior, not future preference.

Recency Bias in Synthesis

The most recent interview disproportionately influences the synthesis even when earlier interviews carried the same signal.

Product Risk

Roadmaps swing toward whichever user the team last heard from, especially executives or vocal users.

Research Countermove

Tag every quote at intake, not at synthesis. Synthesis should weight every tagged quote equally, not by memorability.

Articulation Bias

Users who can articulate clearly are over-represented; quieter users with the same problem are missed.

Product Risk

Power-user features ship while novice-user pain remains invisible.

Research Countermove

Recruit deliberately across articulation styles; supplement interviews with observation, screen-share recordings, and unmoderated tests.

Hawthorne Effect

Users behave differently because they know they're being observed.

Product Risk

Lab usability tests overstate competence; users perform for the researcher.

Research Countermove

Use unmoderated remote tests, in-product analytics, and longitudinal studies whenever possible. Treat moderated lab tests as one of multiple data sources.

Cargo-Cult Discovery

Adopting discovery rituals (interviews, OSTs, story maps) without the underlying behavioral change in the team's decision-making.

Product Risk

Discovery becomes documentation theater; the same backlog ships regardless of what was learned.

Research Countermove

Tie every discovery output to a decision: 'This week's research changes our roadmap in this specific way, or it changes our confidence in this specific item.'

Organizational anti-patterns

When ceremonies look like rigor but aren't

Validating, Not Discovering

Research is scheduled after the spec is written; goal is to confirm rather than challenge.

The PM has already committed to the solution and uses 'discovery' as a checkpoint to feel safe.

Fix

Run discovery before scoping. If you must run it after, give the team explicit permission to kill the idea based on findings.

The Persona Slide That Lives Forever

A glossy persona deck made 18 months ago is still cited in decisions; nobody can name a real recent user.

Personas are easier to display than to maintain; nobody owns their freshness.

Fix

Replace static personas with a continuous panel. Cite real recent quotes and behaviors in decisions, not 'Marketer Mary'.

Survey-First Discovery

The team's first move on every new question is to send a survey.

Surveys feel scientific and scale fast; interviews feel slow and subjective.

Fix

Lead with 5-7 interviews; only survey to size what interviews already revealed qualitatively.

The Loudest User Drives the Roadmap

Roadmap shifts in response to the latest angry support ticket or executive escalation.

Without an aggregated insight library, vivid recent feedback wins by default.

Fix

Aggregate weekly: count themes across interviews, support tickets, NPS verbatims, and churn reasons. Make the count visible in roadmap meetings.

Insights as Slide Decks

Research findings live as polished one-time decks; they don't accumulate into a searchable corpus.

Researchers and PMs are rewarded for visible deliverables, not infrastructure.

Fix

Standardize tagging. Build a searchable repository — even a structured Notion/Coda/Airtable database is enough.

Worked examples

Walkthroughs translated from real trade-off rooms

Catching a phantom feature with a switch interview

A team is convinced users want richer reporting. Surveys show 78% interest. Internal advocates are pushing for a Q1 build.

  1. Run 8 switch interviews with users who recently adopted a competitor that already has rich reporting.
  2. Map the four forces. Push and pull are real; anxieties (cost, change) are large; habit (existing internal dashboards) is the largest force.
  3. Find that only 2/8 actually use the rich reporting after switching; 6/8 still rely on legacy dashboards exported to spreadsheets.
  4. Reframe the opportunity: not 'rich reporting in product' but 'reduce the spreadsheet round-trip'. Ship a CSV export with templated views in 3 weeks instead of a 4-month reporting build.

Takeaway: Stated demand was real but did not translate into post-switch usage. The actual job was about leaving the product, not staying in it.

Using ODI to find the under-served outcome

A meeting-notes product wants to know which AI feature to build first.

  1. Interview 12 users to identify the functional job: 'capture decisions and action items from a meeting'.
  2. Decompose into 14 outcome statements, e.g. 'minimize time to identify action items', 'minimize chance of missing a decision'.
  3. Survey 200 users for importance and satisfaction.
  4. Highest opportunity scores: 'minimize time to share notes with absentees' (15.2) and 'minimize chance of missing a decision' (14.8).
  5. Lower than expected: 'minimize time to identify action items' (10.1) — already considered well-served by the existing product.

Takeaway: ODI redirected the team from auto-summarization (already satisfying) toward decision tracking and absentee summaries (under-served).

Killing a feature using risk-assumption mapping

Team wants to add an in-app marketplace for third-party integrations.

  1. List assumptions: users want to discover integrations in-product; partners will list; we can review submissions; legal can manage liability; revenue model works.
  2. Tag risks: Value (partial — direct evidence weak), Viability (dominant — legal & revenue model unsolved), Feasibility (medium).
  3. Test viability first with 5 partner conversations and a legal review — both come back negative under current contracts.
  4. Kill the marketplace; instead, ship a curated integrations directory and a partner program with handpicked launch partners.

Takeaway: The dominant risk was viability, not value. Testing the right risk first prevented a 6-month build that would have shipped into a legal wall.

Resources / Case Studies

Curated reading for this mission

Continuous Discovery Habits

Teresa Torres

Book

The canonical operating manual for weekly discovery practices, the Opportunity Solution Tree, and assumption testing.

The single most actionable book on modern discovery practice; every team benefits from running its first 90 days.

Teresa Torres (Product Talk)

Framework

The original blog series introducing the OST as a connective tissue between outcomes, opportunities, solutions, and tests.

The clearest free reference for the OST diagram and its discipline.

Demand-Side Sales 101

Bob Moesta

Book

Moesta's articulation of JTBD applied to selling. Covers the four forces, switching moments, and the timeline interview.

Translates JTBD from theory to operational interview craft for PMs and sales partners.

Jobs to Be Done — Theory of JTBD

Tony Ulwick (Strategyn)

Essay

Ulwick's school of JTBD with outcome-driven innovation, opportunity scoring, and quantitative discovery.

The right tool when the team needs to size and prioritize unmet needs at scale, not just understand them qualitatively.

Inspired

Marty Cagan

Book

Cagan's Value/Usability/Feasibility/Viability framework and prototyping techniques for retiring each risk type.

Establishes the risk-typing language that the rest of the industry uses.

Interviewing Users

Steve Portigal

Book

Portigal's craft-level guide to interviewing users without leading them. Covers question design, listening, and synthesis.

The best stand-alone book on interview mechanics, especially for PMs without UX research training.

The Mom Test

Rob Fitzpatrick

Book

How to run customer conversations that surface real evidence even when users (and your mom) want to be encouraging.

A 90-minute read that fixes the most common interview mistake — asking questions that beg for compliments.

Tool / Vendor

Templates for OSTs, interview snapshots, and weekly trio cadences.

Saves teams the design-thinking-template hunt; production-ready artifacts to drop into Notion or Miro.

JTBD case study showing how customers hire and fire products through unexpected substitutions.

Strong narrative companion to the Switch interview methodology.

Newsletter / Feed

Practitioner case studies on user research from PMs at Airbnb, Stripe, Notion, and more.

Recurring real-world examples from current product teams; a complement to the canonical books.

The Right It

Alberto Savoia

Book

Pretotyping and the search for 'the right it' before 'building it right'. Practical low-cost demand experiments.

Bridges discovery and early experimentation, with concrete templates for fake-door tests, smoke tests, and the like.

How to operationalize ongoing user research at scale: panels, ops, tooling, and synthesis.

Covers the operational layer that most discovery books skip.