2 April 2026 , Blackgate

An AI-assisted MVP can be a legitimate way to learn fast. It becomes a liability when speed removes the moments where someone asks what could go wrong. A review is not about paperwork - it is a short, explicit pass over how the system could be abused, misconfigured, or broken under load before you widen access.

For a general pre-launch checklist aimed at business operators, see what can break before real users or data. This piece focuses on when that situation is serious enough that you should commission a security and/or architecture review rather than only patching ad hoc.

Data and compliance pressure

Review before launch if the MVP touches any of the following:

  • Identifiable people - names, contact details, health, workforce, children, accounts tied to real individuals.
  • Money movement or financial identity - payments, payouts, bank linking, credit decisions, anything that could enable fraud.
  • High-stakes decisions - eligibility, hiring, pricing, safety-related outputs where a wrong answer has outsized harm.

Regulated contexts (privacy law, sector rules) raise the bar; so does cross-border data flow. If you cannot draw a simple diagram of what data enters, where it is stored, who can read it, and how it is deleted, you are not ready to scale users - you are ready for review.

AI-specific red flags

  • No one can explain what the model is allowed to decide - outputs drive user-facing actions but there is no policy, fallback, or human path when the model is wrong.
  • Training or fine-tuning on production or client data without retention and consent clarity - including data sent to third-party model APIs.
  • Prompt injection / untrusted content in context - user-supplied text, scraped pages, or emails fed straight into the model with no isolation or limits.
  • “Black box” recommendations in domains where accountability matters - finance, health, employment - with no logging of inputs and versions for an incident replay.

These issues are not hypothetical; they decide whether a bug is a support ticket or a regulatory or reputational event.

Security and access

These items read like a “bad idea” list because they are also the highest-yield checks in a short review: you do not need a week of penetration testing to see an API key in a public bundle or a /debug route with no auth.

Escalate to a security-minded review when you see:

  • Single secrets pasted in repos, env samples committed, or API keys shipped to the browser.
  • Authentication that is effectively optional - debug routes, “temporary” admin toggles, shared accounts.
  • Missing ownership checks on reads and writes - IDs are guessable or enumerable in APIs.
  • Wide-open webhooks or callbacks with no signature verification or replay protection.
  • Dependencies that have never been audited or pinned; no plan to patch CVEs.

If your threat model is “we are small, no one cares,” remember that bots do not care about your stage - they scan for the above patterns automatically.

Architecture and operability

Architecture review is warranted when:

  • One process does everything - synchronous chains, no clear boundaries, no path to scale one hot path without rewriting everything.
  • No staging that mirrors production constraints - or “production is the first place we try real data volume.”
  • No rollback - deploys are not reversible; database migrations are irreversible experiments.
  • Critical paths (auth, payment, core workflow) have no automated checks and no on-call or alerting story.

The goal is not enterprise UML - it is enough structure that another competent engineer or firm can operate and extend the system without reverse-engineering folklore.

“We moved too fast” indicators

Treat these as stop signs for a wider launch until reviewed:

  • Core behaviour was changed in chat and not reflected in tickets, tests, or docs.
  • Only one person can deploy or knows which database is canonical.
  • No incident practice - nobody has walked through “what if prod is down or data is wrong for an hour?”

Rapid AI-assisted development amplifies these gaps because the code volume grows faster than shared understanding.

What a proportionate review looks like

You do not need a twelve-week programme on day one. A useful engagement often includes:

  1. Threat modelling light - assets, actors, main abuse paths, mitigations (hours to a day, not months).
  2. Data flow and storage map - where PII and secrets live; retention and access.
  3. Spot review of auth, APIs, and integration points; dependency and config scan.
  4. Architecture readout - scaling bottlenecks, single points of failure, recommended refactors before marketing spend.

Deliverable: a prioritised list - fix now / fix before scale / monitor - not a generic slide deck.

When “ship anyway” is still okay

Very small internal cohorts, synthetic data only, explicit throwaway scope, and no sensitive integrations can justify bounded exposure - if you still track the red flags above and have a date to review before externalisation.


Frequently asked questions

What are the main red flags for an AI-built MVP?

Sensitive data without a clear flow and access model; weak auth; keys and secrets in the wrong places; unreviewed third-party AI and APIs; and architecture that no second person could operate.

When must we do a security review?

Before meaningful real user data, payments, or regulated information - and before high-visibility launch if any of the security signals above are present.

Is an architecture review different from security?

Overlap exists, but architecture stresses scale, failure modes, maintainability, and deployment; security stresses abuse, leakage, and compliance. Many MVPs need both lenses once; a single small team can wear both if they are senior enough.

Can we wait until after launch?

You can, but fixes get more expensive and public after misuse or breach. Timeboxing a review before irreversible exposure is usually cheaper.

Do we need penetration testing immediately?

Not always first. Threat modelling, config review, and code/architecture review often find the critical issues; pen testing is valuable once the obvious doors are closed and you want an adversarial pass.

Have a Project for Us?

Tell us where you're headed, and we'll show you how we can help you get there.

Let's chat