9 of 10 AI Features Leaked Data via Prompt Injection

AI features prompt injection testing NZ and AU

Over the past two years, almost every SaaS product picked up some kind of AI feature. A chat assistant that answers customer questions. A summarizer that reads support tickets. A helper that drafts replies based on account history. Most of these shipped fast, because customers expect them now, and the teams building them were focused on making the feature work, not on what happens when someone tries to misuse it.

Capture The Bug set out to check exactly that. Fifty AI features, built by SaaS companies across New Zealand and Australia, were tested for one specific weakness: prompt injection. The companies are not named here, because the goal is not to single anyone out. It is to show what happens when a feature ships before anyone asks what it will do with text it was never meant to trust.

Forty five of the fifty leaked something they should not have.

What prompt injection actually is

Understanding prompt injection vulnerabilities

Most AI features work by reading text and following instructions inside that text. The trouble starts when the AI cannot reliably tell the difference between instructions from the company that built it and instructions hidden inside content it is simply supposed to read, like a customer's message, a support ticket, or an uploaded file.

Picture a support assistant that summarizes incoming tickets for a staff member. If a ticket contains a line that looks like an instruction rather than a complaint, something telling the assistant to ignore its previous task and reveal information instead, a poorly built assistant may simply follow it. It does not know the difference between the company's instructions and a stranger's. It just sees text, and text is what it was trained to act on.

This is not a theoretical risk. It is the single most common way these fifty AI features failed.

What actually leaked

Customer data leaked via AI prompt injection

The leaks fell into a small number of repeated patterns across the test group.

Cross account data exposure. In several cases, a crafted message convinced the assistant to reveal details from a different customer's account, including names, order details, or support history that had nothing to do with the conversation at hand.
Internal instructions exposed. A number of assistants could be talked into revealing the internal guidance they had been given, the equivalent of a staff handbook nobody outside the company was meant to see. That alone hands an attacker a map of what the system is allowed to do.
Unintended actions through connected systems. A handful of the AI features were wired into other tools, like ticketing systems or account settings. In those cases, a crafted instruction occasionally got the assistant to attempt an action, such as changing a setting or triggering a workflow, that the person typing it had no real authority to request.

None of these required deep technical skill. They required patience, a willingness to phrase a request in an unusual way, and a feature that had never been tested against exactly that kind of input.

What am I risking by not acting?

Your Last Pentest Is Already Out of Date

Every week you ship without continuous testing is a week a vulnerability goes unseen. See what Capture The Bug finds in your first engagement.

Show Me What I'm Missing

Book a demo

If a chat assistant, a summarizer, or any AI feature in your product has never been tested this way, it is worth finding out what it would actually do with the wrong kind of message. Book a demo with Capture The Bug and see how a real test against your AI feature works, with findings explained in plain language.

Why this keeps slipping through

Why AI security issues slip past testing teams

Most teams test an AI feature the way they test any feature: does it answer correctly, does it sound right, does it handle the obvious edge cases. That testing rarely includes someone deliberately trying to manipulate it, because that mindset belongs to security testing, not product testing, and the two often happen on completely separate timelines, if the second one happens at all.

It also slips through because a traditional pentest, scoped before the AI feature existed or written without it in mind, may never touch the feature specifically. The login page gets tested. The payment flow gets tested. The chat widget that quietly has access to every customer's account history sometimes does not, simply because nobody updated the scope to include it.

How this should actually be tested

Testing AI features for vulnerabilities properly

An AI feature is, underneath the interface, just another part of the application talking to a backend through an API. Treating it that way is the most useful shift a team can make. A focused api pentest service aimed specifically at the endpoints behind an AI feature, the data it can access, and the systems it is connected to, finds these issues the same way it finds any other access control flaw: by trying to make the system do something it should refuse.

This is also good news for budget. Testing an AI feature properly does not usually require a separate, specialized engagement bolted on top of everything else. It fits naturally inside the same scoped penetration testing service a SaaS company already needs for its application and APIs. The feature just needs to be named in scope rather than left out of it.

This connects directly to how penetration testing cost in Australia and New Zealand actually works. A separate, AI-specific security audit can sound expensive and complicated, which is part of why so many companies skip it entirely. Folding the AI feature into an existing scoped test, the same kind already recommended for penetration testing for startups working through early growth and compliance requirements, usually costs far less than treating it as its own project, and it gets tested by the same people who already understand the rest of the product.

The honest comparison, as always, is not the cost of testing against zero. It is the cost of testing against the cost of a support assistant handing a stranger someone else's account details, which is a conversation no founder wants to have with a customer.

What this means for your roadmap

AI features are not going away, and customers have made it clear they expect them. What this test shows is that shipping one without checking how it handles a hostile message is no different from shipping a login form without checking what happens when someone tries the wrong password on purpose. Nine times out of ten in this exercise, nobody had checked.

The fix is not to slow down on building AI features. It is to make sure the same scrutiny applied to every other part of a product, the kind already built into a standard penetration testing service, gets applied here too, before a customer finds the gap by accident or someone with worse intentions finds it on purpose.

Plan Security Better

Plan Your Annual Pentesting Strategy the Right Way

Learn how modern SaaS companies structure pentesting across the year to reduce risk, stay compliant, and avoid last-minute panic before audits.

Get the Annual Pentesting Plan

FAQ

What is prompt injection in simple terms?

It is when someone hides an instruction inside text an AI feature is meant to read, like a support message or uploaded file, and the AI follows that hidden instruction instead of simply processing the content normally. It happens because many AI features cannot reliably tell trusted instructions apart from text written by anyone else.

Why did 9 out of 10 AI features fail this kind of test?

Most teams test whether an AI feature works correctly, not whether it can be manipulated. Without a deliberate test for that specific weakness, gaps like this tend to stay invisible until someone, intentionally or not, stumbles onto them.

Can a normal penetration test catch prompt injection issues?

Only if the AI feature is explicitly included in scope. A pentest scoped before the feature existed, or written without it in mind, often misses it entirely. A proper API pentest service that names the AI feature's endpoints and data access as part of the scope will test for exactly this.

Does fixing this require rebuilding the AI feature from scratch?

Usually not. Most fixes involve tightening how the system separates trusted instructions from untrusted input and limiting what data or actions the AI feature can reach in the first place. It is a scoping and access control problem more than a full rebuild.

Is testing an AI feature expensive compared to a normal pentest?

Not when it is folded into an existing scoped engagement rather than treated as a separate project. Naming the AI feature's endpoints in the scope of a standard penetration testing service usually costs far less than a standalone AI security audit, and it gets tested alongside everything else.

We Tested 50 AI Features Built by NZ and AU SaaS Startups. 9 Out of 10 Leaked Customer Data Through Prompt Injection.

What prompt injection actually is

What actually leaked

Your Last Pentest Is Already Out of Date

Book a demo

Why this keeps slipping through

How this should actually be tested

What this means for your roadmap

Plan Your Annual Pentesting Strategy the Right Way

FAQ

What is prompt injection in simple terms?

Why did 9 out of 10 AI features fail this kind of test?

Can a normal penetration test catch prompt injection issues?

Does fixing this require rebuilding the AI feature from scratch?

Is testing an AI feature expensive compared to a normal pentest?

Manu Kumar Singh

Read Industry Insights

The 7 Vulnerabilities Your AI-Generated Code Is Shipping to Production Right Now, and How to Catch Them Before Attackers Do

I Asked 30 NZ CTOs Why They Fired Their Last Pentest Provider. The Answers Will Change How You Choose One.

We Tested 50 AI Features Built by NZ and AU SaaS Startups. 9 Out of 10 Leaked Customer Data Through Prompt Injection.

Your SOC 2 Auditor Wants a Pentest, Here's How to Get One in 7 Days, Not 3 Months, for NZ and AU SaaS

The $200K Bug a NZ Startup Ignored, and the 2-Hour Pentest That Would Have Caught It

I gave 10 NZ SaaS apps to our hackers. 7 were breached in under 60 minutes. Here's what they found.

We analysed 2,500 real bugs from NZ and AU SaaS companies. The #1 vulnerability isn't what your CTO thinks it is

LLM Penetration Testing: How to Test Your AI Product Before Attackers Do (2026)

Penetration Testing for SaaS Startups: What to Test, When, and How Much It Costs

How to Pass Your SOC 2 Audit Using Continuous Pentesting (AU and NZ Edition)

PTaaS vs Traditional Penetration Testing: Which One Actually Protects Your Business in 2026?

What Happens After a Pentest? A Step-by-Step Guide to Remediation and Re-Testing

How to Build a Business Case for PTaaS Investment (With Numbers Your CFO Will Approve)

Penetration Testing for Healthcare SaaS in NZ and AU: Compliance, Scope, and What to Budget

How Fast Should a Pentest Provider Triage and Report a Critical Vulnerability? (Benchmarks Inside)

Top 5 Signs Your Current Penetration Testing Provider Is Underdelivering

How to Read and Act on a Penetration Testing Report (A Guide for CTOs and CISOs)

Bug Bounty Program vs Penetration Testing as a Service: Which Model Delivers Better ROI?

Real-Time Vulnerability Detection vs Scheduled Scanning: Which Protects Your Business Better?

Penetration Testing for Fintech Companies in Australia | Regulatory Guide 2025

How to Evaluate a Vulnerability Disclosure Program Before You Launch One

What Is Included in a Professional Penetration Test? (And What Most Vendors Leave Out)

PTaaS for SaaS Startups: When Is the Right Time to Start and What Does It Cost?

How Continuous Penetration Testing Helps You Pass SOC 2, ISO 27001, and PCI-DSS Audits

Penetration Testing Services in New Zealand: What to Look For in 2026

Why One Annual Pentest Is No Longer Enough - And What to Do Instead

How to Choose a Penetration Testing Provider in Australia: 7 Questions to Ask Before You Sign

Best PTaaS Platforms in 2026: Capture The Bug vs Cobalt vs Synack vs Astra (Honest Comparison)

How Much Does Penetration Testing as a Service Actually Cost in Australia and New Zealand?

SOC 2 Compliance Without Stress Using Continuous Pentesting

How Often Should You Do Penetration Testing in 2026

Top 7 Penetration Testing Mistakes SaaS Companies Still Make

Zero Trust Security vs Penetration Testing: What Actually Protects You in 2026

AI Pentesting Tools vs Human Hackers: What Actually Works?

Top 7 Hidden SaaS Security Risks Nobody Talks About

“We Got Hacked in 10 Minutes” Real Attack Simulation Breakdown

Zero Trust Security in 2026: Is Your Company Already Outdated?

What Is Software Penetration Testing? A Practical Guide for Modern Teams

Third-Party Penetration Testing Service: Process, Benefits and Providers

The 7 Best Pentesting Tools in 2026: Why Tools Aren’t Enough

From Cost Center to Growth Driver: The Business ROI of PTaaS

Scaling SaaS Securely: What Top Founders Do Differently in 2026

How to Prove Your Security Posture to Enterprise Clients (Without PDFs)

The Hidden Revenue Impact of Weak Security in SaaS Businesses

Why Security Leaders Are Investing in Continuous Pentesting (Not More Tools)

The CISO Playbook for 2026: Real-Time Visibility Over Static Reports

From Audit Stress to Always-Ready: How PTaaS Redefines Compliance for CISOs

The $1M Risk: Why SaaS Founders Can’t Rely on Traditional Security Anymore

Why Modern CISOs Are Replacing Annual Pentests with Continuous PTaaS

Penetration Testing Tips Every CEO and CTO Should Know

Why CISOs Are Moving Beyond Annual Pentests to Always-On Security Testing

How CISOs in Australia Choose the Right Pentesting Partner

AI Risk Testing for US Fintech: What’s Broken and How to Fix It

Cloud Penetration Testing Pricing in 2026: What Businesses Actually Pay Across USA, Australia, and New Zealand

Why New Zealand Companies Are Moving to Continuous Pentesting Platforms

Why Australian Companies Are Moving to Always-On Penetration Testing

AWS Security Testing for Enterprises in the USA: A Practical Readiness Checklist

AI-Led Pentesting for SaaS in New Zealand: A Practical Founder’s Guide

Cloud Security Testing in Australia: What Smart Businesses Do Differently

When Should Companies Run Security Testing to Stay Truly Protected?

Intelligent Penetration Testing Services in the USA: A Practical Enterprise Security Guide for 2026

What Penetration Testing Really Means for Modern Businesses

Why Connected Devices Break Under Real Security Testing