Grow your conversions ★★★★★ 5.0 · 150+ brands
Free Audit →
Leading AI Agency

Creative Performance
Agency

Apps, Games & ecommerce – we accelerate your business with AI‑powered creative and performance marketing.

Live reporting dashboard
AI‑assisted insights
ROAS (7 days)
4.8x
+23% vs prev. 7 days
CPA (last 30 days)
€21.92
−18% vs baseline
Ad spend (7 days)
€127K
+8% vs prev. 7 days
Performance trend — last 7 days
New creative v3 live
Day 1Day 2Day 3Day 4Day 5Day 6Day 7
CPA dropped from €26.80 → €21.92 in 7 days
Current period
Previous period
Subscription app — ROAS up 48% in 7 days
Admiral Media performance account

Kevin,

AI Infrastructure Specialist,

Admiral Media,

Jun 1, 2026

How Many Creatives Does It Take to Find a Winner? Data from 10,000+ Ads

Creative win rate is the share of ad variants that become genuine scale performers, the ones a platform algorithm trusts with real budget and that hold their efficiency as spend grows. It is the single most misunderstood number in paid user acquisition. Most teams assume a good creative team produces winners most of the time. The data says the opposite. Industry analyses of large creative datasets consistently land in a narrow band: only about 6 to 7 percent of ad variants perform at scale, a hit rate documented across Admiral Media’s analysis of AI-generated ad creative results. That means roughly 93 out of every 100 ads you produce will never carry meaningful budget. The job is not to write better ads. The job is to test enough of them to find the few that win.

This piece lays out what the creative win rate actually is, why it sits so low, how many variants it takes to reliably find a winner, and what separates winners from the rest. Every number below traces to a named source: an Admiral Media case study or a cited industry document. Admiral Media has delivered more than 10,000 AI-produced ads while managing over €500M in mobile ad spend across 150+ brands, and the patterns here come directly from that work.

What Is a Creative Win Rate, and Why Is It So Low?

A creative win rate is the percentage of tested ad variants that reach and hold scale. In most accounts Admiral Media manages, that figure clusters around 6 to 7 percent, which matches independent industry analysis of how creative performance is distributed. Paid social and app-install auctions follow a heavy-tail distribution: a small minority of creatives capture the majority of efficient spend, while the long tail produces mediocre results at best. Roughly 5 percent of ads end up spending at least ten times their account median, a concentration documented in large-scale creative benchmarking.

The mechanism behind this is auction dynamics. Platforms like Meta and Google do not distribute budget evenly across your creatives. They concentrate impressions on the variants showing early engagement and conversion signals, then starve the rest. From running hundreds of creative tests across Google UAC and Meta, the Admiral Media team sees the same shape every time: two or three creatives absorb most of the spend, a handful break even, and the long tail never gets a real impression volume. The low win rate is not a sign of weak creative. It is a structural feature of how performance advertising allocates budget.

This reframes the entire creative problem. If only 6 to 7 percent of variants win, then the highest-leverage variable is not the average quality of each ad. It is the total number of distinct, credible variants you put into the auction. Quality still matters, but volume is what determines whether the winners that statistically exist in your creative space actually get discovered.

How Many Creatives Does It Take to Find a Winner?

To reliably find three to five scale winners, plan to test 50 to 80 distinct variants, and 100 or more per month if you run multiple campaigns or markets. The arithmetic is simple once you accept the hit rate. At a 6 to 7 percent win rate, testing five creatives gives you an expected zero winners. The math only turns in your favor when the testing surface is large enough for the rare winner to surface.

The table below works the expected value at a 6.5 percent hit rate. It is a direct calculation on the cited industry win rate, not a measured Admiral Media statistic, and it shows why small test batches almost always disappoint.

Variants tested Expected scale winners (at ~6.5% hit rate) Practical read
3 to 5 0 (under 0.4 expected) Coin flip on finding even one winner. Most common failure mode.
15 ~1 Minimum viable test. One winner if you are fortunate.
30 ~2 Entry point for a reliable program.
50 ~3 Consistently surfaces a small winner set.
80 ~5 Strong monthly target for a scaling account.
100+ ~6 to 7 High-spend, multi-market programs.

This is also why platform documentation pushes advertisers toward volume. Google Ads recommends filling every available asset slot, up to 10 text, 20 image, and 20 video assets per ad group, in its App campaign creative best practices. Meta’s Advantage+ creative documentation notes the system needs distinct, varied assets and time, often up to roughly two weeks, to learn which combinations produce results. Both platforms are telling you the same thing: give the algorithm a large, varied pool, then let it find the winners.

The Admiral Media Creative Win-Rate Engine

The Admiral Media Creative Win-Rate Engine is the framework the Admiral Media team uses to turn a low hit rate from a liability into a repeatable advantage. It treats winner discovery as a volume and selection problem, not a guessing game.

  1. Set the win-rate baseline. Start every program by assuming 6 to 7 percent of variants will win. This single assumption sizes your production target. If you want five winners this quarter, you need roughly 75 to 80 credible variants in the auction, not five clever ideas.
  2. Maximize the testing surface. Use AI production to reach 30 to 50 plus variants per cycle at a cost that makes high-volume testing economically viable. Traditional production cannot hit this volume at acceptable cost or speed, which is the real constraint AI removes.
  3. Diversify on the axes that matter. Variation must be structural, not cosmetic. Test different hooks, formats, value-proposition angles, and audience-specific messaging. Ten color swaps of one concept is one test, not ten. The Admiral Media team treats hook and angle as the primary win-rate levers.
  4. Feed the algorithm enough to learn. Submit enough genuinely distinct variants for the platform to exit the learning phase and concentrate budget on emerging winners. Underfeeding the algorithm is the most common reason a creative pool underperforms its potential.
  5. Kill losers on data, not opinion. Set statistical thresholds for cost per acquisition and early engagement, then cut underperformers fast. The discipline is letting the auction, not the creative director’s taste, decide what scales.
  6. Recycle proven winners across markets. Once a winner is validated, localize and translate it into new markets rather than starting from zero. A proven concept carries a far higher win probability than a fresh untested one, as the Inshallah results below demonstrate.

What the Data Shows: Three Admiral Media Case Studies

The clearest evidence for the volume-finds-winners pattern comes from real campaigns where Admiral Media expanded the testing surface and measured the result. Each case below follows the same logic: more variants tested, more efficient winners found.

Star Chef 2: Volume Surfaced Winners Traditional Production Missed

Admiral Media built an automated AI production workflow for Star Chef 2, the mobile cooking game from 99games, generating large volumes of on-brand variants and testing hook, gameplay-highlight, and audience-specific angles simultaneously. The structured testing surfaced several high-performing concepts that traditional production had never identified.

  • +45% ROAS: return on ad spend rose as winning creative combinations were found and scaled, per the Star Chef 2 growth case study
  • +55% CTR: click-through rate climbed sharply as high-volume testing surfaced the formats that resonated most with each segment
  • -18% CAC: customer acquisition cost fell as efficient creative cut wasted spend across the funnel

Shilpa Bhat, VP Games at 99games, noted that Admiral Media helped scale creative production while staying consistent with Star Chef 2’s visual style and brand. The lesson is precise: the gains came from the testing surface AI production made possible, not from the AI itself. When you can test 50 variants instead of 5, you find winners that would otherwise stay buried.

StoryBeat: Speed Multiplied the Number of Tests Per Cycle

Admiral Media integrated AI into StoryBeat’s creative process at the production level, compressing the time from concept to live ad from weeks to days. Faster production is not a convenience here. It directly raises win rate by allowing more hypotheses to be tested inside the same performance window.

  • 50%+ reduction in production time: brief-to-live was cut in half, enabling far faster response to performance data, per the StoryBeat creative uplift case study
  • 2x campaign impact: advertising effectiveness doubled across the campaign’s key performance metrics

Every week saved is another week of performance data informing the next iteration. Over a quarter, that compounding is what separates a program that finds two winners from one that finds five.

Inshallah: A Proven Winner Beat Fresh Creative, Decisively

Admiral Media scaled Inshallah, a Muslim dating app with over 5M users, into the French market by translating a proven US winner with AI rather than producing net-new local creative. In a 10-day head-to-head test on Facebook Ads, the recycled winner went up against a fresh native French UGC ad. The translation of a single video took under 30 minutes, work that was not feasible before AI tooling.

  • AI-translated creative: 77% of installs and 77% of spend: the algorithm pushed the majority of budget to the recycled winner, per the Inshallah AI translations case study
  • Native French creator: €2.52 CPI, 25% of installs, 23% of spend: the fresh local creative lost the head-to-head despite being market-native
  • 15% lower CPI: the AI-translated winner acquired installs 15 percent cheaper than the native French ad

This is the win-rate engine’s final principle proven in a controlled test. A validated winner, localized, beat a fresh untested concept on its home turf. Recycling proven creative is one of the highest win-probability moves available, which is why it sits at the end of the framework.

Inshallah French-market head-to-head: AI-translated winner versus native French creative Grouped bar chart comparing share of installs and share of ad spend. The AI-translated creative took 75 to 77 percent on both measures while the native French creative took 23 to 25 percent. Inshallah France: where the budget and installs went 0% 25% 50% 75% 100% 25% 75% Share of installs 23% 77% Share of ad spend Native French creative AI-translated winner (15% lower CPI)
Share of installs and ad spend in Inshallah’s 10-day French-market test. The AI-translated winner also achieved a 15% lower CPI than the native French creative. Source: Admiral Media Inshallah case study.

What Separates Winners From the Rest?

Winners are separated from the rest by structural creative differences and by the discipline of the program testing them, not by production polish. Across Admiral Media’s campaigns, the variables that move win rate are the hook, the format, the value-proposition angle, and the match between message and audience. Cosmetic changes rarely produce a winner. A different opening three seconds frequently does.

Platform guidance reinforces this. Google’s creative asset documentation reports that portrait video converts roughly 60 percent better than landscape and that assets are ranked Low, Good, or Best only after they exit the learning phase, a built-in admission that the platform itself cannot predict winners in advance and must test them. That is the core truth of creative win rate: you cannot pick winners reliably, so you must surface them through volume and let performance data decide.

Star Chef 2 creative testing outcomes Bar chart showing percentage changes for Star Chef 2 after high-volume AI creative testing: ROAS up 45 percent, CTR up 55 percent, and CAC down 18 percent. Star Chef 2: results from high-volume creative testing 0% +30% +60% -30% +45% ROAS +55% CTR -18% CAC
Star Chef 2 performance change after Admiral Media’s high-volume AI creative testing program. Source: Admiral Media Star Chef 2 case study.

The second separator is the program around the creative. Admiral Media’s strongest results came from sustained testing with a feedback loop, not one-off batches. Brands that give a program 60 to 90 days, supply 30 to 50 variants rather than 5, and route performance data back into the next production cycle find more winners. Brands that cut early, over-constrain the brief, or treat AI as a cost-cutting tool rather than a performance strategy find fewer. The Admiral Media team operates the human layer that the platforms cannot: building the testing framework, interpreting the signals, and deciding which directions to push. For a deeper look at the testing system itself, see Admiral Media’s creative testing framework for mobile apps and the related creative fatigue curve data.

How to Apply This to Your Own Creative Program

To apply the creative win rate to your own program, size production from the win rate backward and commit to volume. Decide how many winners you need this quarter, multiply by roughly 15 to account for the 6 to 7 percent hit rate, and that is your variant target. A brand that needs four reliable winners should plan for roughly 60 variants, not four polished concepts.

The comparison below summarizes how the three Admiral Media programs above turned testing volume into measured outcomes, in the who-what-result format that makes each case easy to verify.

Client What Admiral Media did Sourced result
Star Chef 2 (99games) Automated AI production testing hooks, gameplay highlights, and audience angles at volume +45% ROAS, +55% CTR, -18% CAC
StoryBeat Compressed concept-to-live from weeks to days to run more tests per cycle 50%+ less production time, 2x campaign impact
Inshallah Recycled a proven US winner into France via AI translation, head-to-head vs native creative AI winner took 75% of installs at a 15% lower CPI

The takeaway is consistent across every account: stop trying to write the one perfect ad, and build the system that finds the few winners hiding in a large, varied creative pool. That is the entire discipline behind creative win rate, and it is what Admiral Media’s AI Creative Factory and performance creative service are built to operate at scale. For the broader principle of why a minority of creatives drives the majority of results, see Admiral Media’s analysis of why 20% of ad creatives generate 80% of results.

Frequently Asked Questions

What is a good creative win rate for paid ads?

A typical creative win rate sits around 6 to 7 percent, meaning only 6 or 7 of every 100 variants become genuine scale performers. This is documented across large creative datasets and reflects the heavy-tail nature of ad auctions, where a small minority of creatives capture most efficient spend. A win rate in this range is normal and healthy, not a sign of weak creative. The way to find more winners is to test more variants, because the percentage stays roughly constant while the absolute number of winners rises with volume.

How many ad creatives should I test to find a winner?

To reliably find three to five winners, plan to test 50 to 80 distinct variants, and 100 or more per month if you run multiple campaigns or markets. At a 6 to 7 percent hit rate, testing only three to five creatives gives you an expected zero winners, which is why small batches so often disappoint. Higher-spend accounts managing several markets should target 100-plus variants monthly. The volume target should be calculated backward from how many winners you need, multiplying by roughly 15.

Why do so few ad creatives actually win?

Few creatives win because ad platforms allocate budget unevenly, concentrating impressions on variants that show early engagement and starving the rest. This produces a heavy-tail distribution where a small minority of ads drives the majority of efficient results. It is a structural feature of auction dynamics on Meta, Google, and similar platforms, not a reflection of creative quality. Because the platform decides winners through testing rather than prediction, the only reliable path to more winners is a larger, more varied creative pool.

Does AI creative production improve win rates?

AI production does not raise the win-rate percentage itself, but it dramatically increases the number of winners you find by making high-volume testing economically viable. In Admiral Media’s work with Star Chef 2, an AI production workflow surfaced high-performing concepts that traditional production had never identified, delivering a 45 percent ROAS increase. The advantage is the expanded testing surface: when you can test 50 variants instead of 5 at acceptable cost, the rare winners that statistically exist in your creative space finally get discovered.

What separates a winning ad creative from a losing one?

Winners are separated by structural differences in hook, format, value-proposition angle, and audience match, not by production polish. Cosmetic changes like color swaps rarely create a winner, while a different opening three seconds frequently does. Google’s own data shows portrait video converts about 60 percent better than landscape, illustrating how format choices move performance. The platform cannot predict winners in advance, which is why it ranks assets only after a learning phase and why surfacing winners requires testing volume rather than guesswork.

Can a proven winning creative be reused in a new market?

Yes, and recycling a proven winner is one of the highest win-probability moves available. In Admiral Media’s work with Inshallah, a validated US creative was translated into French with AI and beat a fresh native French ad in a 10-day head-to-head, taking 75 percent of installs at a 15 percent lower cost per install. A proven concept carries a far higher win probability than an untested one, so localizing and translating winners is more efficient than producing net-new creative for every market.

How long should I run a creative test before judging it?

Give a creative program 60 to 90 days to accumulate meaningful data, and allow each individual variant enough time to exit the platform’s learning phase, which can take up to roughly two weeks on Meta. Cutting tests too early is one of the most common reasons creative programs underperform, because winners often need time to separate from the pack as the algorithm concentrates budget. Sustained testing with a feedback loop, where performance data informs the next production cycle, consistently outperforms one-off batches.

Join +3.000 app marketers and beat your competitors

YOU MAY ALSO LIKE

Get in touch with us