The Cold Email Testing Playbook
Find your winning offer in 30 days or less
The exact testing framework we use to find winning cold email offers. 12 variants on day one, early signal within 48 hours, and a scalable campaign by day 30. No guessing. Just data.
The Sprint Phase: Month 1
Month one has one job: find the offer that makes strangers respond. Everything else is noise.
The Sprint Phase runs weekly cycles. Four cycles in the first month. Each cycle tests aggressively, evaluates fast, and feeds the next round with real signal. Not hunches. Not "I think this sounds good." Actual reply data from actual humans.
Your cadence is simple. Launch variants Monday. Evaluate Wednesday or Thursday. Make decisions Friday. Repeat.
Most teams spend months "refining" copy that never worked in the first place. The Sprint Phase compresses that into 4 weeks by testing 24 to 48 variants total across the month. You will know what works and what doesn't before your competitors finish their first draft.
- Test the offer and the frame only. Personalization comes after you have a winner.
- Do not test structure variations in month one. Same format, different frames.
- Kill losers fast. Emotional attachment to copy you wrote is the enemy of results.
Day 1 Setup: 12 Variants From 3 Situations
Day one you launch 12 variants. Not 2. Not 5. Twelve.
Here is how you build them. Start with 3 different situations your prospect could be in. These are not industries or job titles. These are inferred contexts based on real signals.
Example: a SaaS company with 11 employees is not just "a small SaaS company." That founder is stretched thin, probably thinking about their first sales hire, weighing whether to scale the team or stay lean. That is a situation.
For each of those 3 situations, write 2 angle variations, and render each angle in 2 lengths:
- Short form (30 to 40 words): Just the offer and a CTA. No preamble. No context setting. Punch them in the face with the value prop.
- Long form (50 to 100 words): Same offer, but you paint the picture. See, feel, touch language. Make the pain visceral before you present the solution.
3 situations x 2 lengths x 2 angle variations = 12 variants minimum. All running simultaneously from day one.
- Mine situations from signals: hiring patterns, funding rounds, tech stack, growth rate.
- Each situation should produce a differently framed offer. Same core service, different frame.
- Write like a human. 8th grade reading level. Conversational. No corporate speak.
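The 3 x 2 x 2 build is just a cross product, which makes it easy to generate and track. A minimal sketch; the situation and angle labels below are purely illustrative, not from the playbook:

```python
from itertools import product

# Illustrative inputs -- your real situations and angles come from prospect signals
situations = ["stretched-thin founder", "post-funding scaler", "lean-team holdout"]
angles = ["pain-first", "outcome-first"]          # 2 angle variations per situation
lengths = ["short (30-40 words)", "long (50-100 words)"]

# 3 situations x 2 angles x 2 lengths = 12 variants, all launched on day one
variants = [
    {"situation": s, "angle": a, "length": l}
    for s, a, l in product(situations, angles, lengths)
]

print(len(variants))  # 12
```

Storing variants as structured records like this also pays off later, when you need to know exactly which variable a test changed.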
Decision Points: 500 and 1,000 Contacts
Two thresholds matter. Everything else is premature.
At 500 contacts sent: Start thinking. Are any variants showing promise? Is there a clear separation between winners and losers? If nothing is working at all, start planning your pivot. This is the "heads up" threshold.
At 1,000 contacts sent: Decision time. This is the absolute maximum before you commit. Either you have a winner or you pivot entirely. No "let's give it another week." No "maybe the list is bad." 1,000 contacts is enough data to know.
Why 1,000 max? Infrastructure protection. Every contact you burn on a losing variant is a contact you cannot reach again with a winning one. Your domain reputation, your inbox health, your total addressable market. All finite resources. Treat them that way.
- Never exceed 1,000 contacts without a clear decision. You are burning infrastructure.
- If nothing works at 500, the problem is usually the offer frame, not the copy.
- Step 1 performance is what matters. Do not make decisions based on Step 2 or 3 results.
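The two thresholds reduce to simple decision logic. A sketch; the function name and return strings are our own shorthand:

```python
def decision_point(contacts_sent, has_clear_winner):
    """Map contacts sent to the action the playbook prescribes."""
    if contacts_sent < 500:
        return "keep sending -- too early to judge"
    if contacts_sent < 1000:
        # The "heads up" threshold: look for separation, plan a pivot if needed
        return "heads up: check for winner/loser separation"
    # 1,000 contacts is the hard ceiling -- decide, no extensions
    return "scale the winner" if has_clear_winner else "pivot the offer entirely"
```

The point of encoding it is that the 1,000-contact ceiling has no escape hatch: past that line there are exactly two outputs.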
The Winner Testing Phase: Month 2+
You found your winner. The instinct is to scale it immediately. Resist.
The Winner Testing Phase exists because a single winning email sent at volume will eventually get fingerprinted by ESPs. Spintax can only vary copy so much. Google, Microsoft, and every other inbox provider detect pattern repetition. When they do, your deliverability craters.
So you take your winner and create 3 to 6 variants that are similar in intent but different in structure. Accept a slight performance reduction in exchange for longevity.
The cadence shifts to bi-weekly cycles. You are no longer sprinting. You are sustaining.
The ideal state is 6 variants of one winning angle running for 3+ months. That is a campaign with legs. That is a campaign that builds pipeline consistently without burning out your infrastructure.
- Accept a slight performance dip from variants. Longevity beats short term spikes.
- Run all variants simultaneously. Sequential testing gives ESPs a clear pattern to detect.
- The winner stays in rotation. Variants supplement, they do not replace.
Variant Testing Rules: One Variable at a Time
When you create test variants, change one element. Not two. Not "a few tweaks." One.
If you change the CTA and the opening line and the offer frame in the same variant, you learn nothing. The variant might perform better or worse, but you have no idea which change caused it. That is not testing. That is guessing with extra steps.
One variable per variant. Period.
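If you store each variant as a record of its elements, the one-variable rule can be enforced mechanically. A sketch with illustrative field names:

```python
def changed_fields(control, variant):
    """List the elements that differ between the control email and a variant."""
    return [k for k in control if control[k] != variant[k]]

control = {"offer_frame": "pain-first", "cta": "Worth a quick call?", "opener": "signal-based"}
variant = {"offer_frame": "pain-first", "cta": "Open to learning more?", "opener": "signal-based"}

diff = changed_fields(control, variant)
assert len(diff) == 1, f"variant changes {len(diff)} variables; must change exactly 1"
print(diff)  # ['cta']
```

A variant that fails this check is not a test. It is two tests stapled together, and neither is readable.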
Acceptable Performance Variation
Your winner hits 12% positive reply rate. A variant comes in at 10%. Keep it. A variant hits 8%? Keep it (borderline, but acceptable for diversity). A variant drops to 4%? Kill it or revisit the angle entirely.
The math is simple: a 10% variant that extends your campaign by 3 months produces more total pipeline than a 12% variant that gets fingerprinted in 6 weeks.
- Document which variable you changed in each variant. Future you will thank present you.
- A 2 percentage point drop is acceptable. A 50% relative drop (say, 12% down to 6%) means the variable mattered more than you thought.
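The longevity math is easy to check with back-of-the-envelope numbers; the weekly send volume below is an assumption for illustration only:

```python
weekly_contacts = 500  # illustrative weekly send volume

# 12% winner that gets fingerprinted after 6 weeks
fast_burn = 12 * weekly_contacts * 6 // 100    # positive replies over its lifetime

# 10% variant that survives 3 months (~13 weeks)
long_haul = 10 * weekly_contacts * 13 // 100

print(fast_burn, long_haul)  # 360 650
```

At these assumed volumes the "worse" variant produces roughly 80% more total pipeline, because lifetime, not peak rate, is the multiplier that dominates.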
Testing Sequence Priority
Not all variables are equal. Test them in this order:
1. Offer Framing (Test This First)
How you present the offer is the single biggest lever. Getting granular with the who and attaching outcomes to work they are already doing increases believability. This is the difference between "we help companies grow" and "you have thousands of meeting transcripts riddled with those moments where everyone laughs. We turn those into videos."
Same service. Different frame. Massive difference in results.
2. CTA (Test This Second)
The call to action determines who responds and how qualified they are:
- Soft: "Worth sending over more info?"
- Medium: "Open to learning more?"
- Hard: "Worth a quick call?"
Softer CTAs get more replies but fewer booked calls. Harder CTAs get fewer replies but higher booking rates. Test to find your sweet spot.
3. Opening Line (Test This Third)
Different ways to create relevance and add context. Important, but secondary to offer framing and CTA. A great opening line cannot save a bad offer.
- Offer framing is 80% of the battle. Most people skip straight to wordsmithing the opening line.
- Hard CTAs with strong worldview alignment often outperform soft CTAs on total pipeline value.
When to Kill a Variant
Killing variants is a skill. Most people kill too slow or not at all.
Performance Tier Thresholds
Use these benchmarks to classify every variant and campaign:
| Positive Reply Rate | Classification | Action |
|---|---|---|
| 20%+ | Exceptional Winner | Scale aggressively. Micro tests only. Do not ruin this. |
| 12% to 20% | Strong Winner | Scale significantly. Conservative tests, small changes. |
| 8% to 12% | Winner | Scale steadily. Moderate testing, controlled variance. |
| 6% to 8% | Workable | Scale cautiously. Heavy testing, significant variance OK. |
| Below 6% | Underperforming | Sprint for new angles. Do not scale. Do not "give it more time." |
Below 6% means the message is not working. More volume amplifies the failure, not the results. Kill it, learn from it, and test a new angle on the same list.
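The tier table maps directly to a lookup. A minimal sketch, with tier names from the table and actions abbreviated:

```python
def classify(positive_reply_rate):
    """Return (tier, action) for a positive reply rate expressed as a fraction (0-1)."""
    if positive_reply_rate >= 0.20:
        return ("Exceptional Winner", "scale aggressively; micro tests only")
    if positive_reply_rate >= 0.12:
        return ("Strong Winner", "scale significantly; conservative tests")
    if positive_reply_rate >= 0.08:
        return ("Winner", "scale steadily; moderate testing")
    if positive_reply_rate >= 0.06:
        return ("Workable", "scale cautiously; heavy testing")
    return ("Underperforming", "do not scale; sprint for new angles")
```

Run every variant through the same thresholds every cycle, so "give it more time" never sneaks back in as a judgment call.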
Key Takeaways
1. Launch 12 variants on day one. 3 situations x 2 lengths x 2 angles.
2. Sprint Phase = speed. 4 weekly cycles. 24 to 48 variants in month one.
3. Hard decision at 1,000 contacts max. Never burn infrastructure on a losing variant.
4. Winner Testing Phase = longevity. 3 to 6 variants per winner for 3+ months.
5. Test one variable at a time. Offer framing first, CTA second, opening line third.
6. Below 6% positive reply rate = kill it. No exceptions.
Let us implement this for you
Reading attack plans is one thing. Executing them at speed with proven infrastructure is another. We do both.