Hey Reader,
You know that feeling when a brand new restaurant opens up in your town?
Gorgeous space. Interesting menu. Everyone's buzzing about it.
But you won't go yet, because nobody you actually trust has been there. No reviews. No word of mouth. No data.
So you wait.
That's exactly what Google does with your brand new website.
It shows up. Looks around a little. And leaves. Not because your site is bad. Not because your content isn't good. Because your site is unproven, and Google needs proof before it makes a commitment.
This is crawl budget. And it's the reason brand new websites can launch, look great, have solid content, and still sit invisible in search for months while the owner wonders what on earth is happening.
This week's newsletter is pulled directly from Episode 6 of Ready For Traction, and it covers one of the most misunderstood concepts in technical SEO. If you've ever launched a site, done everything you were supposed to do, and watched it get absolutely no traction from Google, this one explains exactly why.
⚙️ SEO TERM OF THE WEEK ⚙️ CRAWL BUDGET
BY THE BOOK: The number of pages Google's bot — Googlebot — is willing to crawl on a given website within a specific timeframe, allocated based on the site's perceived authority, health, and efficiency.
IN PLAIN ENGLISH: Google doesn't have unlimited time. It has to decide which sites deserve attention, how deep to go, and how often to come back. Your crawl budget is your site's share of that attention — and a brand new site with no track record gets very little of it.
|
WHAT DETERMINES YOUR CRAWL BUDGET:
→ Domain age and history: new sites get less budget than established ones
→ Backlink profile: more authoritative inbound links = more crawl allocation
→ Site health: broken links, redirect chains, and thin content all waste budget
→ Crawl efficiency: a clean, well-structured site gets crawled deeper and more often
→ Content quality: pages with real value get revisited; junk pages get deprioritized
⚠️ You can earn more crawl budget, but you can also waste what you have. Most new sites waste significant budget on low-value pages before Google reaches the good stuff
Why 'Just Publish Content and Google Will Find You' Is Only Half True
This is the advice that gets handed to almost every new site owner. And technically it's not wrong. Eventually, Google will find your content. But 'eventually' is not a traffic strategy. Here's what actually happens on most new sites I audit:
→ Google Search Console shows minimal crawl activity
Maybe a handful of pages indexed when there should be dozens or hundreds
→ Ahrefs shows a thin backlink profile
Which tells me Google hasn't seen enough outside signals to assign meaningful budget yet
→ Screaming Frog usually finds a handful of technical issues
quietly bleeding the crawl allocation, redirect chains, duplicate title tags, pages accidentally set to noindex, internal links pointing to old URLs that redirect, instead of the final destination.
None of these problems feel catastrophic on their own.
But together they're telling Google: this site isn't ready for a full commitment. Come back later. Maybe.
I use three tools together to diagnose crawl health, Screaming Frog, Ahrefs, and Google Search Console, because each one tells a different part of the story. When all three are telling the same story, I know exactly where to start.
The 4 Ways Sites Waste Their Crawl Budget
Google assigns crawl budget based on trust and signal. Here's how websites quietly burn through the budget they do have, without realizing it:
#1 LOW-VALUE PAGES
Thin content, tag archive pages, filter URLs that generate hundreds of near-identical variations, like an e-commerce site where every combination of color and size gets its own URL. Google crawls all of those. Every time it visits a junk page, it uses up budget that could have been spent on your best content. You're sending Google into a room full of stuff it doesn't want, and wondering why it never made it to the good stuff.
The fix: Noindex thin pages, consolidate duplicate content with canonical tags, and audit your URL structure to eliminate parameter-generated junk URLs.
#2 BROKEN LINKS AND REDIRECT CHAINS
If your site has 404 errors, Googlebot follows those links anyway, every dead end is wasted budget. And redirect chains, URL A goes to URL B goes to URL C, force Google to follow the whole chain every single time. That's inefficient. Google notices inefficiency and reduces how often it comes back.
The fix: Fix 404 errors by updating or removing the links that point to them. Flatten redirect chains to a single hop wherever possible. Audit your internal links so they point to final destination URLs, not intermediate redirects.
#3 A BLOATED SITEMAP
A sitemap that includes noindexed pages, pages that redirect somewhere else, or pages that simply shouldn't be submitted is actively diluting your crawl signal. Your sitemap should be a curated recommendation to Google, here's what I actually want you to look at. Submitting everything tells Google nothing about what matters most.
The fix: Audit your sitemap and remove any pages that are noindexed, redirected, or low-value. Think of your sitemap as a VIP list — only pages you genuinely want indexed should make the cut.
#4 NO CRAWL DIRECTION
If you haven't told Google what matters most, through internal linking structure, sitemap priority, robots.txt, it's guessing. Think of handing Google a 200-item menu where half the items are out of stock, a third are slight variations of the same dish with different names, and there's no indication of what the kitchen actually does best. Google's going to order something random, have a mediocre experience, and not rush back.
The fix: Build a clear internal link hierarchy that guides Google toward your most important pages. Make sure your robots.txt blocks areas Google shouldn't waste time in. Create a logical site architecture that communicates priority through structure.
What Recovery Actually Looks Like
Improving crawl health is methodical and not glamorous. But it is predictable when you understand what you're building toward.
Step 1: Clean the technical issues first. That's the foundation. Broken links, redirect chains, noindex errors, duplicate title tags, fix these before anything else. You can't build authority on a technically broken foundation.
Step 2: Build internal linking with intention. Create a clear hierarchy so Google understands which pages matter most. Every important page should be reachable within a few clicks from the homepage.
Step 3: Submit a clean, curated sitemap. Only your best pages. Only pages you want indexed. No redirects. No noindexed pages. No junk.
Step 4: Start generating quality backlinks. Even a few authoritative inbound links significantly affect the budget Google assigns. This is the outside signal that tells Google your site is worth a closer look.
Step 5: Watch Google Search Console. Crawl data in GSC is one of the clearest early signals that things are moving. When you see pages moving from 'discovered, not currently indexed' to 'indexed and appearing in search', that's confirmation that Google is starting to trust the site.
It is not fast. I will never tell you it's fast. But every week spent on a technically broken foundation is a week of potential rankings you won't get back.
Your Goal This Week
Open Google Search Console and pull your Coverage report. Look specifically for pages listed as:
- "Discovered, currently not indexed", Google found these but decided not to index them yet
- "Crawled, currently not indexed", Google visited but didn't find them worth indexing
- "Excluded by noindex tag", pages being intentionally or accidentally blocked
If you have a significant number of pages in any of these categories, you have a crawl health issue that warrants investigation. The checklist from Episode 6 of Ready For Traction walks you through exactly what to look at first.
|
|
🎙 Episode 6 of Ready For Traction is out now.
It's the full audio version of everything in this newsletter, plus the restaurant analogy delivered the way it deserves to be, plus a free crawl health checklist in the show notes.
|
Thank you for reading and have a killer week!