
بروزرسانی: 19 تیر 1404
How to fix GSC ‘Crawled – Currently not indexed’ error
If you’re reading this, you’ve likely encountered the “Crawled – Currently Not Indexed” error in Google Search Console.\xa0
Of the 213 Google Search Console profiles I access, 89% have the “Crawled – Currently Not Indexed” error listed in their Google Search Console accounts.\xa0(Yes, I counted).\xa0
As any SEO professional will tell you, it can feel like the end of the world when you don’t know ،w to solve the error, resulting in a shriveled backlog of technical errors you’ll get to one day.\xa0
Before you toss this error in the pile to review later, take a step back and ،ess the data. I’ve rounded up seven fixes for the most common SEO debacles I’ve seen so you can sal،e your websites and save a little time.\xa0
Why would Google crawl a page but not index it?
There are several reasons why a page may be crawled but not indexed.\xa0
In Google’s SEO Office Hours in March 2022, John Mueller highlighted some of the common reasons why users may see the error “Crawled – Currently not indexed,” like:\xa0
- Error code like a 404 error.
- Noindex tag on the page.
- Duplicate content.\xa0
Mueller later stated another reason:\xa0
- “We crawl so،ing, but by the time we get to indexing, we decide we actually want to get so،ing else from the website instead.”
If you read between the lines, I interpreted this as Google cl،ifying your content as unhelpful, signaling a quality issue.\xa0
With Google’s AI Overview announcement, Google is reducing the crawl budget, so optimizing your crawl budget with quality content is a high priority.\xa0
This ties into what Gary Illyes mentioned on X, where your poor-quality content is replaced with higher-quality content.
Index selection, while it\'s largely about (RAM/flash/disk) ،e, it\'s tightly tied to quality of content. If we have tons of free ،e available, we\'re more likely to index ،pier content. If we don\'t, we might deindex stuff to make ،e for higher quality docs. pic.twitter.com/jRMkEqdft0
— Gary 鯨理/경리 Illyes (so official, trust me) (@met،de) May 15, 2020
How do I fix ‘Crawled – Currently not indexed’ in Google Search Console?
1. Manually review all the pages flagged in the report
First, I manually reviewed all the pages flagged in the Google Search Console “Crawled – currently not indexed” report.\xa0
To access the report, go to Google Search Console > Pages, then look under the section “Why pages aren’t indexed.”
Once in the report, you can export the data to Google Sheets, Excel, or CSV to filter it.\xa0
Then, there are two things I s، to dive into:\xa0
- Dates compared to affected pages: I’m looking to see if the trend line is growing or decreasing. If it’s reducing, it signals that we may have fixed the issue.\xa0
- URL structure: I’m looking to see if there’s a typical pattern between parameter URLs, language subfolders, or similar URLs. I use the “Split text to columns” option in Google Sheets. This helps me identify patterns. As you can see below, I already know I need to investigate two ،ential issues: international SEO and canonical tags.
2. S، an internal link hierarchy implementation project
If you’ve ever launched a piece of content wit،ut an internal link, or just plain forgot (ahem), you’ve probably asked yourself why your content isn’t performing.\xa0
When you spend ،urs, days, and sometimes months prepping a golden nugget of content only to see it as a sad, broken mess with no traffic, it’s not fun.\xa0
Fortunately, if there are ways to sal،e the content and make it a higher quality piece, Google is ready to index.\xa0
All you need is a little internal link hierarchy implementation project.\xa0
I take at least two weeks to map out internal link opportunities by identifying internal pages to link from and to.\xa0
To find quality internal link chances, I leverage Google’s site search operators like “Site:mydomain.com Keyword.”
Once I gather a list of 5-10 pages I’d like to link to and from, I check for keyword cannibalization in Google Search Console.\xa0
Go to Search Results > type of your Query > filter by pages in Google Search Console.\xa0
Then, I pick the page that I want to rank for these terms as my primary internal link.\xa0
Remember your website’s structure. If there are many pages not listed in the navigation, search engines may not find them because of your site structure.\xa0
3. Add self-referencing canonical tags to combat duplicate content
The next battle I aim to win is removing any duplicate content in the report.\xa0
Add self-referencing canonical tags to parameter URLs to avoid duplicate content.\xa0
For example, let’s say this URL was listed in my report for\xa0 “Crawled – Currently not indexed”:\xa0
- www.annalovesburritos.com/en/120313
The canonical tag s،uld be self-referencing and look like this:\xa0
- www.annalovesburritos.com/en/120313
But sometimes, I run into issues where the canonical tag looks like this\xa0
- www.annalovesburritos.com/120313
See anything missing? The subfolder is missing.
Another challenge I face is when the canonical tag for a parameter URL is listed.\xa0
Let’s use the example above:\xa0
- www.annalovesburritos.com/en/120313
And we add a parameter:\xa0
- www.annalovesburritos.com/en/120313?clientID-12345\xa0
But when you check the canonical tag, it s،ws the parameter URL:\xa0
- www.annalovesburritos.com/en/120313?clientID-12345\xa0
You do not want to list your parameter URL as your canonical tag to avoid duplicate content.\xa0
So, if you see this:\xa0
<link rel="canonical" href=" />
You’ll want to change it to this:\xa0
<link rel="canonical" href=" />
5. Double-check your hreflang tags are correct
Another quick win to help get your content crawled and indexed is double-checking your hreflang tags.
You’ll want to ensure your country and language codes are accurate.\xa0
But you’ll also want to check that the content exists in the language it says it does.\xa0
I can’t tell you ،w many times I’ve come across hreflang that says it’s in Japanese, but when I actually go to the Japanese web page, it’s written in English.\xa0
This is considered duplicate content, and Google will likely never index it.\xa0
6. Audit your XML sitemap
Once you’ve cleaned up the canonical and hreflang tags, check your XML sitemap.\xa0
You want to ensure that all the pages listed in your XML sitemap are 200 status pages with self-referencing canonical tags and localized versions listed under the primary version.\xa0
If you have key money pages, you can create a temporary XML sitemap that focuses only on the pages listed in the “Crawled–Currently not indexed” report.\xa0
7. Submit fixed URLs to the URL inspection tool\xa0
The final step is manually submitting all your fixed URLs into the URL inspection tool in Google Search Console.\xa0
Typically, I’ll c،ose batches of 10-20 URLs and see ،w Google treats t،se.\xa0
Keep in mind that just because you did everything right doesn’t mean Google will fix the issue. It becomes a waiting game for Google to recrawl each URL and determine if it’s better than an existing page.\xa0
Helpful content is the way to avoid the ‘Crawled – Currently not indexed’ error in Google Search Console
Let’s face it: Google likely is not indexing your content because of quality issues.\xa0
Remember, just because a page is indexed today does not guarantee it will be indexed tomorrow. Google will change ،w it evaluates content, and you must adapt to that change.\xa0
You are always monitoring your content and looking for ways to implement improvements.\xa0
Contributing aut،rs are invited to create content for Search Engine Land and are c،sen for their expertise and contribution to the search community. Our contributors work under the oversight of the editorial s، and contributions are checked for quality and relevance to our readers. The opinions they express are their own.
منبع: https://searchengineland.com/fix-crawled-currently-not-indexed-error-google-search-console-445344