Skip to main content

Mastering Search Engine Control: How to Use NoIndex, NoFollow, and Disallow Directives for SEO Optimization

🔹 Learn how to control which of your site pages are crawled, indexed, and served in search results using NoIndex, NoFollow, and Disallow directives.

Updated over a week ago

The NoIndex, NoFollow, and Disallow directives tell search engines how to interact with your site’s content:

  • NoIndex — Don’t add my page to search results.

  • NoFollow — Don’t look at the links on this page.

  • Disallow — Don’t look at this page at all.

These tools help you manage how search engines discover, store, and display your site content.

⏰ Step-by-Step Instructions

🧠 What Does “NoIndex” Mean?

The noindex directive tells crawlers not to include a webpage in search results.
Indexing is how Google scans (“crawls”) and stores new content for its search index.

How to Mark a Page NoIndex

There are two methods:

  1. Add a noindex meta tag in the HTML.

  2. Return a noindex header in the HTTP request.

Examples:

  • Block most search engines:

    <meta name="robots" content="noindex">
  • Block only Google:

    <meta name="googlebot" content="noindex">

⚠️ Note: As of September 2019, Google no longer respects noindex directives in robots.txt. Always use meta tags or HTTP headers.

🔗 What Is the Difference Between NoIndex and NoFollow?

  • NoIndex — Page-level; prevents a page from appearing in search results.

  • NoFollow — Page or link-level; prevents crawlers from following links.

NoFollow as a Page Attribute

<meta name="robots" content="nofollow">

NoFollow as a Link Attribute

<a href="https://domain.com" rel="nofollow">

Google treats NoFollow links as “hints” since March 2020, meaning they may still influence site authority.

💡 Why Should You Use NoFollow?

Use page-level NoFollow only when:

  • You don’t control posted content (e.g., user-generated content).

  • You want to prevent Google from crawling potentially harmful links.
    Otherwise, apply NoFollow at the individual link level.

🧭 When to Use NoIndex Pages

Mark pages as NoIndex if they offer little value to users or cause duplicate content.

Common NoIndex candidates:

  • Pagination pages (/category/resultspage=2)

  • Internal search pages

  • Ad-optimized landing pages

  • Archived author pages

  • Checkout or confirmation pages

  • Plugin-generated or admin pages

⚙️ Marking a Page as NoIndex and NoFollow

A NoIndex + NoFollow combination:

  • Prevents the page from appearing in search results.

  • Stops crawlers from exploring the page’s links.

🔄 Marking an Already Indexed Page as NoIndex

If a page is already indexed, mark it NoIndex and ensure it’s not blocked in robots.txt, or search engines won’t see the tag.

🚫 How to Stop Search Engines from Indexing Your Site

  1. Apply the NoIndex directive (via meta tag or HTTP header).

  2. Request recrawling in Google Search Console.

  3. Confirm removal by searching site:yourdomain.com/page.

  4. If not removed, check that no Disallow directive blocks the page.

Example for disallowing one exact URL:

Disallow: /page$

🗑️ How to Remove a Page from Google Search

If you own the site, use the Webmaster URL Removal Tool (temporary for ~90 days).

For a permanent solution, use NoIndex, robots.txt, or remove the page entirely.

If you don’t own the site, request removal from Google if it:

  • Contains personal information

  • Hosts malware/phishing content

  • Violates the law or copyright

🧱 Should You NoIndex Category Pages?

Generally no — category pages often drive strong organic traffic.
Only consider NoIndex if you have duplicative or auto-generated category pages.

Properly tagged category pages often rank for hundreds of keywords and attract significant organic traffic.

🌐 How to Stop Google from Indexing Subdomains

You can block subdomains using:

  • Password protection (.htpasswd)

  • Robots.txt file

  • NoIndex directives

  • 404 responses

Password Example: Blocks crawlers and unauthorized users.
Used for:

dev.domain.com   staging.domain.com   testing.domain.com   qa.domain.com   uat.domain.com

Robots.txt Example:

User-agent: * Disallow: /

For selective blocking, use NoIndex tags on specific pages or directory-level Disallow rules

🔍 How to Check if Your Pages Are NoIndexed

Accidentally NoIndexing important pages can hurt rankings.

On WordPress

  • Go to Settings → Reading → Search Engine Visibility.

  • Disable “Discourage search engines from indexing this site.”

  • For full privacy, use password protection in cPanel or plugins.

On Squarespace

Use Code Injection to add or remove NoIndex tags manually.
Alternatively, password-protect sensitive content.

On Wix

In Menus & Pages, uncheck “Show this page in search results.”
Note: Hiding a page from menus does not hide it from search.

✅ Closing Note

Proper use of NoIndex, NoFollow, and Disallow directives gives you full control over your site’s visibility, protects sensitive content, and optimizes crawl efficiency.

Did this answer your question?