Blocking Default Launch Domains From Google Search

Each project hosted on Contentstack Launch automatically receives a default domain (for example, your-site.contentstackapps.com).

Even if your website uses a custom domain, search engines may still crawl and index the default Launch domain. This can create duplicate content and SEO issues.

This guide explains several ways to prevent search engines from indexing your default Launch domain, such as:

robots.txt
X-Robots-Tag headers
403 Forbidden responses

This guide also covers how to protect your setup when using an external CDN in front of Launch.

Methods to Block Default Domain Indexing

Tip: This approach is recommended for sites that use the custom domain feature on Launch.

Dynamic robots.txt for Default Domain

Serve a restrictive robots.txt file dynamically for your Launch default domain (*.contentstackapps.com). This keeps your custom domain unaffected.

Example (Edge Function):

export default async function handler(request) {
  const url = new URL(request.url);
  
  if (url.pathname === '/robots.txt' && url.hostname.endsWith('contentstackapps.com')) {
    return new Response(`User-agent: *
Disallow: /`, {
      status: 200,
      headers: {
        'Content-Type': 'text/plain',
        'Cache-Control': 'no-cache, no-store, must-revalidate'
      }
    });
  }
  return fetch(request);
}

This method serves a robots.txt file only on the default Launch domain, so your custom domain stays unaffected.

Limitation:

This method relies on crawlers honoring the robots.txt file.

Using the X-Robots-Tag Header
Use the X-Robots-Tag response header with the noindex, nofollow to stop search engines from indexing your Launch default domain or following its links.
Example (Edge Function):
```
export default async function handler(request, context) {
  const url = new URL(request.url);
   if (url.hostname.includes('contentstackapps.com')) {
    const response = await context.next();
    response.headers.set('X-Robots-Tag', 'noindex, nofollow');
    return response;
  }
    return context.next();
}
```
This instructs crawlers not to index or follow any content served from your default Launch domain.
Limitation:
- This method relies on crawlers honoring the X-Robots-Tag header.
Returning a 403 Forbidden Response
A stricter option is to block all access to your Launch default domain, while still allowing normal traffic on your custom domain.
Example (Edge Function):
```
export default async function handler(request) {
  const currentUrl = new URL(request.url);
  const hostname = currentUrl.hostname;
    if (hostname.includes('contentstackapps.com')) {
    return new Response('Forbidden', {
      status: 403,
      statusText: 'Forbidden',
    });
  }
  return fetch(request);
}
```
This blocks all traffic to any Launch default domain that contains contentstackapps.com in its hostname, while keeping your custom domain fully accessible.
Additional Resource: For a working implementation example, refer to the GitHub repository: contentstack-launch-examples/launch-edge-default-domain-blocking

Using an External CDN (CDN-over-CDN setup)

If your site uses an external CDN (like CloudFront, Fastly, or Akamai) that proxies traffic to your Launch default domain, secure it by authenticating those requests. This setup allows only authorized CDN traffic to reach Launch and blocks direct crawler access.

Steps for Execution

Add a custom header in your CDN configuration:
X-Launch-Auth-Key: <unique-secret-value>

Create an Edge Function in Launch:

export default async function handler(request, context) {
  const authKey = request.headers.get('X-Launch-Auth-Key');
  const validKey = 'YOUR_SECRET_VALUE';   // must match CDN configuration
  if (authKey !== validKey) {
    return new Response('Forbidden', { status: 403 });
  }
  return context.next();
}

This ensures that only requests with a valid header are served by Launch, effectively preventing direct crawler access to your default domain.

Blocking Default Launch Domains From Google Search

Methods to Block Default Domain Indexing

Dynamic robots.txt for Default Domain

Limitation:

Using the X-Robots-Tag Header

Limitation:

Returning a 403 Forbidden Response

Using an External CDN (CDN-over-CDN setup)

Steps for Execution

Add API Endpoints to a Website in Launch

Deploy a Project from a Monorepo on Launch

Disable Automatic Redeployment

Host a Static Site on Launch

Redeploy Automatically when Content is Published on CMS

Setup Production and Non-Production Environment

Auto-populate Environment Variables from a Linked Stack

Repair GitHub Connection for Projects

Repair Git Provider Connection for Projects

Host any SSR Framework on Launch

Trigger Deployments on Launch Platform based on Tags/Releases

Caching Guide for Contentstack Launch

Handling Next.js RSC Issues on Launch

IP-based Access Control Using Edge Functions in Launch

Blocking AI Crawlers

Troubleshooting Launch Response Error Codes

Can I Run a Docker Container on Launch?