How to restrict crawling/indexing of specific URLs in Adobe Search and Promote

Home
Search & Promote
How to restrict crawling/indexing of specific URLs in Adobe Search and Promote

April 27, 2020
Estimated Post Reading Time ~

How to restrict crawling/indexing of specific URLs in Adobe Search and Promote

Some cases we may need to index specific types of URLs from the website and excluding all other URLs available.

The URL Masks can be used in Adobe S&P to achieve this.

URL mask will help us to define the rules to include or exclude the specific URLs during the indexing.

We will be able to define include and exclude rules

Include - pattern that specifies the URLs will be indexed
Exclude - pattern that specifies the URLs will be excluded from the indexing.

To index the URLs that is starting with mask.

The crawler will index all the URLs that starts with https://server.com/content/doc

To index the URLs that is in the particular format.

This crawler will index all the URLs matching with - https://server.com/content/doc/*.html?id=*

e.g. https://server.com/content/doc/sample.html?id=123

Regex can be used to match the URLs for indexing

This crawler will index all the URLs matching with the regex ^.*/content/doc/.*\.html$
e.g. https://server.com/content/doc/sample.html

By aem4beginner

AEM Tutorials for Beginners

April 27, 2020
Estimated Post Reading Time ~

How to restrict crawling/indexing of specific URLs in Adobe Search and Promote

No comments:

Post a Comment

Get Posts In Your Inbox

April 27, 2020 Estimated Post Reading Time ~

How to restrict crawling/indexing of specific URLs in Adobe Search and Promote

No comments:

Post a Comment

April 27, 2020
Estimated Post Reading Time ~