The ultimate guide to bot herding and spider wrangling — Part 3

This article was originally published under Search Engine Land.

In parts one and two of this series, we learned what bots are and why crawl budgets are important. In the third and final segment, we’ll review common coding, mobile and localization issues bots may encounter on their journey to let the search engines know what’s important on your site.

Common coding issues

Good, clean code is important if you want organic rankings. Unfortunately, small mistakes can confuse crawlers and lead to serious handicaps in search results.

Here are a few basic ones to look out for:

Infinite spaces (also known as spider traps). Poor coding can sometimes unintentionally result in “infinite spaces” or “spider traps.”

Some issues can cause the spider to get stuck in a loop that can quickly exhaust your crawl budget. These include endless uniform resource locators (URLs) pointing to the same content; pages with the same information presented in a number of ways (e.g., dozens of ways to sort a list of products); or calendars that contain an infinity of different dates.

Mistakenly serving up a 200 status code in your hypertext transfer protocol (HTTP) header of 404 error pages is another way to present to bots a website that has no finite boundaries. Relying on Googlebot to correctly determine all the “soft 404s” is a dangerous game to play with your crawl budget.

When a bot hits large amounts of thin or duplicate content, it will eventually give up, which can mean it never gets to your best content, and you wind up with a stack of useless pages in the index.

Finding spider traps can sometimes be difficult, but using the aforementioned log analyzers or a third-party crawler like Deep Crawl is a good place to start.

What you’re looking for are bot visits that shouldn’t be happening, URLs that shouldn’t exist or substrings that don’t make any sense. Another clue may be URLs with infinitely repeating elements, like:

example.com/shop/shop/shop/shop/shop/shop/shop/shop/shop/…

Embedded content. If you want your site crawled effectively, it’s best to keep things simple. Bots often have trouble with Javascript, frames, Flash and asynchronous JavaScript and XML (AJAX).

Even though Google is getting better at crawling formats like Javascript and AJAX, it’s safest to stick to old-fashioned hypertext markup language (HTML) where you can.

One common example of this is sites that use infinite scroll. While it might improve your usability, it can make it difficult for search engines to properly crawl and index your content. Ensure that each of your article or product pages has a unique URL and is connected via a traditional linking structure, even if it is presented in a scrolling format.

Mobile sites

Google’s announcement of mobile-first indexing in November 2016 sent shockwaves through the search engine optimization (SEO) community. It’s not really surprising when you think about it, since the majority of searches are conducted from mobile devices, and mobile is the future of computing. Google is squarely focused on the mobile versions of pages rather than the desktop versions when it comes to analysis and ranking. This means that bots are looking at your mobile pages before they look at your desktop pages.

Optimize for mobile users first. Gone are the days when a mobile site could be a simplified version of your desktop site. Instead, start by considering the mobile user (and search engine bots) first, and work backward.
Mobile/desktop consistency. Although most mobile sites are now responsive, if you have a separate mobile version of your site, ensure that it has the same internal linking structure, and link bi-directionally between the two sites using rel=alternate and rel=canonical link elements.

Point to the desktop version from the mobile site using rel=canonical and point to the mobile site from the desktop site with rel=alternate. Note that this is an interim solution until you move to responsive design, which is the preferred approach, according to Google.

Accelerated mobile pages. Accelerated mobile pages (AMP) are one of Google’s more controversial inventions, and many webmasters are still hesitant to use them, since it means letting Google host a cached version of your pages on their own domain.

Google’s rationale is that accelerated mobile pages allow them to serve content up more quickly to users, which is vitally important with mobile. While it’s not clear whether Google actually prioritizes accelerated mobile pages over other types of mobile pages in search results, the faster load time could contribute to a higher ranking.

Point to the AMP version of a page using rel=amphtml and point back to the canonical URL from the AMP page using rel=canonical. Note that even though accelerated mobile pages are hosted on a Google URL, they still use up your crawl budget.

Should you block bad bots?

Unfortunately, it’s not only search engines that use bots. They come in all shapes and sizes… and intentions, including those designed to hack, spy, spam and generally do nasty stuff to your website.

Unlike friendly search engine bots, these spiders are more likely to ignore all your instructions and go straight for the jugular. There are still some hacks you can use to keep bad bots out. Be warned, these hacks can be time-consuming, so it might be worth consulting your hosting company on their security solutions if you’re really struggling.

Using htaccess to block internet protocol (IP) addresses. Blocking bad bots can be as simple as adding a “deny” rule to your htaccess file for each bot you want to block. The tricky part here, of course, is actually figuring out what IP address the bot is using.

Some bots might even use several different IPs, meaning you need to block a range of addresses. You also want to make sure you don’t block legitimate IP addresses. Unless you got a list of known IPs to block from a trusted source or you know which page the bot accessed, along with the approximate time or geographical location of the server, you’re likely to spend hours searching through your log files.

Using htaccess to block user agent strings. Another option is to set up a “deny” rule for a specific user agent string. Again, you’ll need a list from a trusted source, or you’ll be sorting through your log files to identify a particular bot, and then add the information to your htaccess file.

Localization

Since bots need to understand what country/regional version of a search engine you want your pages to appear in, you need to make sure your code and content provide consistent cues about where your sites should be indexed.

Hreflang. The hreflang tag (which is actually a type of rel=alternate link element) tells bots what language and region your page is targeting (e.g., en-ca or en-au).

This sounds simple enough, but it can cause a number of headaches. If you have two versions of the same page in different languages, you will need to provide one hreflang tag for each. Those two hreflang tags will need to be included in both pages. If you mess this up, your language targeting could be considered invalid, and your pages might trip the duplicate content filter or not be indexed in the right country version of Google.

Local spellings. While hreflang tags are important, bots are also looking for other clues that guide them on how they should index your site. One thing to be careful of is local spellings. If your page is targeted at a US audience, yet you use UK spellings, it could result in being listed in the wrong country version of Google.
Top-level domains, subdomains or subdirectories for different locations. If you want to make it even clearer to bots that your content is targeted to a specific region, you can use country code top-level domains (ccTLDs), subdomains or subdirectories. For example, the following are various ways to indicate content targeted at Canadian users:

example.ca/category/widget

ca.example.com/category/widget

example.com/ca/category/widget

Conclusion

While many website owners and even some SEOs may think they can wing it with good content and quality backlinks alone, I want to emphasize that many of these small tweaks can have a significant impact on your rankings.

If your site’s not crawled — or crawled badly — your rankings, traffic and sales will ultimately suffer.

Last Updated: August 8, 2025

FREE SEO HIRING BLUEPRINT

Learn How To Hire An Insanely Great SEO In 7 Simple Steps

STEPHAN SPENCER

Since coming into his own power and having a life-changing spiritual awakening, Stephan is on a mission. He is devoted to curiosity, reason, wonder, and most importantly, a connection with God and the unseen world. He has one agenda: revealing light in everything he does. A self-proclaimed geek who went on to pioneer the world of SEO and make a name for himself in the top echelons of marketing circles, Stephan’s journey has taken him from one of career ambition to soul searching and spiritual awakening.

Stephan has created and sold businesses, gone on spiritual quests, and explored the world with Tony Robbins as a part of Tony’s “Platinum Partnership.” He went through a radical personal transformation – from an introverted outlier to a leader in business and personal development.

FREE GUIDES

Download the SEO Hiring Blueprint and SEO Bulls#!t Detector

FREE GUIDE

Download the Grateful Eight

Privacy Policy

This Privacy Policy was last modified on November 8, 2024.

We at Koshkonong, LLC are strongly committed to protecting your privacy and providing a safe online experience for all of our visitors while offering a high-quality user experience here at www.stephanspencer.com (the “Website”), or/and any other websites operated by the Company, including but not limited to:

We know that you care about how the information you provide to us is used and shared. We have developed this Privacy Policy to inform you of our policies regarding the collection, use, and disclosure of Information we receive from users of the Website. Koshkonong, LLC (the “Company”) operates the Website.

This Privacy Policy, along with our Term & Conditions, governs your use of this site. By using www.stephanspencer.com (the “Website”), or by accepting the Terms of Use (via opt-in, checkbox, pop-up, or clicking an email link confirming the same), you agree to be bound by our terms and consent to this Privacy Policy.

Information We Collect

When you visit our Website, we will learn certain information about you.

In order to provide you with the most efficient and enhanced personalized service and attention, and to accommodate certain requests which you make, we request information about you in certain circumstances and collect certain information automatically.

Consistent with our previous practices, we will only collect and process your personal data when we have a lawful basis for doing so. These lawful bases include when you provide us consent, when we have a contractual obligation to collect or process the data, and when we have a legitimate interest in processing your personal data.

We may collect and/or receive the following types of information from you:

Information Provided Voluntarily

Personal Information. You may be asked to provide personal information including your name, address, email address and phone number when you sign up for any of our newsletters, respond to a survey, register for a class, or purchase a product or service. We will only request the personal information that is required in order to fulfill our obligations to you, i.e. in order to deliver what you have requested, as well as to comply with any legal obligations that may accompany such an exchange.

Order & Billing Information. If you place an order through our Website, we track certain information about the products and services you purchase. At checkout, you will also be required to provide additional information required for processing your payment. This generally includes a debit or credit card or other payment information, expiration dates, card security codes, your billing address or similar information. This information is generally processed and captured by third-party vendors, such as payment processors and merchant account providers. To the fullest extent possible, we make an effort not to capture or retain this information ourselves (unless, for example, we are requested to process a refund, follow up on the delivery of goods or services, or work with customers or vendors on resolving any errors or other scenarios.) Third-party vendors who assist us in any of our business functions will also be doing so pursuant to their own terms and conditions, including privacy policies and terms of use.

Information Collected Automatically. In addition to the information described above, we may collect some or all of the following information:

Activity Info (Log Data). Information may be collected based on your use of the Website, which generally includes information about your computer hardware and software, such as:

Internet Protocol (“IP”) addresses, operating systems, browser types, device types, URLS, access dates and times; Website pages that you visit; referring website information; universally unique identifiers (“UUID”), advertising identifier (“IDFA”), carrier and country location, hardware and processor information, network type, and other related data.

Use of Cookies. Our website may use cookies in order to deliver a better experience for you. Cookies are files with small amounts of data that a website stores on your computer’s or mobile device’s hard drive so that certain information about your visit and web-browsing preferences will be recognized upon a return visit. Cookies serve functions such as “remembering” log-in names and passwords, or enabling or saving shopping cart contents. Like many websites, we may use both session cookies (which expire once you close your web browser) and persistent cookies (which stay on your computer until you delete them) to help us improve the experience you have with our Website. Most web browsers have a function that allows you to delete existing cookies on your device or you can set your browser options so that your device does not receive or accept cookies. Doing this may interfere with your ability to use the Website. [ ]

Third-Party Analytics. We use third-party analytics services (such as Google Analytics) to evaluate and aggregate visitor data. These services help us evaluate Visitors’ use of the Website, including time spent on certain pages, which areas of the Website receive the most traffic, how often visitors visit pages within the Website, and also provide general geographic location about visitors, the source of referred traffic (from other websites, vs search engines, etc). These third party analytic services use cookies and other technologies to help analyze and provide us the data. By providing your consent to this Privacy Policy, you consent to the collection and review of data about you by these analytics providers in the manner and for the purposes set out herein.

For more information on Google Analytics, including how to opt out from certain data collection, please visit https://www.google.com/analytics. If you opt out of any service, you may not enjoy the full functionality of the Website.

Other Third Party Tracking Tools. We may also collect or receive information from third parties, such as Facebook, Instagram, LinkedIn, and/or other third-party social media and similar sites.

How to Access Your Personal Information

If you have provided personal, billing or other voluntarily provided information, you may access, review and make changes to it via instructions found on the Website or by emailing us at me [at] stephanspencer [dot] com. To manage your receipt of marketing and non-transactional communications, you may unsubscribe by clicking the “unsubscribe” link located on the bottom of any related email from the Company. Emails related to the purchase or delivery of orders are provided automatically – Customers are not able to opt out of transactional emails. We will try to accommodate any requests related to the management of Personal Information in a timely manner. However, it is not always possible to completely remove or modify information in our databases (for example, if we have a legal obligation to keep it for certain timeframes).

How We Use and Share the Information

We use the collected Information to understand customer needs, including regarding our Website services to:

present you with information through our Website and email services;
provide our services, process orders, and administer our programs;
maintain and improve our Website;
respond to your requests, and provide customer service, including in response to any problems that may arise, such as difficulties in navigating our Website or accessing certain features;
solicit your feedback, and to inform you about our products and services and those of our third-party marketing partners that we feel may be of interest or value to you;
personalize or display advertisements to you on third-party platforms;
improve our services or offerings; and
fulfill our legal or contractual obligations to you.

As with transactions elsewhere, when you purchase products, services, or programs via our Website, your credit card company will also retain certain information regarding your purchase. We will not otherwise provide any personal data to your credit card company without your permission.

We may share the Information collected with joint venture partners, affiliates, agents, current or prospective business partners, businesses under common control, third-party businesses or partners participating in administering our programs or services, or to those providing business functions such as technical support, customer service, marketing assistance, etc. These businesses will only have access to information as necessary to perform their functions and to the extent permitted by law.

In the event of the sale or transfer of our business to a third party buyer, including in the event of bankruptcy or liquidation, we may, as necessary, share your data with the buyer or transferee.

In the event that we have to collect on a debt owed to us by you, we may, as necessary, share your information with an attorney or a court of law to enforce our rights and collect what is owed.

Under certain circumstances we may be obligated or compelled to disclose the Information: when required by law, court order or government agency, or when disclosing such Information is reasonably required to protect the Company, its property, the Website, the safety of visitors or others.

Retargeting / Personalized and Behavioral Ads

We may, using the techniques described above, and in conjunction with third party marketing partners, gather information from your visit to our Website for purposes of providing relevant advertising content to you in the future. This means that through third party marketing partners or third party sites such as Google, Facebook, or Instagram, we may display advertisements to you (remarketing or retargeting), based on usage data collected during your visit to our website.

Google Analytics: As described above, we use Google Analytics on our website. We may utilize any of the following advertising features that utilize the Google advertising cookies: Remarketing with Analytics, Demographics and Interest reporting, Segments, and Double Click integration.

To opt out of seeing personalized ads via the Google platform, please visit https://support.google.com/ads/answer/2662922?hl=en

For more information regarding how Google uses cookies in advertising and how you can control advertising cookies, visit http://www.google.com/policies/technologies/ads/

To personalize the ads you see via the Google platform, visit http://www.google.com/settings/ads

For more information on how Google uses data when you visit its partners’ sites or apps, visit http://www.google.com/policies/privacy/partners/

Or, via an opt-out browser add-on, you can opt out of having your site activity available to Google Analytics. For more information, or to install the opt-out browser ad-on, please visit https://support.google.com/analytics/answer/181881?hl=en Please note that if you opt out of any service, you may not enjoy the full functionality of the Website.

Google Ad Words: We may utilize Google Ad Words, which provides certain of the information collected on our website to the Adwords advertising network. To opt out of Google Ad Words, visit https://support.google.com/ads/answer/2662922?hl=en

Facebook Custom Audience: We may utilize Facebook’s Custom Audience capability which allows us to display ads on Facebook to individuals on our email lists or in our database. We may provide personal information to Facebook, such as your name, email address and phone number to enable Facebook to identify whether you are a Facebook account holder. To opt-out of the Facebook Custom Audience, please email us at me [at] stephanspencer [dot] com with (1)“Facebook Custom Audience Opt-Out” in your Subject Line, and (2) your name and email address in the body of the email.

Other Personalized and Behavioral Advertising Services: We may participate in other retargeting services that are similar to those described above.

Managing Cookies Via Your Browser:

You may be able to change your cookie preferences via your browser settings. Please visit your browser’s help section for assistance with turning on notifications regarding cookies, or disabling cookies through your browser.

Managing Cookies on Your Mobile Device:

You may be able to change your cookie preferences on your mobile device either via your browser settings, or via the settings (preferences) on your device. For additional info, please visit http://www.allaboutcookies.org/mobile/

Use Caution When Sharing Information Online

When you voluntarily make your Personal Information available online in an environment shared by third parties – including in webinars, classes, online conferences, via email, on message boards, chat rooms or on blogs, or via telephone calls or conferences – that information can be viewed, saved, collected, heard, and/or used by others outside of the Company. We are not responsible for any unauthorized third-party use of information provided in these contexts. Please be mindful whenever you share any information online.

Security of Your Information

The security of your Personal Information is important to us, but remember that no method of transmission over the Internet, or method of electronic storage, is 100% secure. While we incorporate standard industry practices internally and with our services providers, which we believe is sufficient for the information involved, we cannot guarantee its absolute security. Because we work with third-party businesses and vendors in various aspects of our business including operating this website, database management, website security, etc., we cannot guarantee the absolute security of our databases, nor can we guarantee that the Information you supply will not be intercepted while being transmitted to and from us over the Internet. Any information that you provide to us via email should be treated with extra caution, as we cannot control the level of security available through email providers.

Children

We are strongly committed to protecting the safety and privacy of children who visit our website. We do not knowingly collect Personal Information from children under the age of 13 through the Website. If you are under 13, please do not give us any Personal Information. We encourage all parents to talk to their children about online safety and to monitor their children’s use of the Internet. If you have reason to believe that a child under the age of 13 has provided Personal Information to us, please contact us, and we will make best efforts to immediately delete that information from our databases.

External Websites

While we have carefully chosen those vendors with whom we work, especially those involved in the transmission of data on behalf of our business, we do not control and are not responsible for the privacy practices or content of third-party websites, including those of affiliates, business partners, sponsors, advertisers, or other websites to which we may link from time to time. When visiting any third-party websites, you are responsible for reviewing the privacy policy and terms of use applicable to each site. They may be different than those that you see here.

California Privacy Rights

Pursuant to Section 1798.83 of the California Civil Code, residents of California have the right to request from a business with whom the Californian has an established business relationship, certain information with respect to the types of personal information the business shares with third parties for direct marketing purposes by such third party and the identities of the third parties with whom the business has shared such information during the immediately preceding calendar year. To request a copy of the information disclosure provided by the Company pursuant to this provision, please contact us via email at me [at] stephanspencer [dot] com.

Our California Do Not Track Notice

Consistent with our policies described above, we collect information from our visitors on our website and across third party websites where applicable, to provide relevant content and advertising. We do not support Do Not Track (“DNT”) signals of web browsers.

DNT is a setting in your web browser that informs websites that you do not want to be tracked. You can enable or disable DNT through the Preferences or Settings options of your web browser.

Visitors Outside the U.S.

Our Website and the servers that make this Website available worldwide are located in the United States. The Internet laws in the United States govern all matters relating to this Website. Any information you provide in subscribing to or visiting our Website will be transferred to the United States. By visiting our Website and submitting information, you authorize this transfer, processing, and use.

EU Visitors and the GDPR

If you live in the EU, certain of your data may be subject to protection by the General Data Protection Regulation (the “GDPR”), a privacy regulation intended to help you have greater control over your personal data.

Under the GDPR, you have the following rights, which we extend to all of our website visitors:

to know what information is being collected, as disclosed in this Privacy Policy (“right to transparent information”);
to object on grounds relating to your particular situation to the collection or processing of certain kinds of information (“right to object”);
to withdraw consent even after it has been given, without affecting the lawfulness of the processing of your data prior to your withdrawal (“right to withdraw consent”);
to access, modify or update your information so that it is correct (“right to access” and “right to rectification”);
to have your information deleted or erased (“right to erasure” or “right to be forgotten”);
to have your data transferred or ported elsewhere (“right to data portability”);
and the right to restrict processing in certain situations (“right to restriction of processing”).

For purposes of the GDPR, to the fullest extent possible we do not directly collect or maintain personal data of our EU visitors or clients. This is done through partnerships with third-party vendors who have systems designed for the collection and storage of data on behalf of their clients (“data processors”). The data is only kept as long as is reasonably required to accomplish the purpose or purposes for which it was collected. Certain data may be expunged or utilized more quickly than others, and depending on your engagement and behaviors, including purchasing behaviors, we may be engaged in a business or contractual relationship with some longevity. (For example, if you purchase a course or program, or services that span over some time, you will continue to receive communications in regards to those offerings or services).

The categories of data processors with whom we work, and that help us in the collection of information that allows us to serve and interact with you include website analytics provider(s), website and web form plug-ins and related third-party services that allow us to maintain an up-to-date online platform and communicate with you electronically, an email service provider or providers to assist in managing email correspondence with our visitors, customers and list, as well as payment processor(s) and/or merchant account(s) to help us transact sales online and follow up with you regarding any purchases. For a list of specific data processors with whom we currently work, please visit our list here.

Those within the Company that may have access to the data include Officers, Directors, Managers, Supervisors, Salespersons, or those providing customer or technical support. Access to the data is provided for specific purposes, generally related to communicating with you, fulfilling our contractual obligations, providing customer service or technical support, collecting payment, or monitoring performance of marketing and sales efforts.

You may contact us at any time to have your information with us or any of these processors updated or corrected, deleted, or to obtain a copy for your records. Additionally, you may opt out of receiving any additional emails or marketing messages from us by clicking the “unsubscribe” link at the bottom of our email notifications.

Further, if you feel that we have not complied with the relevant data protection regulations, please contact us by sending us an email at me [at] stephanspencer [dot] com with your first and last name, your country of origin, and a summary of your concern or complaint. We will endeavor to respond at our earliest possible convenience to address your concern. If contacting us does not resolve your complaint, you may have additional options.

Residents in Designated Countries may also have the right to lodge a complaint with the relevant EU data protection authority. You may access a list of the Data Protection Authorities in the EU here.

Changes to This Privacy Policy

The Company may update this Privacy Policy from time to time as necessary to protect our users and to comply with a changing environment. Please review the Privacy Policy when you visit our Website to remain updated on our current policy. We have no intention of making any changes to our Privacy Policy and practices to make them less protective of personal information collected in the past. By accessing the Website and/or using our services after making any such changes to this Privacy Policy, you are deemed to have accepted such changes. Please be aware that, to the extent permitted by applicable law, our use of the Information is governed by the Privacy Policy in effect at the time we collect the information. You are advised to review this Privacy Policy periodically for any changes. Whenever we make changes to this Privacy Policy we will update the date at the top.

Contact Us

If you have any questions about this Privacy Policy, please contact us via email at me [at] stephanspencer [dot] com with “Privacy Policy” in the subject line.

The ultimate guide to bot herding and spider wrangling — Part 3

Common coding issues

Mobile sites

Should you block bad bots?

Localization

Conclusion

FREE SEO HIRING BLUEPRINT

STEPHAN SPENCER

How to Have a Conversation with ChatGPT And Actually Get Results Worth Using

Is Google’s AI Telling the Truth? Here’s How to Fact-Check It

SEO: Best Practices, Bad Practices

The Grateful Eight

Are you and your team spending at least a half hour a day in AI tools?

Are you working with a marketing agency that has an AI first mindset?

Are you actively working to disrupt yourself before you get disrupted by your competitors?

In a 1-10 Scale, how urgent is it for you to solve the problem you have today?

What do you need right now?

Which Best Describes You?

In a 1-10 Scale, how urgent is it for you to solve the problem you have today?

What do you need right now?

The ultimate guide to bot herding and spider wrangling — Part 3

Common coding issues

Mobile sites

Should you block bad bots?

Localization

Conclusion

FREE SEO HIRING BLUEPRINT

STEPHAN SPENCER

Check Out These Related Posts

How to Have a Conversation with ChatGPT And Actually Get Results Worth Using

Is Google’s AI Telling the Truth? Here’s How to Fact-Check It

SEO: Best Practices, Bad Practices

The Grateful Eight

Download the SEO Hiring Blueprint and SEO Bulls#!t Detector

Download the Grateful Eight

Are you and your team spending at least a half hour a day in AI tools?

Are you working with a marketing agency that has an AI first mindset?

Are you actively working to disrupt yourself before you get disrupted by your competitors?

In a 1-10 Scale, how urgent is it for you to solve the problem you have today?

What do you need right now?

Which Best Describes You?

In a 1-10 Scale, how urgent is it for you to solve the problem you have today?

What do you need right now?

Privacy Policy