Web crawlers, Regex for Markdown URLs, and Removing your site from Google search results

Today I learned:

Web Crawlers

Need a web crawler but don’t want to write one?


Getting pages removed from Google cache

Have an old site that you need to keep live but don’t want showing up in Google search results? Here are a few things you need to do:

  1. Change the robots.txt to block crawlers, or password-protect your site, so that search engines stop crawling and indexing it.
  2. Log in to Google Webmaster Tools (now Google Search Console) and submit the site to the URL Removal tool.
  3. Finish what you need the site up for ASAP, then take it offline.
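
As a rough way to sanity-check step 1, here is a minimal Python sketch that parses a robots.txt and confirms it blocks a crawler. The robots.txt contents, the `is_blocked` helper, and the example.com URL are all assumptions for illustration; they are not taken from any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that asks every crawler to stay out of the whole site.
ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def is_blocked(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if robots_txt disallows user_agent from fetching url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Placeholder URL; swap in a page from the site you are hiding.
    print(is_blocked(ROBOTS_TXT, "https://example.com/old-page.html"))
```

This only checks what the file says. Well-behaved crawlers honor it, but it is not access control, which is why password protection is the stronger option.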

Regex for Markdown URLs

This find-and-replace regex matches the links above and turns each one into a Markdown link:

  • Search: ([\w\S]*[mo7b/])$
  • Replace: [\1](\1)
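
To see what this find-and-replace does, here is a small Python sketch. It assumes the search pattern is `([\w\S]*[mo7b/])$` and the replacement is `[\1](\1)`, that is, with the backslashes that appear to have been lost from the text above. The sample URLs and the `linkify` name are placeholders, not the original links.

```python
import re

# Assumed reconstruction of the search pattern: a run of non-space characters
# at the end of a line whose last character is one of m, o, 7, b, or /.
PATTERN = re.compile(r"([\w\S]*[mo7b/])$", re.MULTILINE)


def linkify(text: str) -> str:
    """Wrap each matching end-of-line URL in a Markdown link: [url](url)."""
    return PATTERN.sub(r"[\1](\1)", text)


if __name__ == "__main__":
    # Placeholder URLs standing in for the original list of links.
    sample = "https://example.com\nhttps://example.org/crawler/"
    print(linkify(sample))
```

The character class `[mo7b/]` only covers the endings of the specific links it was written for, so a URL ending in any other character is left untouched.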

