IP Address Crawlable
Good day,
It seems Google is crawling our website under its Amazon EC2 IP address and hostname as well as under the domain, e.g.:
example.com/blog/
ec2-54-88-xx-xxx.compute-1.amazonaws.com/blog/
54.88.xx.xxx/blog/
How can we make sure only the domain is crawled and accessible?
If your domain name resolves to an IP (and it must, for anyone to be able to reach the website by its domain name), the DNS is probably correct. I am intrigued as to how the /blog website is reachable via the hostname and the IP in the first place. This might be better addressed by your web host. As far as I know, the only real way to influence what the various robots crawl is a robots.txt file (or crafted meta tags in the head of each page) that excludes certain folders or files from being crawled. If you put an IP into this file, I believe it would be ignored, and you need to remember that not all robots respect the robots.txt or meta tag conventions anyway! Hope this helps.
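To illustrate the point about IPs in robots.txt, here is a minimal sketch using Python's standard-library parser. It only shows how that one parser behaves, which may differ from Googlebot or other crawlers. The address 203.0.113.10 is a documentation placeholder and the helper name `allowed` is hypothetical; neither comes from the original post.

```python
from urllib.robotparser import RobotFileParser


def allowed(rules: str, url: str, agent: str = "*") -> bool:
    """Parse robots.txt text and report whether `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return parser.can_fetch(agent, url)


# A rule that names an IP instead of a path: this parser compares rules
# against the URL path only, so the rule never matches anything.
ip_rule = "User-agent: *\nDisallow: 203.0.113.10\n"

# A path rule: it applies to /blog/ no matter which host served the file.
path_rule = "User-agent: *\nDisallow: /blog/\n"

print(allowed(ip_rule, "http://203.0.113.10/blog/"))    # IP rule has no effect
print(allowed(path_rule, "http://example.com/blog/"))   # blocked on the domain
print(allowed(path_rule, "http://203.0.113.10/blog/"))  # blocked on the IP too
```

In this parser a path rule blocks /blog/ on every hostname that serves the same file, so a single static robots.txt cannot hide the IP copy while leaving the domain copy crawlable.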
Thanks, I will continue to investigate and post back here once I have a solution. In the meantime, I hope that Google doesn't penalize us for this.
Hello, you should add a 301 redirect to a single URL. Once you 301 redirect everything, all the URLs currently indexed under the IP will eventually drop out in favor of the same URLs on the domain you redirected them to. You can use the redirect in the following link to do this. - Removed -
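Since the linked redirect was removed, here is a rough sketch of the idea in Python, written as WSGI middleware. It is an illustration only, not the rule from the removed link: the names `canonical_redirect` and `redirect_middleware` are hypothetical, the `https` scheme is an assumption, and `example.com` is taken from the question. In practice this kind of redirect is often configured in the web server itself.

```python
from typing import Optional

CANONICAL_HOST = "example.com"  # domain from the question


def canonical_redirect(host_header: str, path: str,
                       scheme: str = "https") -> Optional[str]:
    """Return the URL to 301-redirect to, or None if the host is canonical.

    The scheme is an assumption; use whatever the site actually serves.
    """
    # Drop any port, lowercase, and strip a trailing dot before comparing.
    host = host_header.split(":", 1)[0].strip().lower().rstrip(".")
    if host == CANONICAL_HOST:
        return None
    return f"{scheme}://{CANONICAL_HOST}{path}"


def redirect_middleware(app):
    """Wrap a WSGI app so requests for any other host get a 301."""
    def wrapped(environ, start_response):
        path = environ.get("PATH_INFO", "/") or "/"
        query = environ.get("QUERY_STRING", "")
        if query:
            path += "?" + query
        target = canonical_redirect(environ.get("HTTP_HOST", ""), path)
        if target is None:
            # Request already uses the domain: pass it through unchanged.
            return app(environ, start_response)
        start_response("301 Moved Permanently",
                       [("Location", target), ("Content-Length", "0")])
        return [b""]
    return wrapped
```

With this in place, a request for /blog/ that arrives under the IP or the EC2 hostname would be answered with a 301 pointing at the same path on example.com, while requests that already use the domain are served normally.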