28, May 2012

list of all google crawlers? - webmaster forum

 
Webdigity webmaster forums
[ Home | Help | Search | Forum's Shop | Archive | Login | Register | Webmaster Directory ]
Webdigity Webmaster Forums  >  Web site promotion  >  Search Engine Optimization  >  Google SEO
Topic: list of all google crawlers?
« previous next »
Pages: [1] Print
Instabuck - The easy way to sell digital products online

Author Topic: list of all google crawlers?  (Read 4201 times)
Global Moderator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 6691
34714 credits
Members referred : 374


It's time to use PHP5!


« on: Jul 23, 2006, 01:09:01 pm »

Hello,

does someone know a complete list of ranges/ip addresses used by the google webcrawler?

and maybe the same list for MSN?

I am a metal monkey!
Administrator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 5799
46391 credits
Members referred : 3



« Reply #1 on: Jul 23, 2006, 01:12:38 pm »

I think there was a thread about this in the forum, but anyway it is impossible to have a complete list while Google has too many datacenters/ips

Trial and Error my two best teachers Cool
Join us @ facebook or twitter

Last blog : Butterfly Marketing 2.0
Global Moderator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 6691
34714 credits
Members referred : 374


It's time to use PHP5!


« Reply #2 on: Jul 23, 2006, 01:16:08 pm »

I think there was a thread about this in the forum, but anyway it is impossible to have a complete list while Google has too many datacenters/ips

just noticed that this range is from google:
NetRange:   66.249.64.0 - 66.249.95.255

but what is a better way to identify the search engine in your stats the "HTTP_USER_AGENT" ?

Global Moderator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 6691
34714 credits
Members referred : 374


It's time to use PHP5!


« Reply #3 on: Jul 23, 2006, 01:16:54 pm »

Nick, why is thread about seo?

I am a metal monkey!
Administrator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 5799
46391 credits
Members referred : 3



« Reply #4 on: Jul 23, 2006, 01:21:24 pm »

Nick, why is thread about seo?

I guess because crawlers about SEO Smiley

I think there was a thread about this in the forum, but anyway it is impossible to have a complete list while Google has too many datacenters/ips

just noticed that this range is from google:
NetRange:   66.249.64.0 - 66.249.95.255

but what is a better way to identify the search engine in your stats the "HTTP_USER_AGENT" ?


I think this range is from Google ( I remember the 66 in the start), but rather than user agent there is not other good way.

But you can allways check some things. For example google fetches robots.txt anytime it visits a site (at least once per session), also it is visiting only one page per time, and it never sends referer headers.

Trial and Error my two best teachers Cool
Join us @ facebook or twitter

Last blog : Butterfly Marketing 2.0
Global Moderator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 6691
34714 credits
Members referred : 374


It's time to use PHP5!


« Reply #5 on: Jul 23, 2006, 01:24:57 pm »

I need this test for something other, I need a feature to block users to access a page only a limited number of time and I wnat to do that with a test on the ip address (maybe also a cookie). That why I need a check to know who is google. I have hundreds of visits with aa fake "HTTP_USER_AGENT"

I am a metal monkey!
Administrator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 5799
46391 credits
Members referred : 3



« Reply #6 on: Jul 23, 2006, 01:30:57 pm »

Check mod_limitipconn. It does all the work automatically Smiley

Trial and Error my two best teachers Cool
Join us @ facebook or twitter

Last blog : Butterfly Marketing 2.0
Global Moderator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 6691
34714 credits
Members referred : 374


It's time to use PHP5!


« Reply #7 on: Jul 23, 2006, 01:33:21 pm »

Check mod_limitipconn. It does all the work automatically Smiley

can I access the module via php?

I am a metal monkey!
Administrator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 5799
46391 credits
Members referred : 3



« Reply #8 on: Jul 23, 2006, 01:42:02 pm »

Check mod_limitipconn. It does all the work automatically Smiley

can I access the module via php?

No you just set some directives in virtual host or in htaccess.

Here is an example :

Code:
<IfModule mod_limitipconn.c>
    <Location />
        MaxConnPerIP 7
        # exempting images from the connection limit is often a good
        # idea if your web page has lots of inline images, since these
        # pages often generate a flurry of concurrent image requests
        NoIPLimit images/* css/* admin/* js/*
    </Location>
  </IfModule>

Trial and Error my two best teachers Cool
Join us @ facebook or twitter

Last blog : Butterfly Marketing 2.0
Global Moderator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 6691
34714 credits
Members referred : 374


It's time to use PHP5!


« Reply #9 on: Jul 23, 2006, 01:49:52 pm »

This is a nice feature, but I have to prevent users from using a form to often (because they are searchiung for advertisers), that will say I hev to protect only two result lists from a complete website...

I am a metal monkey!
Administrator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 5799
46391 credits
Members referred : 3



« Reply #10 on: Jul 23, 2006, 01:54:37 pm »

If those files exist in the same directory then you can for sure do this with this mod.

Otherwise I am not sure if you can manage it to check only two files instead of the whole site.

Trial and Error my two best teachers Cool
Join us @ facebook or twitter

Last blog : Butterfly Marketing 2.0
Global Moderator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 6691
34714 credits
Members referred : 374


It's time to use PHP5!


« Reply #11 on: Jul 23, 2006, 02:50:49 pm »

If those files exist in the same directory then you can for sure do this with this mod.

Otherwise I am not sure if you can manage it to check only two files instead of the whole site.
I will remember this mod, but for now I need to do something else, thanks

Global Moderator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 6691
34714 credits
Members referred : 374


It's time to use PHP5!


« Reply #12 on: Jul 23, 2006, 02:51:50 pm »

buy the way how does it work if the visitor is surfing via a proxy server?

I am a metal monkey!
Administrator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 5799
46391 credits
Members referred : 3



« Reply #13 on: Jul 23, 2006, 02:53:43 pm »

buy the way how does it work if the visitor is surfing via a proxy server?

There's no problem, as the proxy has one ip adress. The only possible problem could be if there were many visitors using the same proxy at the same time, but this is almost impossible (at least for a site with less traffic than Google Smiley )

Trial and Error my two best teachers Cool
Join us @ facebook or twitter

Last blog : Butterfly Marketing 2.0
Global Moderator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 6691
34714 credits
Members referred : 374


It's time to use PHP5!


« Reply #14 on: Jul 23, 2006, 02:55:02 pm »

buy the way how does it work if the visitor is surfing via a proxy server?

There's no problem, as the proxy has one ip adress. The only possible problem could be if there were many visitors using the same proxy at the same time, but this is almost impossible (at least for a site with less traffic than Google Smiley )

I remember that modem users having 3 diff. ip addresses within 5 minutes... what about them?

I am a metal monkey!
Administrator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 5799
46391 credits
Members referred : 3



« Reply #15 on: Jul 23, 2006, 03:08:26 pm »

buy the way how does it work if the visitor is surfing via a proxy server?

There's no problem, as the proxy has one ip adress. The only possible problem could be if there were many visitors using the same proxy at the same time, but this is almost impossible (at least for a site with less traffic than Google Smiley )

I remember that modem users having 3 diff. ip addresses within 5 minutes... what about them?

I didn't know of this. Dial up users have one ip, and it changes whenever the connection resets.

If someone has multiple ips, this mod can't do anything as it works only with the ip (no sessions involved)

Trial and Error my two best teachers Cool
Join us @ facebook or twitter

Last blog : Butterfly Marketing 2.0
Trackback URI for this entry : http://www.webdigity.com/trackback.php?topic=3378
Tags : google seo proxy robots.txt Bookmark this thread : Digg Del.icio.us Dzone more....

Pages: [1] Print 
Webdigity Webmaster Forums  >  Web site promotion  >  Search Engine Optimization  >  Google SEO
Topic: list of all google crawlers?
« previous next »
Jump to:
User Area
Welcome, Guest. Please login or register.
Did you miss your activation email?
May 28, 2012, 12:27:47 pm





Login with username, password and session length

Donate to our community, and get a permanent link back to your site!

Donate to our community, and get a permanent link back to your site!






Web Design Gallery · Whois Lookup · Pagerank · Tag Browsing · Lo-fi version · Syndication · Webmaster forum history · Advertise
Developed by HumanWorks © 2005 - 2012 Webdigity webmaster community · sublime directory
Webdigity Webmaster Forums | Powered by SMF 1.0.12. © 2001-2005, Lewis Media. All Rights Reserved.