12, October 2008

blocking robots with robots.txt - webmaster forum

 
Webdigity webmaster forums

Author Topic: blocking robots with robots.txt  (Read 1488 times)
Global Moderator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 6349
38918 credits
Members referred : 374


It's time to use PHP5!


« on: Jul 06, 2006, 10:11:15 AM »

Hello,

I just tried a robots.txt validator and was surprised by the result. Apparently this is not OK:

Code:
User-agent: Slurp
User-agent: *
Disallow: /includes/
Disallow: /cache/
Disallow: /classes/download.php
Disallow: /rand_code.php

the validator gives this error:
Line 1    User-agent: Slurp
You specified both the generic user-agent "*" and specific user-agents for this block of code; this could be misinterpreted.
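
For anyone who wants to reproduce that warning locally, here is a minimal sketch of such a check in Python. It is not the validator's actual code; the function name find_mixed_groups is made up for illustration, and it only looks at which User-agent lines share one group.

```python
def find_mixed_groups(robots_txt):
    """Return the agent lists of groups that mix "*" with named bots."""
    groups = []        # one list of agent names per group
    current = []       # agents of the group currently being read
    in_header = False  # True while reading consecutive User-agent lines
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        if field.lower() == "user-agent":
            if not in_header:                # first agent line opens a group
                current = []
                groups.append(current)
                in_header = True
            current.append(value)
        else:
            in_header = False                # any rule line closes the header
    return [g for g in groups if "*" in g and len(g) > 1]


SAMPLE = """\
User-agent: Slurp
User-agent: *
Disallow: /includes/
Disallow: /cache/
"""
print(find_mixed_groups(SAMPLE))  # [['Slurp', '*']]
```

An empty list means no group mixes the generic "*" with a specific bot name.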


By the way, MSN is indexing all the links to the file "/classes/download.php", and I don't know why...


Last blog : Upload images for usage in TinyMCE
I am a metal monkey!
Administrator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 8116
41653 credits
Members referred : 3



« Reply #1 on: Jul 06, 2006, 11:16:05 AM »

This is the right one (if I understand correctly what you want to do):

Code:
User-agent: *
Disallow: /includes/
Disallow: /cache/
Disallow: /classes/download.php
Disallow: /rand_code.php

or

Code:
User-agent: *
Disallow: /includes/
Disallow: /cache/
Disallow: /classes/download.php
Disallow: /rand_code.php

User-agent: Slurp
Disallow: /includes/
Disallow: /cache/
Disallow: /classes/download.php
Disallow: /rand_code.php

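Both do the same thing: a bot that has no group of its own falls back to the "*" group, so Slurp gets identical rules either way.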
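
If you would rather test that than take it on faith, here is a rough sketch using Python's standard urllib.robotparser. The agent name "msnbot" and the sample URLs are just assumptions picked for the test, and real crawlers may interpret edge cases differently from Python's parser.

```python
from urllib.robotparser import RobotFileParser

PATHS = ["/includes/", "/cache/", "/classes/download.php", "/rand_code.php"]


def group(agent):
    """Build one robots.txt group that blocks PATHS for the given agent."""
    lines = [f"User-agent: {agent}"] + [f"Disallow: {p}" for p in PATHS]
    return "\n".join(lines)


def load(text):
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def verdicts(parser, agents, urls):
    """Map every (agent, url) pair to True (may fetch) or False (blocked)."""
    return {(a, u): parser.can_fetch(a, u) for a in agents for u in urls}


short_form = load(group("*"))                           # first variant
long_form = load(group("*") + "\n\n" + group("Slurp"))  # second variant

AGENTS = ["Slurp", "msnbot"]
URLS = ["/", "/index.php", "/cache/page.html", "/classes/download.php"]
same = verdicts(short_form, AGENTS, URLS) == verdicts(long_form, AGENTS, URLS)
print("identical verdicts:", same)
```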

Trial and Error my two best teachers Cool
Join us @ facebook

Last blog : Free Unlimited Bandwith and disk space to good to be true?
Global Moderator
« Reply #2 on: Jul 06, 2006, 11:40:17 AM »

I want to exclude the Slurp bot from all pages, and all other bots from only the listed files/folders...


I am a metal monkey!
Administrator
« Reply #3 on: Jul 06, 2006, 11:50:55 AM »

Then the correct one is:

Code:
User-agent: *
Disallow: /includes/
Disallow: /cache/
Disallow: /classes/download.php
Disallow: /rand_code.php

User-agent: Slurp
Disallow: /
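
Note that a Disallow value is a path prefix, so "/" is the value that matches every URL on the site. Here is a small sketch to sanity-check a two-group file like this with Python's standard urllib.robotparser, assuming the Slurp group uses "Disallow: /". The agent name "msnbot" and the test paths are only examples.

```python
from urllib.robotparser import RobotFileParser

ROBOTS_TXT = """\
User-agent: *
Disallow: /includes/
Disallow: /cache/
Disallow: /classes/download.php
Disallow: /rand_code.php

User-agent: Slurp
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())  # parse the text directly, no HTTP request


def allowed(agent, path):
    """True if the given agent may fetch the path under ROBOTS_TXT."""
    return parser.can_fetch(agent, path)


# Slurp should be blocked everywhere; other bots only on the listed paths.
for agent, path in [("Slurp", "/index.php"), ("msnbot", "/index.php"),
                    ("msnbot", "/cache/page.html")]:
    print(agent, path, "allowed" if allowed(agent, path) else "blocked")
```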

Global Moderator
« Reply #4 on: Jul 06, 2006, 12:12:54 PM »

Then the correct one is :

Code:
User-agent: *
Disallow: /includes/
Disallow: /cache/
Disallow: /classes/download.php
Disallow: /rand_code.php

User-agent: Slurp
Disallow: /
Oeps...


Bill Gates is my home boy
*****
Gender: Female
Posts: 622
3877 credits
Members referred : 2



« Reply #5 on: Jul 06, 2006, 05:35:31 PM »

Wrote this a few days ago, maybe it will help.

http://www.helpforwebbeginners.com/webmasters/robot-exclusion-standard.html

PS. Hey Nik, isn't there some way to use link text instead of this giant URL?

www.yourmessageconsultant.com, providing online content and printed marketing materials.
www.helpforwebbeginners.com, tutorials and how-tos for new webmasters.
www.CraftyTips.com, a unique Arts & Crafts Directory
www.nocans.com - Pet Food Recipe Site
www.petsiteguides.com - A New Pet Directory

Last blog : Back to Crafting
I am a metal monkey!
Administrator
« Reply #6 on: Jul 06, 2006, 05:38:31 PM »

Wrote this a few days ago, maybe it will help.

http://www.helpforwebbeginners.com/webmasters/robot-exclusion-standard.html

PS. Hey Nik, isn't there some way to use link text instead of this giant URL?

Thanks for the article. You can write [ url = www....... ] anchor [ /url ] Wink

Bill Gates is my home boy
« Reply #7 on: Jul 06, 2006, 05:57:06 PM »

OK, now I've got this working.

Here's the link to the article.

Robot Exclusion Standard


Kill the googlebot
*
Gender: Male
Posts: 6
40 credits
Members referred : 0



« Reply #8 on: Feb 02, 2007, 03:30:56 PM »

G'day gang  Cool
I came across a website optimization tool that does heaps of things to keep your website free of errors and such. I ran it on one of my websites and it came up with a "No Archive" tag for the robots. That means my whole site DOES NOT get indexed by Googlebot etc. Is that correct? Why on earth would it be there in the first place? The reason given by the optimization tool was that it is usually spammers who do that so they don't get caught... I have no clue. I bought the website premade and NOW I know why I haven't made a sale on it yet. Slow page loads and over 450 warnings on it  Sad   Lucky I paid very little for it, I guess... lol. If anyone wants to know which optimization tool I use, or if you know of one I could check out, please e-mail me or reply to this post.

                           Cool Yikes  Cool
I am a metal monkey!
Administrator
« Reply #9 on: Feb 02, 2007, 03:39:18 PM »

Maybe the site is banned from google?

If your robots.txt file is ok you shouldn't have indexing problems with any search engine.

Regarding spammers, check this thread Wink
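
On the "No Archive" question itself: a "noarchive" directive normally only asks search engines not to offer a cached copy of the page, while "noindex" is the one that keeps a page out of the results. Here is a rough sketch for listing which directives a page's robots meta tags contain, using Python's standard html.parser. The class name, function name and sample HTML are made up for illustration.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from <meta name="robots" ...> style tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {name: (value or "") for name, value in attrs}
        # "googlebot" is checked too, since some pages target it by name.
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            for item in attr.get("content", "").split(","):
                if item.strip():
                    self.directives.add(item.strip().lower())


def robots_directives(html):
    """Return the set of lower-cased robots meta directives in the HTML."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


PAGE = '<html><head><meta name="robots" content="NOARCHIVE, follow"></head></html>'
found = robots_directives(PAGE)
print("noindex" in found, "noarchive" in found)  # False True
```

If "noindex" is not in the set, the meta tags are not what is keeping the page out of the index.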

Kill the googlebot
« Reply #10 on: Feb 02, 2007, 05:58:57 PM »

[tr]Maybe the site is banned from google?[/tr]

 Banned? I only just bought it in early December. I sure as hell didn't spam anyone... what do you mean exactly? The website in question is www.kashyhealth.com.au and if you see it, yeah, it "looks" OK, but as far as making $$$ goes... it's a HUGE DUD!!! I've tried to get things changed on it, but it seems they have it set up so I CANNOT do the work myself. I guess if I knew HOW I might be able to, but I think they set it up so that if any work needs doing, they get it done for (I expect) an outrageous price. So much for the company I bought it off "going to make me rich"... lol. I didn't expect to become a millionaire in 2 months, but I sure NEVER expected to have a shopping cart website and not have a sale after 2 months!!! Google did their bit with AdWords; I got burnt badly there, over $600 and no sale. I read that a conversion rate of less than 1% is poor... I don't even have a conversion rate for that site. Later guys, I've depressed myself now... lol      CYA

                                           Cool Yikes  Cool
Kill the googlebot
« Reply #11 on: Feb 02, 2007, 06:09:24 PM »

[tr]Maybe the site is banned from google?[/tr]    Huh well i did that wrong in my last post!!..lol

                                       Cool Yikes  Cool
I am a metal monkey!
Administrator
« Reply #12 on: Feb 03, 2007, 12:43:22 PM »

I don't want to make you feel bad, but these MLM companies are scams. It cost me 4500 euros to realize that; I hope you haven't paid them that much....

Global Moderator
« Reply #13 on: Feb 04, 2007, 12:08:28 PM »

I don't want to make you feel bad, but these MLM companies are scams. It cost me 4500 euros to realize that; I hope you haven't paid them that much....

Nick, sometimes it is better to ignore a thread Cheesy


Tim Nash
Global Moderator
Community Supporter ?
Internet Junkie
*****
Posts: 2173
5036 credits
Members referred : 2


Venture Skills - New Media & IT group


« Reply #14 on: Feb 04, 2007, 12:39:04 PM »

Ouch Yikes, I hate seeing people ripped off. Send me a PM and I will get one of our boys to do a site review, see if we can find the problem and get you reindexed, no charge Smiley

Would you like to be an SEO? Let me help with The Tim Nash introduction to SEO. Alternatively, for social media optimisation, take a look at the Venture Skills Blog.

Last blog : Its all in the mp3s
Chicken-run Manager
*
Posts: 9
62 credits
Members referred : 0


« Reply #15 on: Apr 12, 2007, 10:36:02 AM »

Emmm,

just looking around. I'm a newbie here
Global Moderator
« Reply #16 on: Apr 12, 2007, 10:39:02 AM »

Emmm,

just looking around. I'm a newbie here

don't post if you have nothing to say!

