13, May 2008

Feauture requests - webmaster forum

 
Webdigity webmaster forums
This forum shares its ad revenue with its members!
[ Home | Help | Search | Forum's Shop | Archive | Login | Register | Webmaster Directory ]
Webdigity Webmaster Forums  >  WebDigity Community  >  User Forums  >  aStatSpam forum
Topic: Feauture requests
« previous next »
Pages: [1] Print

Author Topic: Feauture requests  (Read 949 times)
I am a metal monkey!
Administrator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 7823
39873 credits
Members referred : 3



« on: Sep 29, 2005, 02:10:01 PM »

As there are many people that using the script, I thought it would be great to ask them what they need to be puted on the script.

Some things I thought, and I'll put them on later versions are here :

  • Abillity to choose the way that the script deals with the spammer ( redirect, deny)
  • Multiple .htaccess files parsing. This is allready implemented for the new version, and actually it is detecting the location of the files.
  • Script updates. The script will check for new versions when it runs
  • I am thinking of creating an additional script that will check the referers for links, in order to be easier to find out who are the spammers
  • Better implementation for the users that run php under safe mode

If you have some advices or requests please share them with us Smiley

Trial and Error my two best teachers Cool
Promote your blog for free.... Visit through proxy

Last blog : Keep it Legal - Tims guide to legal notices
I love Pokemon
*
Gender: Male
Posts: 14
84 credits
Members referred : 0



« Reply #1 on: Mar 28, 2006, 04:16:20 PM »

Being able to ban specific useragents for blocking bad robots or by keyword would be nice.
For example, being able to block any domain with *casino* in it would be extremely useful.

I do not think this is possible using the current script but I would think it is possible to add a couple of extra functions to setup.php along with a small menu that would create links to connect to a different update file.

For example, the script currently used HTTP_REFERER and to block bad bots we would need to use HTTP_USER_AGENT and the syntax for the directives would be slightly different.

I am working on this functionality already for aStatSpam which I have converted to a phpNuke content management system module and have so far verified over 2000 bad referers.
My current list can be found at www.code-authors.com/update.php Visit through proxy
I am a metal monkey!
Administrator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 7823
39873 credits
Members referred : 3



« Reply #2 on: Mar 28, 2006, 04:25:02 PM »

Well about the first thing you say, it is possible now.

Just put in the custom domains array the word casino and you are done Smiley

Quote
I do not think this is possible using the current script but I would think it is possible to add a couple of extra functions to setup.php along with a small menu that would create links to connect to a different update file.

That's a nice advice. I will look it when I will start coding the script again.

About the user agent, that's an easy one, I will add it in the future. Untill then you can use this Visit through proxy Wink

Trial and Error my two best teachers Cool
Promote your blog for free.... Visit through proxy

Last blog : Keep it Legal - Tims guide to legal notices
I love Pokemon
*
Gender: Male
Posts: 14
84 credits
Members referred : 0



« Reply #3 on: Mar 28, 2006, 06:28:08 PM »

Cool thanks.
I'll compare your bad useragent list with my own as I have been collecting a few and authenticating them over the last few weeks.
I am a metal monkey!
Administrator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 7823
39873 credits
Members referred : 3



« Reply #4 on: Mar 28, 2006, 06:29:43 PM »

Good. Let us know of your improvements Smiley

Trial and Error my two best teachers Cool
Promote your blog for free.... Visit through proxy

Last blog : Keep it Legal - Tims guide to legal notices
I love Pokemon
*
Gender: Male
Posts: 14
84 credits
Members referred : 0



« Reply #5 on: Mar 28, 2006, 06:44:06 PM »

I am just about to leave for work but here is my current list - I'll reformat these as soon as I have time so they can be copied/pasted into htaccess
anonymiz
asterias
backdoorbot
black hole
blackwidow
blowfish
botalot
builtbottough
bullseye
bunnyslippers
catch
cegbfeieh
charon
cheesebot
cherrypicker
chinaclaw
combine
copyrightcheck
cosmos
crescent
curl
dbrowse
disco
dittospyder
dlman
dnloadmage
download
dreampassport
dts agent
ecatch
eirgrabber
erocrawler
express webpictures
extractorpro
eyenetie
fantombrowser
fantomcrew browser
fileheap
filehound
flashget
foobot
franklin locator
freshdownload
fscrawler
gamespy_arcade
getbot
getright
getweb
go!zilla
go-ahead-got-it
grab
grafula
gsa-crawler
harvest
hloader
hmview
httplib
httpresume
httrack
humanlinks
igetter
image stripper
image sucker
industry program
indy library
infonavirobot
installshield digitalwizard
interget
iria
irvine
iupui research bot
jbh agent
jennybot
jetcar
jobo
joc
kapere
kenjin spider
keyword density
larbin
leechftp
leechget
lexibot
libweb/clshttp
libwww-perl
lightningdownload
lincoln state web browser
linkextractorpro
linkscan/8.1a.unix
linkwalker
lwp-trivial
lwp::simple
mac finder
mata hari
mediasearch
metaproducts
microsoft url control
midown tool
miixpc
missauga locate
missouri college browse
mister pix
moget
mozilla.*newt
mozilla/3.0 (compatible)
mozilla/3.mozilla/2.01
msie 4.0 (win95)
multiblocker browser
mydaemon
mygetright
nabot
navroad
nearsite
net vampire
netants
netmechanic
netpumper
netspider
newsearchengine
nicerspro
ninja
nitro downloader
npbot
octopus
offline explorer
offline navigator
openfind
pagegrabber
papa foto
pavuk
pbrowse
pcbrowser
peval
pompos/
program shareware
propowerbot
prowebwalker
psurf
puf
puxarapido
queryn metasearch
realdownload
reget
repomonkey
rsurf
rumours-agent
sakura
scan4mail
semanticdiscovery
sitesnagger
slysearch
spankbot
spanner
spiderzilla
sq webscanner
stamina
star downloader
steeler
steeler
strip
superbot
superhttp
surfbot
suzuran
swbot
szukacz
takeout
teleport
telesoft
test spider
the intraformant
thenomad
tighttwatbot
titan
tocrawl/urldispatcher
true_robot
tsurf
turing machine
turingos
urlblaze
urlgetfile
urly warning
utilmind
vci
voideye
web image collector
web sucker
webauto
webbandit
webcapture
webcollage
webcopier
webenhancer
webfetch
webgo
webleacher
webmasterworldforumbot
webql
webreaper
website extractor
website quester
webster
webstripper
webwhacker
wep search
wget
whizbang
widow
wildsoft surfer
www-collector-e
www.netwu.com Visit through proxy
wwwoffle
xaldon
xenu
zeus
ziggy
zippy
I am a metal monkey!
Administrator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 7823
39873 credits
Members referred : 3



« Reply #6 on: Mar 28, 2006, 06:47:46 PM »

I think that having so much rules in your .htaccess can cause the server run slower

Trial and Error my two best teachers Cool
Promote your blog for free.... Visit through proxy

Last blog : Keep it Legal - Tims guide to legal notices
I love Pokemon
*
Gender: Male
Posts: 14
84 credits
Members referred : 0



« Reply #7 on: Mar 29, 2006, 08:51:37 AM »

Yes, that is something I had thought about but I found that the page load time was quicker with the directives in htaccess than the page load time I was getting without them due to multiple page loads per useragent.
Before blocking them my average page generation time was over 3 seconds, with the directives in place it is just over 1 second.
I currently have over 2000 blocks on referers, several hundred blocks by IP including some whole country ranges as well as the useragent blocks.
By far the biggest improvement was made by banning 'Slurp' which can consume massive amounts of resources even if you use the robots.txt Crawl-delay directive (crawl delay is specific to Yahoo's slurp / inktomi search bots).

I know I am getting a little off topic now but this information *may* be useful to someone.
I am a metal monkey!
Administrator
Community Supporter ?
Jedai Sword Master
*****
Gender: Male
Posts: 7823
39873 credits
Members referred : 3



« Reply #8 on: Mar 29, 2006, 02:22:48 PM »

That's very interesting.

About the slurp I don't think that is good to ban it, as yahoo can bring some traffic to your site.

You can use this to your robots.txt to make Slurp treat better to your server :

Code:
User-agent: Slurp
Crawl-delay: 12

Trial and Error my two best teachers Cool
Promote your blog for free.... Visit through proxy

Last blog : Keep it Legal - Tims guide to legal notices
Trackback URI for this entry : http://www.webdigity.com/trackback.php?topic=473
Tags : php referer spam yahoo domains robots.txt Bookmark this thread : Digg Del.icio.us Dzone more....

Topic sponsors:
Get a permanent link here for $1.99!


Pages: [1] Print 
Webdigity Webmaster Forums  >  WebDigity Community  >  User Forums  >  aStatSpam forum
Topic: Feauture requests
« previous next »
Jump to:
User Area
Welcome, Guest. Please login or register.
Did you miss your activation email?
May 13, 2008, 01:22:52 AM





Login with username, password and session length

Donate to our community, and get a permanent link back to your site!

Donate to our community, and get a permanent link back to your site!


Forum Statistics
Total Posts: 34.929
Total Topics: 7.262
Total Members: 3.479
Tutorials : 56
Resources : 143
Designs : 220
Latest Member: mileymo1

24 Guests, 3 Users online :

12 users online today:



Readers

Web Design Gallery · Whois Lookup · Pagerank · Tag Browsing · Lo-fi version · Syndication · Webmaster forum history · Advertise
Developed by HumanWorks © 2005 - 2008 Webdigity webmaster community · sublime directory
Webdigity Webmaster Forums | Powered by SMF 1.0.12. © 2001-2005, Lewis Media. All Rights Reserved.