How robots work


The proper way to use the robot.txt file

When optimizing your web site most webmasterscontent.
don't consider using the robot.txt file. This
is a very important file for your site. ItHere  another
let the spiders and crawlers know what they
can and can not index. This is helpful inUser-agent:  *
keeping them out of folders that you do not
want index like the admin or stats folder orDisallow:  /cgi-bin/
content  that  they  can  not  index.
The above would block all spiders from
Here is a list of variables that you canindexing  the  cgi-bin  directory.
include in a robot.txt file and there
meaning:User-agent:  googlebot
1)User-agent: In this field you can specify aDisallow:
specific robot to describe access policy for
or a "*" for all robots more explained inUser-agent:  *
example.
Disallow:  /admin.php
2)Disallow: In the field you specify the
files and folders not to include in theDisallow:  /cgi-bin/
crawl.
Disallow:  /admin/
3)#  the  number  sign  represents  comments
Disallow:  /stats/
Here are some examples of a robot.txt file
for  User-agent:  *In the above example googlebot can index
everything while all other spiders can not
Disallow:index admin.php, cgi-bin, admin, and stats
directory. Notice that you can block single
The above would let all spiders index allfiles like admin.php.



1 A B C D 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105