How robots work


The Proper Way To Use The robot.txt File

When optimizing your web site most webmasterscontent.
don't consider using the robot.txt file. This
is a very important file for your site. ItHere  another
let the spiders and crawlers know what they
can and can not index. This is helpful inUser-agent: *
keeping them out of folders that you do not
want  index  like  the admin or stats folder.Disallow: /cgi-bin/
Here is a list of variables that you canThe above would block all spiders from
include in a robot.txt file and thereindexing  the  cgi-bin  directory.
meaning:
User-agent: googlebot
1. User-agent: In this field you can specify
a specific robot to describe access policyDisallow:
for or a "*" for all robots more explained in
example.User-agent: *
2. Disallow: In the field you specify theDisallow: /admin.php
files and folders not to include in the
crawl.Disallow: /cgi-bin/
3.  The  #  is  to  represent  commentsDisallow: /admin/
Here  are  some  examples of a robot.txt fileDisallow: /stats/
User-agent: *In the above example googlebot can index
everything while all other spiders can not
Disallow:index admin.php, cgi-bin, admin, and stats
directory. Notice that you can block single
The above would let all spiders index allfiles like admin.php.



1 A B C D 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105