| by: Alan murray | | | | Disallow:/ filename here |
| What is the Robot Text File? | | | | If you use |
| The robot text file is used to disallow specific or all | | | | User-agent: * |
| search engine spider's access to folders or pages that | | | | The * acts as a wildcard and disallows all spiders. You |
| you don't want indexed. | | | | may want to use this to stop search engines listing |
| Why would you want to do this? | | | | unfinished pages. |
| You may have created a personnel page for | | | | To disallow an entire directory use |
| company employees that you don't want listed. Some | | | | Disallow:/mydirectory/ |
| webmasters use it to exclude their guest book pages | | | | To disallow an individual file use |
| so to avoid people spamming. There are many | | | | Disallow:/file.htm |
| different reasons to use the robots text file. | | | | You have to use a separate line for each disallow. |
| How do I use it? | | | | You cannot you for example use |
| You need to upload it to the root of your web site or it | | | | Disallow:/file1.htm,file2.html |
| will not work - if you don't have access to the root | | | | You should use |
| then you will need to use a Meta tag to disallow | | | | Use-agent/* |
| access. You need to include both the user agent and | | | | Disallow:/file1.htm |
| a file or folder to disallow. | | | | Disallow:/file2.htm |
| What does it look like? | | | | For a list of spider names visit |
| It's really nothing more than a "Notepad" type .txt file | | | | Make sure you use the right syntax if you don't it will |
| named "robots.txt" | | | | not work. You can check you syntax here |
| The basic syntax is | | | | For help on creating robot text files there is a program |
| User-agent: spiders name here | | | | call robogen. |