Don't you hate it if your download page or thank you page can be found in Google, Yahoo or MSN? What about your eBook, is it listed in Google, too? The result of your hard work can be accessed by everyone because your page is simply indexed by search engines such as Google. You invested your money, time and energy to create your best product, but your product can be found easily by everyone without even paying you. You could lose thousands of dollars. You don't want this to happen to you, do you?

So here comes the yellow mouse robot spray. Repel search engine robots from indexing your pages!

Robots.txt

Robots.txt is a simple text file that tells search engine robots not to crawl certain directories and pages of your site. When a robot crawls your site, it looks for the robots.txt file. If it doesn't find one, it automatically assumes that it may crawl and index the entire site. Not having a robots.txt file can also create unnecessary 404 errors (page not found errors) in your server logs, making it more difficult to track the real 404 errors.

Creating a robots.txt file is easy.

1. Open Notepad or another text editor (not an HTML editor).
2. Type the following to prevent all robots from crawling or indexing a certain directory:

    User-agent: *
    Disallow: /directory/

   where directory is the directory that you don't want to be crawled or indexed by search engine robots.
3. Save the file as robots.txt.
4. Upload it to the root directory of your domain.
5. Confirm that everything is correct using a robots.txt validator. (You can easily find a robots.txt validator using Google Search.)

Is robots.txt alone enough to protect your product download page, thank you page and private page? Perhaps not, because nothing is perfect in this world. So I'll give you another trick to double protect your precious page.

Meta tags

Another trick, which may be more convenient for individual pages, is to add a META tag to an HTML page to tell search engine robots not to index the page. To prevent all robots from indexing a page on your site, place the following META tag in the HEAD section of your page:

    <META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">

That should help protect your precious pages. One more trick, exclusive to blogs only!

Disallow Pings

All of the tricks above are useless if you still allow pings for your blog page or blog post. Disallow pings for your precious page. If you use WordPress:

1. Expand your Discussion tab.
2. Uncheck "Allow Pings".

These tricks should help you protect your product download page, thank you page and private page. However, they don't protect your precious page if a buyer adds it to Internet Explorer's Favorites, Firefox's Bookmarks or social bookmarks and shares the link with friends.