Wordpress and Your Robots Txt File

The results web users get when they type in wordscontent. That information is then entered into its large
or phrases that are associated with your blog ordatabase to be processed into the search ranks you
website is one of the best and cheapest ways to gainsee when you type in the terms you're searching for
traffic (if you aren't worried about gaining traffic to(i.e.: "how do I bake the best chocolate chip cookies").
your site you can stop reading now). New users toThe "User-agent" tells what specific search engine bot
Wordpress and websites in general may not begets the instructions (typically you leave it as * to
familiar with many of the terms and tools used by thesignify all search engines).
professionals to create and maintain a website,In this example, we tell all search engines to follow all
especially when it comes to SEO and search enginethe directories it can find and index them:
ranking.* SEO, or search engine optimization can seemUser-agent: *
like voodoo magic to the uninitiated users. Since no oneDisallow:
except Google (or any search engine) knows exactlyHere, we tell all search engines to not index anything
how the searches are ranked and computed, the best(useful if you want your site to be as private as
we can do is some trial and error to see what workspossible and do not care about gaining traffic):
best. There is no one formula that always works. AUser-agent: *
practical approach for the uninitiated webmaster isDisallow: /
what is needed. A properly configured robots.txt file willThis is an example of how a typical Wordpress blog
help, and is easy to configure once you know what toshould be configured (depending on what plugins and
look for.content you have there can be variations, but the
Wordpress by default has a virtual robots.txt file that'sconcept remains the same):
created automatically. This file, however, is not setupUser-agent: *
properly (as far as SEO is concerned). Ordinarily youDisallow: /wp-admin
would create your own robots.txt file and place it in theDisallow: /wp-includes
root of your site files that make up your website in theDisallow: /wp-content
web server. But, since Wordpress does thisDisallow: /feed/
automatically with a virtual file, we must disable thatDisallow: /tag/
feature first.Disallow: /author/
Download the robots.txt plugin (just do a googleDisallow: /comments/
search for kb-robotstxt for wordpress) and activate it.Disallow: /archive/
In the control panel for the plugin, we can add orThe main purpose of this configuration is to protect
remove the options and lines of code that make upany admin or core Wordpress files and folders from
the file. It's these lines that will help to make our siteaccidentally being indexed (this can be a security issue)
more search engine friendly.and to help reduce the chances of the search engines
The two things you must concern yourself with in theregarding duplicate content (which can hurt your page
code (see examples below) are prettyrank and search results). For example, you may have
straightforward:enough posts on your site to have one or many
1) What parts of my site do I want the search enginesarchive pages for users to go back and see the
to crawlprevious old posts. The search engines may see the
2) What parts of my site do I not want the searchoriginal post and then the post from the archive page
engines to crawlas duplicate content. In the long run, this can hurt your
By crawling your site with what's called a "bot", thesearch results (Google sees this as a sort of cheat
search engines comb over your site finding links andand penalizes you for it).