please define robot.txt what is it and how it works?
please define robot.txt what is it and how it works?
2WDH.com .:. Professional Hosting since 2003
use google to learn how to create a robots file.
its a file where you can control which search bot can crawl you blog.
and at the same time you can also configure which part of your site or directory shall be crawled by bots.
the very common config is
which means you allow every bots to crawl by indicating the asterisk[*]User-agent: *
Disallow: /
forward slash means you allow bots to crawl all from the root
here are some examples when users try to exclude folders not to be crawled
User-agent: *
Disallow: /private/
Disallow: /images-saved/
Disallow: /images-working/
shopster (04-29-2010)
it is a file to instruct search engines to crawl your website content
For me, robot.txt is a text file that contains directives for search engines bots that directly affects on how it'll gonna digest your site. For example, if a directive says, disallow blog folder - this would mean that yourdomain.com/blog won't be crawled and indexed. robot.txt file is used to address dupe contents between web pages and directories/folders within a website.
Check out OnlineCasinoSpecialist.com for safe and secure online casino gambling.
The robots.txt file is a set of instructions for visiting robots (spiders) that index the content of your web site pages.
For those spiders that obey the file, it provides a map of what they can, and cannot index. The file must reside in the root directory of your web. The URL path (web address) of your robots.txt file should look like this...
www. your site name .com/robots.txt
Hello,
Robots.txt is a text file you place on your site to inform search robots which pages you would like them not to visit. Robots.txt is by no means compulsory for search engines but usually search engines follow what they are asked not to do. It is essential to make clear that robots.txt is not a way from stop search engines from crawling your site
Bookmarks