Ask Alexa to Crawl your site

You may be wondering about "ia_archiver" and be curious about why it is visiting your site, or you may want to invite the robot to crawl your site. To block ia_archiver from crawling your site, please read below.

Additional information regarding our privacy policy, web crawling philosophy, and technology can be found on the following pages Privacy Policy and Technology. If you wish to change the contact information for your site, please visit our contact information editor. If you would like to suggest some Related Links, please visit our related link suggestion page.



The Alexa crawler (robot), which identifies itself as ia_archiver in the HTTP "User-agent" header field, uses a web-wide crawl strategy. Basically, it starts with a list of known URLs from across the entire Internet, then it fetches local links found as it goes. There are several advantages to this approach, most importantly that it creates the least possible disruption to the sites being crawled.

We will not index anything you would like to remain private. All you have to do is tell us. How? By using the Standard for Robot Exclusion (SRE).

The SRE was developed by Martijn Koster at Webcrawler to allow content providers to control how robots behave on their sites. All of the major Web-crawling groups, such as AltaVista, Inktomi, and Google, respect this standard. Alexa Internet strictly adheres to the standard:

The Alexa crawler looks for a file called "robots.txt". Robots.txt is a file website administrators can place at the top level of a site to direct the behavior of web crawling robots.

The Alexa crawler will always pick up a copy of the robots.txt file prior to its crawl of the Web. If you change your robots.txt file while we are crawling your site, please let us know so that we can instruct the crawler to retrieve the updated instructions contained in the robots.txt file.

To exclude all robots, the robots.txt file should look like this:

User-agent: *
Disallow: /

To exclude just one directory (and its subdirectories), say, the /images/ directory, the file should look like this:

User-agent: *
Disallow: /images/

Web site administrators can allow or disallow specific robots from visiting part or all of their site. Alexa's crawler identifies itself as ia_archiver, and so to allow ia_archiver to visit (while preventing all others), your robots.txt file should look like this:

User-agent: ia_archiver
Disallow:

To prevent ia_archiver from visiting (while allowing all others), your robots.txt file should look like this:

User-agent: ia_archiver
Disallow: /

For more information regarding robots, crawling, and robots.txt visit the Web Robots Pages at www.robotstxt.org, an excellent source for the latest information on the Standard for Robots Exclusion.
Fill out the form below to be crawled by Alexa.

There are a few reasons that Alexa may not have visited your site. Perhaps your site is new or we haven't discovered any links on the web that lead to your site. Or perhaps we haven't had any Alexa users visit your site. It is also possible that your web site administrator has disallowed crawlers from visiting your site - please read the information about robots.txt that we have provided above.

In any event, simply by visiting your site with the Alexa Toolbar open, Alexa will learn of your site and add it to our list of sites to visit, thus ensuring your inclusion in the Alexa service and in the Alexa archive.

If you are the type of person who won't be satisfied until you get to click a button that says "Crawl My Site," then we have just the form for you. Simply type your site's web address into the box below, and then click the button. Alexa will include your site in the next crawl of the web, usually within 8 weeks of submission.


for more infomation Visit: http://www.alexa.com/help/webmasters#crawl_site

3 comments:

Sell Original Software, Original Software Online store to buy Software Online said...

Sell Original Software!! The best software online store offering a complete collection of original software to buy at very affordable price. Give it a try now

Aceh Software Store said...

Thanks brother I try this All must try this

Make Money With Adsense said...

wow nice info Ser thanks for sharing to us

Post a Comment

◄ Newer Post Older Post ►