Monday, November 29, 2010

Getting indexed and Prevent Crawling


Indexed?

When you type the name of your site into the Google search engine, nothing comes up. This is quite frustrating to you because it means that your site cannot be found by internet users. Your site is, essentially, missing. Getting indexed by Google means basically getting into their directory – introducing your site to Google by that one first hand shakes.



When you have a new website and you want it to be popular as soon as possible. So, your website needs to be found by people. How? Your website needs to be indexed by search engines, especially Google, is it really possible to get indexed by Google in 24-48 hours? Yes possible.

You may already know that adding your URL to Google is the easiest way to do it. Google recommends it too and then who challenges? Google gives you the option of submitting your URL to it, so that the next day you can get listed on its search for a particular term (keyword).

Each of the major search engines has a form you can fill out to add your URL in order for their search engine to spider your site.

What is a spider? It is the search engine process that grabs content from your site and (hopefully) is shown in their search engine. If you are lucky your site is spidered often looking for content updates. Spiders LOVE updated content.
So, have you got what’s important to get your site be indexed?
1. Content
2. Links
3. Social bookmarking sites, Forum, Web Directory
4. Sitemap, Ping
Many beginners know the first two – content and links – are important to get indexed and good ranking.
The latter two are always missed or not known. Social websites are very helpful to spread the word out. Sitemap helps search engine spiders to read your site. Pinging will send a message for. “Hey! My site is updated! Come visit my site!”.

“Google loves regularity” This doesn’t mean you have to post every day. Do it in equal intervals.

Say if you post 15 posts a week, do it once in 48 hours. Too slow a pace wouldn’t do any good either. Keep your blog updated at least once in 3 days. If you try it, initially you won’t see any results. Keep doing it for a month or two, the results will come.

How do you know when you get indexed?

Use Google Alerts for your own blog and when Google indexes you, you get notified with an email. Remember to choose, ‘as-it-happens‘, when you make the alert.


Preventing crawling:

To avoid undesirable content in search indexes, webmasters can instruct spiders not to crawl files or directories through the robots.txt file in the root directory of domain. A page can be explicitly excluded from a search engine's database by using a metatag specific to robots. When a search engine visits a site, the robots.txt located to the root directory is the first file crawled. The robots.txt file is then parsed and will instruct the robot as to which pages are not to be crawled. As a search engine crawler may keep a cached copy of this file, it may on occasion crawl pages a webmaster doesn't wish crawled. Pages typically prevented from being crawled include login specific pages such as shopping carts and user-specific content and search results from internal searches.
In March 2007, Google warned to webmasters that they should prevent indexing of internal search results because those pages are considered search as spam.
SEO RIDERS Web Developer

Morbi aliquam fringilla nisl. Pellentesque eleifend condimentum tellus, vel vulputate tortor malesuada sit amet. Aliquam vel vestibulum metus. Aenean ut mi aucto.

4 comments:

  1. New York Dentist
    I don't have any words to appreciate this post.....I am really impressed ....the person who created this post surely knew the subject well. Thanks for sharing this with us.

    ReplyDelete
  2. This comment has been removed by the author.

    ReplyDelete
  3. I like this article especially because it is very unique, filled with important concept. All the looks on your blog are amazing! I hope you can get some inspirations of mine too.
    XBRL Software | Online eTDS Software

    ReplyDelete