Google Starting in on the Invisible Web
The Google Webmaster Central Blog announced that Google is starting to crawl through HTML forms to find content. "In the past few months we have been exploring some HTML forms to try to discover new web pages and URLs that we otherwise couldn't find and index for users who search on Google."
Why is this important or interesting? Google crawling through HTML forms means that it is starting to harvest some of the material available on the "invisible web." The invisible web is a term used to describe web content that is not indexed by any search engine, often because it can only be reached by filling out a form. According to a 2000 white paper, there are approximately 550 billion individual documents online that search engines do not find (and that are therefore invisible).
This step will make more information searchable online. It will be interesting to see how this affects searching and what kind of information Google will find.
For more in-depth information on this from Google, go to the Official Google Webmaster Central Blog.
