Tuesday, April 22, 2008

Google Starting in on the Invisible Web

The Google Webmaster blog announced that, Google is starting to crawl through HTML forms to get data. "In the past few months we have been exploring some HTML forms to try to discover new web pages and URLs that we otherwise couldn't find and index for users who search on Google."
Why is this important or interesting? Google crawling through HTML means that it is starting to harvest some of the stuff that is available on the "invisible web." The invisible web, is a term used to describe websites are not registered with any search engine. According a 2000 white paper, there are approximately 550 billion individual documents hanging around not found by search engines (invisible).

This step will make more information searchable online. It will be interesting to see how this effects searching and what kind of information Google will find.

For more indepth information on this from Google, go to the Official Google Webmaster Central Blog.

0 Comments:

Post a Comment

<< Home

RSS Button Subscribe to this feed.
Creative Commons License
This work is licensed under a Creative Commons Attribution 2.5 License.
       
 
The Krafty Librarian has been a medical librarian since 1998. She is currently the medical librarian for a hospital system in Ohio. You can email her at: