| |
Schloman, Barbara. (April 22, 2002). Information Resources Column: "Plumbing the Depths: Using the Invisible Web" Online Journal of Issues in Nursing. Available http://www.nursingworld.org/ojin/infocol/info_8.htm © 2002 Online Journal of Issues in Nursing Article published April 22, 2002 PLUMBING THE DEPTHS: USING THE INVISIBLE WEBBarbara F. Schloman, PhD, AHIPJust when we already feel overwhelmed by the number of results returned when using Web search engines, there is mounting evidence that there is a great deal that is not being searched--namely the "Invisible" or "Deep" Web. In fact, the argument is made that standard search engines are not indexing most of the information on the Web. Not only is a majority of Web content submerged within these sites, but it is also some of the best information on the Internet. Two publications in particular have explored what the Invisible Web is and the implications of it for us as Web searchers. Michael K. Berman (2001) published a white paper, "The Deep Web-Surfacing Hidden Value." Although the paper has a commercial influence because the research he presents showcases technology available from his employer BrightPlanet, the issues raised have provoked considerable attention. The second publication of note is a book by Chris Sherman and Gary Price (2001), The Invisible Web-Uncovering Information Sources Search Engines Can't See. This column will explore some of the issues raised and how to plumb the depths of the Invisible Web for health and other needed information. Bergman prefers the terminology of the "Deep Web." However, this column will use the more pervasive phrase of the "Invisible Web." Just what is it and why don't conventional search engines reach it? The Invisible Web is that portion of the Web that is not reached by standard search engines such as Alta Vista or Google. It includes specialized databases and search engines, archives of documents, directories and locators, dictionaries, library catalogs, and gated resources requiring a password or login. Some sites may have a hybrid status with some content visible and other that is not.
Content accessible on the Web continues to grow as government agencies, organizations, and corporations commit to doing business through this medium and are aided by advances in Web technology and the increasingly favorable economics of computer storage. Bergman's paper on the Deep Web was based on research done in March 2000, so one can imagine that the findings would be even more dramatic today. His findings on the Deep Web included:
Real-time Resource Directory Library Online Catalog Bibliographic Database News How do I find Invisible Web sites? Specialized finding aids have been created to help searchers identify sources such as these that will not turn up using standard search engines. What characterizes these tools is that they are not just automated retrievers, but the product of human effort and judgment. CompletePlanet Infomine: Scholarly Internet Resource Collections InvisibleWeb.com: The Search Engine of Search Engines BIOME - Health and Life Sciences www.Invisible-web.net -- The Invisible Web Directory Tools available from standard search engines AltaVista For example, a search on "Agent Orange news" turned up a shortcut that led to 35 headline articles from sources such as the BBC, MSNBC, Miami Herald, ABC Online, USA Today. For more information on AltaVista shortcuts and how they work, go to http://www.altavista.com/sites/search/shortcuts_overview Google If you want to search only for a particular file type, use Google's Advanced Search page. A drop-down menu is provided to allow you to restrict your search to a specific file type. For more information on searching by file type, go to http://www.google.com/help/faq_filetypes.html. Finding it once, retrieving it again Web users are often confounded by URLs that are uncomfortably lengthy, have embedded punctuation, and are impossible to remember. Such URLs typically reflect pages that have been retrieved from a search of a database on the Invisible Web. The URL is unique to the search that was done on that database and to the item retrieved. Reusing this URL would retrieve the same document. For example, a search on PubMed for "Nightingale and evidence-based nursing" retrieved the following reference. Citing or bookmarking this URL will refer the user to this same item. Evid Based Nurs 2001 Jul;4(3):68-9 The library's gated resources on the Invisible Web Libraries have traditionally served as the repository of our cultural heritage and of the scholarly record. These archives in the past were in print and sometimes in other media such as microform. Now the move, of course, is toward providing digitized resources available on the Web. However, many of these resources--such as the majority of electronically available journals and research databases (CINAHL, PsycINFO)-can be obtained only through subscription or licensing agreements. These contractual arrangements include restrictions from the publishers as to how these electronic resources can be used and by whom. To insure compliance with these agreements, libraries limit access to these resources through some validation mechanism. The bottom line is that some of the most valuable and timely information that libraries have to offer is on the Invisible Web and in fact is "gated." Users will not gain access to this material by searching on the Web at large, nor by using the tools mentioned above. It is incumbent upon users with privileges in a given library to seek out what that library has to offer electronically and what paths to that material are available.
THE AUTHORBarbara F. Schloman, PhD, AHIP
Keywords: Invisible Web, Internet, search engines
REFERENCESBergman, M. K. (2001). The deep Web: Surfacing hidden value. Retrieved March 11, 2002, from http://beta.brightplanet.com/deepcontent/tutorials/DeepWeb/index.aspSherman, C., & Price, G. (2001). The invisible Web: Uncovering information sources search engines can't see. Medford, NJ: Information Today. Disclaimer: Mention of a Web site does not imply endorsement by the author, OJIN, or NursingWorld.
Article published April 22, 2002
| ||||||||||