



What lies beneath: harnessing the deep web
PARC Forum

25 February 2010
George E. Pake Auditorium, PARC


Type the word "Kauai" into your favorite search engine, and odds are you'll get more than seven million results: a jumble of blue links to the Hawaiian tourism bureau, luxury hotels on Poipu Beach and tour operators on the Napali Coast. Think those results are the sum total of all the Web's content on Kauai? Not even close.

Welcome to the Deep Web—the portion of the Web that is invisible to the search engine systems that crawl the Internet. Social networks, media-sharing sites for photos and videos, library catalogs, airline reservation systems, phone books and all kinds of scientific databases lie deep inside the Web. This hidden content is an estimated 500 times larger than the slice of the Web we see on Google or Yahoo.

As one of the world's foremost experts in the Deep Web, Anand Rajaraman will outline the size and scope of the Deep Web, share new approaches for bringing the Deep Web to light, discuss the potential of Deep Web content to change the way we explore information online, and examine the privacy issues to anticipate as the Deep Web becomes more accessible.


Anand Rajaraman and Venky Harinarayan co-founded Kosmix in 2005 with the vision of connecting people to information that makes a difference in their lives.

In addition to being a co-founder of Kosmix, Anand Rajaraman is a founding partner of Cambrian Ventures. Venky and Anand also co-founded Junglee, the company that pioneered internet comparison shopping, in 1996. After Amazon's acquisition of Junglee in 1998 for $250 million, Anand helped create Amazon's marketplace business, which today accounts for over 30% of Amazon's gross merchandise sales.

Anand holds a bachelor's degree in Computer Science from the Indian Institute of Technology, Madras, and a PhD in Computer Science from Stanford University. On his blog, Datawocky, he discusses data mining techniques in search, social media, and advertising.




