14
How Do Search Engines Work? Dr. Steve Broskoske Misericordia University

How Do Search Engines Work? Dr. Steve Broskoske Misericordia University

Embed Size (px)

Citation preview

  • Slide 1
  • Slide 2
  • How Do Search Engines Work? Dr. Steve Broskoske Misericordia University
  • Slide 3
  • Do search engines actually search the entire Web when you enter search terms?
  • Slide 4
  • Yes and no. Yes, it does search the entire world. But not when you hit the search button. The search was already done in advance.
  • Slide 5
  • How do search engines actually work?
  • Slide 6
  • Without using a computer, how would you search out all of the people named Smith in the Wyoming Valley area? Use the phone book! Analogy
  • Slide 7
  • Search engines create an index, analogous to a phone book. When you search, you actually search the index! Word No. of Times URL technology 15 http://www.somewhere technology 12 http://www.somewhere terabyte 4 http://www.somewhere
  • Slide 8
  • Do search engines all produce the same quality of results?
  • Slide 9
  • No! The quality of results depends on the quality of the index. Indexes are not all the same! Read part of Web page. Read more of Web page. Read all of Web page. Also: Some indexes are arranged by humans, and others by computer algorithm.
  • Slide 10
  • How do Web pages make it into a search engine?
  • Slide 11
  • Web pages make it to a search engine through: 1.Web crawlers Software that searches the Web and indexes Web pages. 2.Voluntary submission.
  • Slide 12
  • Why can I sometimes not find the words I searched for?
  • Slide 13
  • Why you cant find search terms on Web pages: Web page content: Has been changed. Has been renamed or removed. Better search engines cache a copy of the page when they index it. The search terms were found on this page when this page was cached.
  • Slide 14
  • Does any information hide from search engines? There is much information that cannot be indexed by Web crawlers. This is know as the: Dynamically generated Web pages. Databases. (Contents of PDF, PowerPoint, and Word documents used to be non-searchable.) Invisible Web Some search engines are designed to locate information on the invisible/deep Web.
  • Slide 15
  • Review Search engines read all of the Web pages in the world, and index them. When you search, you are querying a search engines index. Much of the Web is unsearchable (called the invisible Web). Knowing how search engines work makes you a better searcher.