Tuesday, May 21, 2013

Chapter 1 : How Search Engines Operate?

Search engines have a short list of critical operations that allows them to provide relevant web results when searchers use their system to find information.

1) Crawling the web:  Search engines run automated programs, called "bots"or "spiders" that use the hyperlink structure of the web to "crawl" the pages and documents that make up the world wide web. Estimates are that of crawled between 8 and 10 billion.


2) Indexing Documents:  once a page has been crawled, it's contents can be ''indexed'' - stored in a giant database of documents that makes up a search engine's "index".

3) Processing Queries:  When a request for information comes into the search engine (hundreds of millions do each day), the engine retrieves from its index all the documents that match the query. A match is determined if the user.

4) Ranking results:  Once the search engine has determined which results are a match for the query, the engine's algorithm (a mathematical equation commonly used for sorting) runs calculations on each of the results to determine which is most relevant to the given query.

No comments:

Post a Comment