Google's Secret revealed: How does Google search query work
- Date: 2007-08-14 - Word Count: 455
Share This!
You must be thinking at times as to how Google is able to give out the correct or the required result in such a short span of time. I myself use to ponder over it for a long time.
Let me explain you the process behind the search result delivered in Google.
The life span of a Google search query normally lasts less than half a second, yet involves a number of different steps that must be completed before results can be delivered to a person seeking information.
How Google spider crawls: In Google, the web crawling (downloading of web pages) is done by several distributed crawlers (called as Googlebots). There is a URLserver that sends lists of URLs to be fetched to these crawlers. The web pages that are fetched by the googlebots are then sent to the storeserver. The storeserver then compresses and stores the web pages into a repository (a kind of storage facility). Every web page has an associated ID number called a docID that is assigned whenever a new URL is parsed (analysed) out of a web page.
The indexing function is performed by the indexer and the sorter. The indexer performs a number of functions. It reads the repository, uncompresses the documents, and parses (analyses) them. Each document is converted into a set of word occurrences called hits. The hits record the word, position in document, an approximation of font size, and capitalization.
The indexer distributes these hits into a set of "barrels", creating a partially sorted forward index. The indexer performs another important function. It parses (analyses) out all the links in every web page and stores all the important information about them in an anchors file. This file contains enough information to determine where each link points from and to, and the text of the link.
The URLresolver reads the anchors file and converts relative URLs into absolute URLs and in turn into docIDs. It puts the anchor text into the forward index, associated with the docID that the
anchor points to. It also generates a database of links that are pairs of docIDs.
The links database is used to compute Page Ranks for all the documents. The pages that get into the barrel first will be most likely to be included in the search results. The question is, how does Google decide which pages go into the barrel first? The answer (as some of you probably already guess) is based on the fact that the docID is an integer.
For viewing how Google search engine works,
Click here
About the Author:
Hari Qumar is a management graduate from a UK university and owns Begonia Infosys. He is actively involved in promoting SEO in India and works for an Entertainment portal as well.
Let me explain you the process behind the search result delivered in Google.
The life span of a Google search query normally lasts less than half a second, yet involves a number of different steps that must be completed before results can be delivered to a person seeking information.
How Google spider crawls: In Google, the web crawling (downloading of web pages) is done by several distributed crawlers (called as Googlebots). There is a URLserver that sends lists of URLs to be fetched to these crawlers. The web pages that are fetched by the googlebots are then sent to the storeserver. The storeserver then compresses and stores the web pages into a repository (a kind of storage facility). Every web page has an associated ID number called a docID that is assigned whenever a new URL is parsed (analysed) out of a web page.
The indexing function is performed by the indexer and the sorter. The indexer performs a number of functions. It reads the repository, uncompresses the documents, and parses (analyses) them. Each document is converted into a set of word occurrences called hits. The hits record the word, position in document, an approximation of font size, and capitalization.
The indexer distributes these hits into a set of "barrels", creating a partially sorted forward index. The indexer performs another important function. It parses (analyses) out all the links in every web page and stores all the important information about them in an anchors file. This file contains enough information to determine where each link points from and to, and the text of the link.
The URLresolver reads the anchors file and converts relative URLs into absolute URLs and in turn into docIDs. It puts the anchor text into the forward index, associated with the docID that the
anchor points to. It also generates a database of links that are pairs of docIDs.
The links database is used to compute Page Ranks for all the documents. The pages that get into the barrel first will be most likely to be included in the search results. The question is, how does Google decide which pages go into the barrel first? The answer (as some of you probably already guess) is based on the fact that the docID is an integer.
For viewing how Google search engine works,
Click here
About the Author:
Hari Qumar is a management graduate from a UK university and owns Begonia Infosys. He is actively involved in promoting SEO in India and works for an Entertainment portal as well.
Related Tags: google, webpage, docid
Your Article Search Directory : Find in Articles
Recent articles in this category:
- Internet Marketing Articles - SEO Articles - How Do You Create the Perfect Title?
You've got a highly-researched niche, and you've written the perfect SEO article to showcase it. Th - Web Traffic Articles - How to Drive Quality Traffic to Your Website
Today, we'll concentrate on effective tricks of driving quality traffic to your web site. If you are - Search Engines Articles - The Opinion Of Web Sites That Sell About Search Engines
It is a known fact the Google is the largest of all the search engines. You do have to wonder though - Search Engines Articles - Improve Seo
With this kind of feedback you can fine tune your SEO marketing strategies and use social media opti - Search Engines Articles - Search Engine Promotion using SEO Software
Search engine optimization is the process of tweaking a website to appear in the top listing of any - Search Engines Articles - What is the main requirement to optimize the website
Search Engines Optimization so many websites are indexed regularly, but still fail to appear in top - Internet Articles - Pay Per Click Management Guide_ AdWords Management
Pay per click management is a vast technique to boost targeted traffic to your website. Google adwor - Search Engines Articles - SEO New York -An Efficient Way to Promote Website
There are different ways to promote anything be it a product service or goods through advertisements - Real Estate Articles - Should You Buy or Rent a Home
The rent a home vs buy a home debate has gone on for years but what is the best thing to do? Should - Search Engines Articles - Search Engine Marketing New York - An Effective Internet Marketing Tool
Search engine marketing New York is the most influential technique to combat the rising competition
Most viewed articles in this category:
- Getting Indexed In Yahoo
Getting indexed in Yahoo has become very difficult in the last few months. The indexing robot, Yahoo - Are You Getting Nuked By Google Lately?
Since the last Google update, there have been many instances and examples of the Google Nuke Bot! Th - The Google Bar and its Jadoo
Here is a simple but very effective to get your Optimized web pages (or any web page you choose) lis - yahoo publish network or google adsense
As a new pay per click campaign, Yahoo has a much smaller advertising base than Google. In many cate - How Google Indexes Content From Your Web Directory
In a fluke, I was able to notice something about the way Google indexes content from web directories - Unique Content Is Your Key To Success!
As you search around this article site looking for articles to add to your website, consider this: t - Google News - Just another article announcer?
In Google's recent battle towards becoming an international news center, I've come to notice that th - Can Google Really Deliver Country Specific Searching?
This is a serious matter, can Google really deliver top quality search results for other countries? - Is Google having a tough time with their website limit?
If you are one to pay attention to what happens within the Google realm, you might find yourself thr - Google - A Bit of History
The first question most people have is, “What the heck is a “Google?” It is a play