It’s amazing to know how Google search happens in 0.30
to search more than 45 billion web pages and it takes even minute to search my
hard drive with only 500GB size, know why?
Hi everyone, in
last few posts I was discussing about android and related stuff on my blog, you
can read it here. Well I was heading on Google somewhere, I don’t
remember where, than I faced this question. I thought and compared and
according to direct answer and technology, 30-100 MB/s hard drive seems faster
than 500 kbps internet, still Google does it all in 0.30 seconds and here I gets
result.
Google have more than
45 billion web pages, see the graph below, I got it from website www.worldwidewebsize.com
The search result
on Google is faster and on Windows or any computer hard drive is slower due to
some reasons I am going to deal with.
Why searching on hard drive is slower than Google?
We have normal PC
and having some 30-100MB/s drive and interfaces. On hard drive there is no
proper indexing all the time so that we can follow the file too fast. When we
type the search, hard drive pointer and controller keeps moving hard drive traversing
the drive until the keyword is found, this is no doubt slower process and takes
so much time.
Also this depends
upon the type of hard drive, example on SSD it will be faster.
In Widows 7 onward, Microsoft have included
indexing option for faster searching with hashing technique, this makes the
search faster but not superior.
How Google index the web?
Google is not directly searching the
web when we ask for something, Google
just search its index located on its data center spread around the globe.
These data centers
are connected with high performance fiber optics and many level of redundancy to
make the data available even after some damage to any piece. There are
thousands of machines where these indexes are saved.
Google index the
web through a crawling program called spider, it follows the link, collect the
words in title, subtitle, images, alt tags, and text arranges it with URL and
follow the link on the web pages and keeps going ahead.
These data are
stored on high performance disks on data center; these disks are having special
arrangement of RAID 10.
RAID 10 means data
is striped over multiple disks and there exist a parallel copy of these data. This
architecture is so faster and help the Google to deliver more and more
speed.
How Google quickly access quarry from data center?
As mention earlier,
Google engineers have special
planning to design these highly efficient data center and keep the data on the
tips by using the RAID10 array.
RAID10 means data
is distributed over many parallel disks and here index is maintained. Whenever you
type a search quarry, these keywords are already positioned according to their
search frequency so they are residing in cache most of the time, these keywords
are filtered according to the location and person and many more thing like page
rank and arranged the data into the order and sent to user PC.
Due to ultra high
efficient connection between all the data centers, highly efficient algorithm and
more complex search filters, Google already have several distributions of
estimated searches from the area and it is improving daily and on each search. All
these make Google faster and better
in search.
The bottom line
Concluding all,
highly efficient machine with many mainframe with 1000 of processors, ultra
high connection, highly efficient search algorithm and smart disk arrangement,
Google manage to deliver its search result at fastest speed, even more than
your Hard drives.
hope you Go the
super power of Google again, and you find it worth sharing, also let us know
your thought with comment.
No comments :
Post a Comment