AceNik's Portal

Update To All Your Tech Fads Begins Here !!!!!
Featured Posts
New Form of Touchscreen Displays Pioneered, Extremely Multi-Touch

New Form of Touchscreen Displays Pioneered, Extremely Multi-Touch

You've heard of resistive touchscreens, and hopefully you've been fortunate enough to own a capacitive touchscreen phone. But have you heard of Interpolating Force-Sensitive Resistance, or I.F.S.R touchscreen technology? Touchco hopes you soon will. A bunch of scientists at New York Universit

Continue to read more...

DIY Moped Runs on Air [Air Powered]

DIY Moped Runs on Air [Air Powered]

This Puch moped only has a range of about 7 miles and with a top speed of only 18 mph, it isn't going to break any land speed records, but there is definitely something special about it: it runs on air. Jim Stansfield, an aeronautics graduate outfitted his Puch with a pair of carbon-fiber air cylind

Continue to read more...

Broadcom Wi-Fi Chips to Have Skyhook Wi-Fi Positioning Built-In

Broadcom Wi-Fi Chips to Have Skyhook Wi-Fi Positioning Built-In

Broadcom already makes a boatload of the GPS chips found in mobile phones and other location-aware gadgets, and now they're adding Skyhook's Wi-Fi positioning service to most of their mobile Wi-Fi chipsets, spreading the location-based love even without GPS. This is how iPhone regular finds you

Continue to read more...

Nokia C6 and C7 Touchscreen Phones Have 8MP Camera and New ClearBlack Displays

Nokia C6 and C7 Touchscreen Phones Have 8MP Camera and New ClearBlack Displays

Describing the C6 as a "premium touchscreen," it has a new ClearBlack Display which they're trying to position as the Pioneer KURO of the phone world—blacker blacks, but also brighter colors. The C7 is an even skinnier version. Both Symbian^3 phones have 8MP cameras and shoot video at 720p reso

Continue to read more...

Google Wants to Test Gigabit Fiber Internet For Up To 500,000 People

Since Google wants to control all forms of communication, the logical next step is being not just what you do on the internet, but how you access the internet as well. To do that, they'll deploy 1Gbps fiber to you. The company is going to test this super high speed internet to "a small number o

Continue to read more...

CCleaner 2.21.940

CCleaner 2.21.940

CCleaner is a freeware system optimization and privacy tool. It removes unused files from your system - allowing Windows to run faster and freeing up valuable hard disk space. It also cleans traces of your online activities such as your Internet history. But the best part is that it's fast (normal

Continue to read more...

12TB DVDs Could Be On The Way

12TB DVDs Could Be On The Way

A storage density of 51MB per square centimeter? Whatever, standard DVDs. Australian scientists developed a new multilayer optical storage medium that can house data at 1.1TB/cm3. Unlike existing DVD technology, the key to this data storage technique is the fact that multiple pieces of data can b

Continue to read more...

Sign Up Now to Test Google Wave in September

Sign Up Now to Test Google Wave in September

Yesterday we told you that Google Wave was opening to 100,000 regular folk at the end of September, but on closer examination, it looks like Google's already allowing users to get in line for their invite to the limited preview. Just head over to the Google Wave's sign up for updates page, enter i

Continue to read more...


How Google Crunches All That Data

If data centers are the brains of an information company, then is one of the brainiest there is. Though always evolving, it is, fundamentally, in the business of knowing everything. Here are some of the ways it stays sharp.

For tackling massive amounts of data, the main weapon in Google’s arsenal is MapReduce, a system developed by the company itself. Whereas other frameworks require a thoroughly tagged and rigorously organized database, MapReduce breaks the process down into simple steps, allowing it to deal with any type of data, which it distributes across a legion of machines.

Looking at MapReduce in 2008, Wired imagined the task of determining word frequency in Google Books. As its name would suggest, the MapReduce magic comes from two main steps: mapping and reducing.

The first of these, the mapping, is where MapReduce is unique. A master computer evaluates the request and then divvies it up into smaller, more manageable “sub-problems,” which are assigned to other computers. These sub-problems, in turn, may be divided up even further, depending on the complexity of the data set. In our example, the entirety of Google Books would be split, say, by author (but more likely by the order in which they were scanned, or something like that) and distributed to the worker computers.

Then the data is saved. To maximize efficiency, it remains on the worker computers’ local hard drives, as opposed to being sent, the whole petabyte-scale mess of it, back to some central location. Then comes the second central step: reduction. Other worker machines are assigned specifically to the task of grabbing the data from the computers that crunched it and paring it down to a format suitable for solving the problem at hand. In the Google Books example, this second set of machines would reduce and compile the processed data into lists of individual words and the frequency with which they appeared across Google’s digital library.

The finished product of the MapReduce system is, as Wired says, a “data set about your data,” one that has been crafted specifically to answer the initial question. In this case, the new data set would let you query any word and see how often it appeared in Google Books.

MapReduce is one way in which Google manipulates its massive amounts of data, sorting and resorting it into different sets that reveal new meanings and have unique uses. But another Herculean task Google faces is dealing with data that’s not already on its machines. It’s one of the most daunting data sets of all: the internet.

Last month, Wired got a rare look at the “algorithm that rules the web,” and the gist of it is that there is no single, set algorithm. Rather, Google rules the internet by constantly refining its search technologies, charting new territories like social media and refining the ones in which users tread most often with personalized searches.

But of course it’s not just about matching the terms people search for to the web sites that contain them. Amit Singhal, a Google Search guru, explains, “you are not matching words; you are actually trying to match meaning.”

Words are a finite data set. And you don’t need an entire data center to store them—a dictionary does just fine. But meaning is perhaps the most profound data set humanity has ever produced, and it’s one we’re charged with managing every day. Our own mental MapReduce probes for intent and scans for context, informing how we respond to the world around us.

In a sense, Google’s memory may be better than any one individual’s, and complex frameworks like MapReduce ensure that it will only continue to outpace us in that respect. But in terms of the capacity to process meaning, in all of its nuance, any one person could outperform all the machines in the Googleplex. For now, anyway. [Wired, Wikipedia, and Wired]

If you enjoyed this post, make sure you subscribe to my RSS feed!
Tags:

Related posts

Other News:

Loading...