Wednesday, March 17, 2010

How Google Crunches All That Data [Memory Forever]

Source: http://gizmodo.com/5495097/how-google-crunches-all-that-data

If data centers are the brains of an information company, then Google is one of the brainiest there is. Though always evolving, it is, fundamentally, in the business of knowing everything. Here are some of the ways it stays sharp.

For tackling massive amounts of data, the main weapon in Google's arsenal is MapReduce, a system developed by the company itself. Whereas other frameworks require a thoroughly tagged and rigorously organized database, MapReduce breaks the process down into simple steps, allowing it to deal with any type of data, which it distributes across a legion of machines.

Looking at MapReduce in 2008, Wired imagined the task of determining word frequency in Google Books. As its name would suggest, the MapReduce magic comes from two main steps: mapping and reducing.

The first of these, the mapping, is where MapReduce is unique. A master computer evaluates the request and then divvies it up into smaller, more manageable "sub-problems," which are assigned to other computers. These sub-problems, in turn, may be divided up even further, depending on the complexity of the data set. In our example, the entirety of Google Books would be split, say, by author (but more likely by the order in which they were scanned, or something like that) and distributed to the worker computers.

Then the data is saved. To maximize efficiency, it remains on the worker computers' local hard drives, as opposed to being sent, the whole petabyte-scale mess of it, back to some central location. Then comes the second central step: reduction. Other worker machines are assigned specifically to the task of grabbing the data from the computers that crunched it and paring it down to a format suitable for solving the problem at hand. In the Google Books example, this second set of machines would reduce and compile the processed data into lists of individual words and the frequency with which they appeared across Google's digital library.

The finished product of the MapReduce system is, as Wired says, a "data set about your data," one that has been crafted specifically to answer the initial question. In this case, the new data set would let you query any word and see how often it appeared in Google Books.

MapReduce is one way in which Google manipulates its massive amounts of data, sorting and resorting it into different sets that reveal new meanings and have unique uses. But another Herculean task Google faces is dealing with data that's not already on its machines. It's one of the most daunting data sets of all: the internet.

Last month, Wired got a rare look at the "algorithm that rules the web," and the gist of it is that there is no single, set algorithm. Rather, Google rules the internet by constantly refining its search technologies, charting new territories like social media and refining the ones in which users tread most often with personalized searches.

But of course it's not just about matching the terms people search for to the web sites that contain them. Amit Singhal, a Google Search guru, explains, "you are not matching words; you are actually trying to match meaning."

Words are a finite data set. And you don't need an entire data center to store them—a dictionary does just fine. But meaning is perhaps the most profound data set humanity has ever produced, and it's one we're charged with managing every day. Our own mental MapReduce probes for intent and scans for context, informing how we respond to the world around us.

In a sense, Google's memory may be better than any one individual's, and complex frameworks like MapReduce ensure that it will only continue to outpace us in that respect. But in terms of the capacity to process meaning, in all of its nuance, any one person could outperform all the machines in the Googleplex. For now, anyway. [Wired, Wikipedia, and Wired]

Image credit CNET

Memory [Forever] is our week-long consideration of what it really means when our memories, encoded in bits, flow in a million directions, and might truly live forever.

Read More...

Samsung confirms slate PC in the works

Source: http://www.engadget.com/2010/03/16/samsung-confirms-slate-pc-in-the-works/

Well, the details are extremely thin here folks, but it looks like Samsung's working on a tablet PC -- or 'slate' if you're into the new fangled lingo -- just like pretty much every other manufacturer on the planet. Speaking with APC (the website, not the clothier), Philip Newton, director of Samsung Australia's IT division, said that the company is working on a slate PC for the second half of 2010 that will have "PC-grade processing power and connectivity" -- two things Newton had previously cited as the main things lacking in the iPad. That's really all there is to go on -- no specs or features mentioned, though Samsung has been vaguely hinting that the tablet, when it appears, will run on Intel's Atom platform, which sounds about right. We'll have to wait and see, won't we?

Samsung confirms slate PC in the works originally appeared on Engadget on Tue, 16 Mar 2010 16:02:00 EST. Please see our terms for use of feeds.

Permalink JKK Mobile  |  sourceAPC  | Email this | Comments

Read More...

Researchers teach liquid to flow uphill, hope to cool future CPUs (video)

Source: http://www.engadget.com/2010/03/16/researchers-teach-liquid-to-flow-uphill-hope-to-cool-future-cpu/

Another day, another experimental CPU cooling method that may or may not come to pass. We've seen "thermal paste" from IBM and polyethylene from MIT, and now researchers at the University of Rochester have developed a method for coaxing water along nanometer-scale grooves carved into silicon. So hydrophilic are the patterns that water will even flow against gravity (and we've got the video to prove it). Not only are the structures so precise and nondestructive that the surface feels smooth to the touch, but they also trap photons, according to The New York Times, "so the grooved silicon appears pitch-black." And who knows? Maybe your next PC will be cooled by streams of water flowing freely inside the case. It's a nice image, anyways. Peep the video after the break to see it in action for yourself.

Continue reading Researchers teach liquid to flow uphill, hope to cool future CPUs (video)

Researchers teach liquid to flow uphill, hope to cool future CPUs (video) originally appeared on Engadget on Tue, 16 Mar 2010 16:24:00 EST. Please see our terms for use of feeds.

Permalink New York Times  |  sourceUniversity of Rochester  | Email this | Comments

Read More...

Microsoft shows off Internet Explorer 9: says 'yes' to HTML5, 'no' to Windows XP

Source: http://www.engadget.com/2010/03/16/microsoft-shows-off-internet-explorer-9-says-yes-to-html5-n/

Microsoft is having a good old time at MIX10, showing off all sorts of new things. New things like... Internet Explorer 9, which has just been previewed at the developer event, and here's what we've gleaned about it so far. First off, as expected it will support HTML5 video, boast a new Microsoft JavaScript engine which is codenamed "Chakra," and it'll support new-fangled web technologies like CSS3 and SVG2. Microsoft says one of its main goals with IE9 is to provide a faster browsing experience -- always good news -- though they don't have things cranked quite as high as the competition just yet (remember, this is still early). Preliminary ACID3 tests on the preview show the IE9 scores a 55/100, up from IE8's dismal 20/100 -- a huge leap forward no doubt, but still a far cry from the Chrome, Opera, and Safari scores of 100. In both PCMag's and ZDNet's SunSpider JavaScript Benchmark test, the preview performed competitively as well. Microsoft has also confirmed that IE9 will not support Windows XP, but the preview Microsoft is showing off plays nice with Vista SP2 and higher, meaning the shipping version will probably do the same. No shockers there, really. Microsoft's also made the first developer preview of Internet Explorer 9 available for download today -- hit the source link to check that out.

Update:
Chrome, Opera, and Safari do indeed score 100/100 in ACID3 testing, not "nearly" as previously stated. Thanks commenters for pointing out the obvious.

Microsoft shows off Internet Explorer 9: says 'yes' to HTML5, 'no' to Windows XP originally appeared on Engadget on Tue, 16 Mar 2010 19:35:00 EST. Please see our terms for use of feeds.

Permalink   |  sourceIE9 Developer preview download, PC Mag  | Email this | Comments

Read More...

Intel's Core i7-980X Extreme Edition hits a slew of new gaming desktops

Source: http://www.engadget.com/2010/03/16/intels-core-i7-980x-extreme-edition-hits-a-slew-of-new-gaming-d/

Intel's six-core, twelve-threaded Core i7-980X Extreme Edition has turned the hardcore gaming community on its head, and just about everyone is scrounging around in a (mostly futile) attempt to locate $999. For those in dire need of an entire system replacement, it seems that today's the day to start looking. Shortly after we heard that this 32nm Gulftown chip would be landing with Alienware and Origin PC rigs, a veritable plethora of other outfits have shown up to make similar announcements. Digital Storm has popped an overclocked (4.4GHz) version into its Black|OPS machine (which conveniently starts at $5,642, while CyberPower is now offering the silicon in its Black Mamba, Black Pearl and Gamer Xtreme 3D machines. Maingear's also sliding said CPU into its world-beating Shift "supercomputer," and anyone shopping a high-end Velocity Micro system will also see the option. We suspect most every other PC maker in existence will be following suit soon, so if your prefab PC builder hasn't yet jumped on the bandwagon, just hold tight. Real tight.

Intel's Core i7-980X Extreme Edition hits a slew of new gaming desktops originally appeared on Engadget on Tue, 16 Mar 2010 20:12:00 EST. Please see our term s for use of feeds.

Permalink   |  sourceMaingear, Digital Storm, CyberPower, Velocity Micro  | Email this | Comments

Read More...