
Size matters: Yahoo claims 2-petabyte database is world’s biggest, busiest
Category: Internet and Network Date: May 23rd, 2008The petabyte is the new petaflop.
Interest in raw computational speed waned — sorry, IBM — after data center managers began turning away from super-expensive supercomputers and toward massive grids comprised of cheap PC servers.
Meanwhile, the rise of business intelligence and its even more technical cousin, business analytics, has spurred interest in super-large data warehouses that boost profits by crunching the behavior patterns of millions of consumers at a time.
Take Yahoo Inc.’s 2-petabyte, specially built data warehouse, which it uses to analyze the behavior of its half-billion Web visitors per month. The Sunnyvale, Calif.-based company makes a strong claim that it is not only the world’s single-largest database, but also the busiest.
Based on a heavily modified PostgreSQL engine, the year-old database processes 24 billion events a day, according to Waqar Hasan, vice president of engineering in Yahoo’s data group.
And the data, all of it constantly accessed and all of it stored in a structured, ready-to-crunch form, is expected to grow into the multiple tens of petabytes by next year.
Source: ComputerWorld
October 8th, 2008 at 6:19 pm
Surely, computational speed is not the end-all. The benefits Greenplum claims for its petabyte database solution is massive scale analytics AND speed.
http://www.greenplum.com/products/greenplum-database/