Monday, June 20, 2011

'Hadoop Alternative' to Be Open Sourced

IDG News Service (06/15/11) Chris Kanaracus

LexisNexis will open source its HPCC Systems supercomputing platform and offer developers an alternative to the Hadoop framework for large-scale data processing. The platform operates on clusters of commodity hardware and is comprised of components oriented around LexisNexis' Enterprise Control Language. The company says that data extraction, transformation, and loading tasks are managed by a component dubbed Thor, while a third system called Roxie delivers "highly scalable, high-performance online query processing and data warehouse capabilities." The system is capable of analyzing petabyte-sized data volumes "significantly faster and more accurately than current technology systems," scaling up to thousands of nodes, according to LexisNexis. Both a community edition and a commercial enterprise edition of HPCC Systems will be released by LexisNexis. HPCC Systems will initially be offered as a virtual machine for testing by the developer community, with full binaries and the source code to be released some weeks later. The community version will be issued under the GNU Affero GPL v3 license, while new code contributed by LexisNexis and community members will go to the open source edition first, according to LexisNexis.