Pivotal HD Architecture
Pivotal HD Architecture is a new Hadoop distribution by EMC Greenplum that includes a fully compliant SQL MPP database running on Hadoop Distributed File System (HDFS) and being "hundreds of times faster than Hive." Pivotal HD contains the usual suspects of a standard Hadoop distribution - HDFS, Pig, Hive, Mahout, Map-Reduce, etc. -- but adds several other components shown in the below architectural image. The main component of Pivotal is HAWQ, an MPP (Massively Parallel Processing) relational database running directly on HDFS in Hadoop through a dynamic pipelining mechanism and features: SQL compliant, Row or column-oriented data storage, query optimizer, fully JDBC compliant, interactive query, data management, supports data stored in HDFS, Hive, and sequence files, and deep analytics, including data mining or machine learning algorithms.
See More Related Templates