Hadoop Hive

Apache Hive is a data warehouse software project built on top of Apache Hadoop for providing data query and analysis. Hive gives a SQL-like interface to query data stored in various databases and file systems that integrate with Hadoop. 1) Hadoop is a framework to process/query the Big data while Hive is an SQL Based tool which builds... » read more

Hadoop

Apache Hadoop is a collection of open-source software utilities that facilitate using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model. https://hadoop.apache.org/

Amazon Redshift

Amazon Redshift is an Internet hosting service and data warehouse product which forms part of the larger cloud-computing platform Amazon Web Services. It is built on top of technology from the massive parallel processing data warehouse company ParAccel, to handle large scale data sets and database migrations. https://aws.amazon.com/redshift/

Massively Parallel Processing (MPP)

MPP database is a type of database or data warehouse where the data and processing power are split up among several different nodes (servers), with one leader node and one or many compute nodes. MPP databases can scale horizontally by adding more compute resources (nodes), rather than having to worry about upgrading to more and more... » read more