Terminology and resources for Hive Hadoop

To understand the Hive Hadoop integration with Unica Campaign, refer to this list of definitions and links to online resources.

Terms

Apache Hadoop® is an open-source software framework written in Java for distributed storage and distributed processing of very large data sets on computer clusters built from commodity hardware.

Apache Hive™ is a data warehouse infrastructure built on top of Hadoop to facilitate querying and managing large datasets that reside in distributed storage. Hive provides a mechanism to project structure onto this data and query the data using an SQL-like language called HiveQL.

Apache HBase™ is an open-source, non-relational, distributed database written in Java. It runs on top of HDFS, providing BigTable-like capabilities for Hadoop.

Hadoop Distributed File System (HDFS) is a distributed file system that runs on commodity hardware. It is designed to reliably store very large files across machines in a large cluster.

Hue is a Web interface for analyzing data with Apache Hadoop.

HiveQL (or HQL) is the Hive query language.

MapReduce is a programming model and an associated implementation for processing and generating large data sets with a parallel, distributed algorithm on a cluster. MapReduce is the core processing model of Hadoop: work is split into independent map tasks and reduce tasks, which is what allows processing to scale across hundreds or thousands of servers in a Hadoop cluster.
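
The MapReduce model can be illustrated with a small word count. The following is a minimal single-process sketch in Python, not Hadoop code: the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and chosen only for illustration. On a real cluster, the map and reduce steps would run in parallel on many machines and the framework would perform the grouping step.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over the input lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["hive runs on hadoop", "hadoop stores data in hdfs"]))
```

Because each call to map_phase depends only on its own line, and each call to reduce_phase depends only on its own key, both steps can be distributed across servers without coordination between tasks.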

Big data distributions of Apache Hadoop: Several vendors have developed their own distributions of Hadoop, including Cloudera, Hortonworks, IBM® BigInsights®, and MapR.

User tables is a Unica Campaign term that indicates any data source that contains an organization’s marketing data for access by Unica Campaign flowcharts. Typically, user tables contain data about customers, prospects, and products. For example, customer account data pulled from user tables could be used in a flowchart to target customers with specific account types and balances.
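
The targeting example above amounts to a simple selection over rows of a user table. The following Python sketch shows only that selection logic. It is not how Unica Campaign works internally, where the selection is configured in a flowchart. The table contents, column names (customer_id, account_type, balance), and the select_targets function are all hypothetical.

```python
# Hypothetical rows standing in for a user table of customer accounts.
customers = [
    {"customer_id": 1, "account_type": "savings", "balance": 12500.0},
    {"customer_id": 2, "account_type": "checking", "balance": 800.0},
    {"customer_id": 3, "account_type": "savings", "balance": 4300.0},
    {"customer_id": 4, "account_type": "premium", "balance": 56000.0},
]


def select_targets(rows, account_types, min_balance):
    """Return IDs of customers with a matching account type and balance."""
    return [
        row["customer_id"]
        for row in rows
        if row["account_type"] in account_types
        and row["balance"] >= min_balance
    ]


if __name__ == "__main__":
    # Target savings and premium customers holding at least 10,000.
    print(select_targets(customers, {"savings", "premium"}, 10000))
```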

Online resources

The following links were accurate at the time of publication but are subject to change: