Integration architecture: Hive-based Hadoop and Unica Campaign

This diagram shows the integration architecture for Unica Campaign and Hive-based Hadoop user data sources.

The Apache Hive data warehouse software facilitates querying and managing large datasets that reside in distributed storage. Built on top of Apache Hadoop, Hive provides:

  • Tools for data extract/transform/load (ETL)
  • A mechanism to impose structure on a variety of data formats
  • Access to files stored either directly in Apache HDFS or in other data storage systems such as Apache HBase
  • Query execution via MapReduce

Hive defines a simple SQL-like query language, called HiveQL (or HQL), that enables users familiar with SQL to query the data.
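
To illustrate how Hive imposes structure on existing files and how closely HiveQL resembles SQL, here is a minimal Python sketch that only composes HiveQL statement text. It does not connect to Hive. The table name, column names, and HDFS path are hypothetical and are not part of the Unica Campaign integration. You could paste the generated statements into any HiveQL client.

```python
# Minimal sketch: build HiveQL text for a hypothetical "customers" table.
# All table names, column names, and paths here are illustrative assumptions.

def create_external_table_hql(table, columns, location, delimiter=","):
    """Return a HiveQL statement that maps delimited text files at
    `location` onto a table schema, without moving or copying the files."""
    column_defs = ",\n  ".join(f"{name} {hive_type}" for name, hive_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} (\n"
        f"  {column_defs}\n"
        f")\n"
        f"ROW FORMAT DELIMITED FIELDS TERMINATED BY '{delimiter}'\n"
        f"STORED AS TEXTFILE\n"
        f"LOCATION '{location}'"
    )


def count_by_column_hql(table, column):
    """Return a SQL-like HiveQL aggregate query that counts rows per value."""
    return (
        f"SELECT {column}, COUNT(*) AS customer_count "
        f"FROM {table} GROUP BY {column}"
    )


if __name__ == "__main__":
    # Hypothetical schema and HDFS directory, for illustration only.
    print(create_external_table_hql(
        "customers",
        [("customer_id", "INT"), ("city", "STRING")],
        "/data/customers",
    ))
    print(count_by_column_hql("customers", "city"))
```

The first statement declares a schema over files that already sit in distributed storage. The second is an ordinary aggregate query that a SQL user would recognize.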

You can use the Hue editor (Hadoop User Experience, a web UI for Hadoop) to work with your big data instance, for example to connect to it and to view or create tables and databases.