Simple UI for monitoring and managing clusters as well as all services




Tools
ZETTASPARK integrates a suite of optimized and tested open-source tools that work seamlessly together, while providing additional features for management, security, and monitoring

Hadoop
Distributed Storage (Structured/ SemiStructured/ Unstructured Data)

Spark
Parallel Processing Engine for ETL, Scalable Machine Learning, and GPUAccelerated Big Data Execution

Hive on TEZ / Iceberg / Spark-SQL
Massive Data Analysis via SQL with Visualization Gateways like PowerBI, Qlik Sense, Tableau Software, Superset, etc.

Kafka / Flume / NiFi
Big Data Streaming

Airflow / NiFi
Scheduling and Monitoring of Jobs/Tasks in the Ecosystem

Kerberos / Atlas
Security and Data Governance

Sqoop
Data Ingestion from RDBMS to HDFS, Hive, and HBase

ZooKeeper
High Availability for Hadoop Masters, Coordination of HBase, Kafka, and Solr

Solr
Data Indexing and Search

HBase / Phoenix
Operational Big Data Transactional ACID Database

Hue / Zeppelin
Graphical Interfaces for HDFS, Hive and Spark Operations