Host Health Tests
Host Agent Log Directory
This is a host health test that checks that the filesystem containing the Cloudera Manager Agent's log directory has sufficient free space. This test can be configured using the Cloudera Manager Agent Log Directory Free Space Monitoring Absolute Thresholds and Cloudera Manager Agent Log Directory Free Space Monitoring Percentage Thresholds host monitoring settings.
Short Name: Agent Log Directory
Property Name | Description | Template Name | Default Value | Unit |
---|---|---|---|---|
Cloudera Manager Agent Log Directory Free Space Monitoring Absolute Thresholds | The health check thresholds for monitoring of free space on the filesystem that contains the Cloudera Manager Agent's log directory. | host_agent_log_directory_free_space_absolute_thresholds | critical:1.073741824E9, warning:2.147483648E9 | BYTES |
Cloudera Manager Agent Log Directory Free Space Monitoring Percentage Thresholds | The health check thresholds for monitoring of free space on the filesystem that contains the Cloudera Manager Agent's log directory. Specified as a percentage of the capacity on that filesystem. This setting is not used if a Cloudera Manager Agent Log Directory Free Space Monitoring Absolute Thresholds setting is configured. | host_agent_log_directory_free_space_percentage_thresholds | critical:never, warning:never | PERCENT |
Host Agent Parcel Directory
This is a host health test that checks whether the filesystem containing the Cloudera Manager Agent's parcel directory has sufficient free space. This test can be configured using the Cloudera Manager Agent Parcel Directory Free Space Monitoring Absolute Thresholds and Cloudera Manager Agent Parcel Directory Free Space Monitoring Percentage Thresholds host monitoring settings.
Short Name: Agent Parcel Directory
Property Name | Description | Template Name | Default Value | Unit |
---|---|---|---|---|
Cloudera Manager Agent Parcel Directory Free Space Monitoring Absolute Thresholds | The health check thresholds for monitoring of free space on the filesystem that contains the Cloudera Manager Agent's parcel directory. | host_agent_parcel_directory_free_space_absolute_thresholds | critical:5.36870912E9, warning:1.073741824E10 | BYTES |
Cloudera Manager Agent Parcel Directory Free Space Monitoring Percentage Thresholds | The health check thresholds for monitoring of free space on the filesystem that contains the Cloudera Manager Agent's parcel directory. Specified as a percentage of the capacity on that filesystem. This setting is not used if a Cloudera Manager Agent Parcel Directory Free Space Monitoring Absolute Thresholds setting is configured. | host_agent_parcel_directory_free_space_percentage_thresholds | critical:never, warning:never | PERCENT |
Host Agent Process Directory
This is a host health test that checks that the filesystem containing the Cloudera Manager Agent's process directory has sufficient free space. The process directory contains the configuration files for the processes which the Cloudera Manager Agent starts. This test can be configured using the Cloudera Manager Agent Process Directory Free Space Monitoring Absolute Thresholds and Cloudera Manager Agent Process Directory Free Space Monitoring Percentage Thresholds host monitoring settings.
Short Name: Agent Process Directory
Property Name | Description | Template Name | Default Value | Unit |
---|---|---|---|---|
Cloudera Manager Agent Process Directory Free Space Monitoring Absolute Thresholds | The health check thresholds for monitoring of free space on the filesystem that contains the Cloudera Manager Agent's process directory. | host_agent_process_directory_free_space_absolute_thresholds | critical:1.048576E8, warning:2.097152E8 | BYTES |
Cloudera Manager Agent Process Directory Free Space Monitoring Percentage Thresholds | The health check thresholds for monitoring of free space on the filesystem that contains the Cloudera Manager Agent's process directory. Specified as a percentage of the capacity on that filesystem. This setting is not used if a Cloudera Manager Agent Process Directory Free Space Monitoring Absolute Thresholds setting is configured. | host_agent_process_directory_free_space_percentage_thresholds | critical:never, warning:never | PERCENT |
Host Agent Status
This is a host health test that checks that the host's Cloudera Manager Agent is heartbeating correctly to the Cloudera Manager Server, is in contact with the Host Monitor, and has the correct software version. A failure of this health test may indicate a lack of connectivity between the host's Cloudera Manager Agent and the Cloudera Manager Server, a lack of connectivity betwen the host's Cloudera Manager Agent and the Host Monitor, or that the Cloudera Manager Agent or Host Monitor software is out of date. Check the status of the Cloudera Manager Agent by running /etc/init.d/cloudera-scm-agent status on the host, or look in the host's Cloudera Manager Agent logs for more details. If this test reports a software version mismatch between the Cloudera Manager Agent and the Host Monitor, check the version of each component by consulting the appropriate logs or the appropriate status web pages. This test can be enabled or disabled using the Host Process Health Test host configuration setting.
Short Name: Agent Status
Property Name | Description | Template Name | Default Value | Unit |
---|---|---|---|---|
Host Process Health Test | Enables the health test that the host's process state is consistent with the role configuration | host_scm_health_enabled | true | no unit |
Host Clock Offset
This is a host health test that checks if the host's system clock appears to be out-of-sync with its NTP server(s). The test uses the 'ntpdc -np' command to check that the host is synchronized to an NTP peer and that the absolute value of the host's clock offset from that peer is not too large. If the command fails, NTP is not synchronized to a server, or the host's NTP daemon is not running or cannot be contacted, the test will return "Bad" health. The 'ntpdc -np' output contains a row for each of the host's NTP servers. The row starting with a '*' contains the peer to which the host is currently synchronized. No row starting with a '*' indicates that the host is not currently synchronized. Communication errors and too large an offset between the peer and the host time are examples of conditions that can lead to a host being unsynchronized. Make sure that UDP port 123 is open in any firewall that is in use. Check the system log for ntpd messages related to configuration errors. Use 'ntpdc -c iostat' to verify that packets are sent and recieved between the different peers. More information about the conditions of each peer can be found by running the command 'ntpq -c as'. The output of this command includes the association ID that can be used in combination with 'ntpq -c "rv <association id>"' to get more information about the status of each such peer. The command 'ntpq -c pe' can also be used to return a summary of all peers and the reason why they are not in use. If NTP is not in use on the host, this check should be disabled for the host using the configuration options shown below. Cloudera recommends using NTP for time synchronization of Hadoop clusters. A failure of this health test can indicate a problem with the host's NTP service or configuration. This test can be configured using the Host Clock Offset Thresholds host configuration setting.
Short Name: Clock Offset
Property Name | Description | Template Name | Default Value | Unit |
---|---|---|---|---|
Host Clock Offset Thresholds | The thresholds for the host clock offset health test. The test compares this threshold against the absolute value of the clock offset reported by the host's NTP service from the 'ntpdc -np' command. Setting both the warning and critical threshold values to never turns off collection of the clock offset by the Cloudera Manager Agent. If NTP is not in use, both threshold values should be set to never. Cloudera recommends using NTP for time synchronization of Hadoop clusters. | host_clock_offset_thresholds | critical:10000.0, warning:3000.0 | MILLISECONDS |
Host DNS Resolution
This is a host health test that checks that the host's hostname and canonical name are consistent when checked from a Java process, and that the DNS resolution completes in a timely manner. The DNS resolution duration is calculated by measuring the time taken to call getLocalHost in a Java process on this host. Note that DNS information may be cached on the host, and this caching can affect the reported resolution duration. A failure of this health test may indicate that the host's DNS configuration is not correct or the host's DNS server is responding slowly. Check the Cloudera Manager Agent log for the names that were detected by this test or for errors running the Java process. The hostname and canonical name are considered to be consistent if the hostname or the hostname plus a domain name is the same as the canonical name. This health test uses domain names from the domain and search lines in /etc/resolv.conf. This health test does not consult /etc/nsswitch.conf and may give incorrect results if /etc/resolv.conf is not used by the host. There may be a delay of up to 5 minutes before this health test picks up changes to /etc/resolv.conf. This test can be configured using the Hostname and Canonical Name Health Check host configuration setting.
Short Name: DNS Resolution
Property Name | Description | Template Name | Default Value | Unit |
---|---|---|---|---|
Hostname and Canonical Name Health Check | Whether the hostname and canonical names for this host are consistent when checked from a Java process. | host_dns_resolution_enabled | true | no unit |
Host Frame Errors
This is a host health test that checks for network frame errors across all network interfaces. A failure of this health test may indicate a problem with network hardware (e.g. switches) and can potentially cause other service or role-level performance problems. Check the host and network hardware logs for more details. This test can be configured using the Host Network Frame Error Percentage Thresholds, Host Network Frame Error Check Window, Host Network Frame Error Test Minimum Required Packets host configuration settings.
Short Name: Frame Errors
Property Name | Description | Template Name | Default Value | Unit |
---|---|---|---|---|
Host Network Frame Error Check Window | The amount of time over which the host frame error checks for frame errors. | host_network_frame_errors_window | 15 | MINUTES |
Host Network Frame Error Percentage Thresholds | The health check thresholds for the percentage of received packets that are frame errors. | host_network_frame_errors_thresholds | critical:0.5, warning:any | PERCENT |
Host Network Frame Error Test Minimum Required Packets | The minimum number of received packets that must be received within the test window for this test to return "Bad" health. If less that this number of packets is received during the test window, the health check will never return "Bad" health. | host_network_frame_errors_floor | 0 | no unit |
Host Network Interface Speed
This is a host health test that checks for network interfaces that appear to be operating at less than full speed. The Cloudera Manager Agent uses ethtool ioctl to determine the network interface speed and duplex mode. A failure of this health test may indicate that network interface(s) may be configured incorrectly and may be causing performance problems. Use the ethtool command to check and configure the host's network interfaces to use the fastest available link speed and duplex mode. For inactive interfaces or for interfaces that do not support ethtool ioctl, the Cloudera Manager Agent cannot collect speed metrics. If the Cloudera Manager Agent fails to collect speed metrics for all network interfaces on the host, the test is disabled. This test can be configured using the Host's Network Interfaces Slow Link Modes Thresholds, Network Interface Expected Link Speed and Network Interface Expected Duplex Mode host configuration settings.
Short Name: Network Interface Speed
Property Name | Description | Template Name | Default Value | Unit |
---|---|---|---|---|
Host's Network Interfaces Slow Link Modes Thresholds | The thresholds for the health check of the number of network interfaces that appear to be operating at less than full speed. | host_network_interfaces_slow_mode_thresholds | critical:never, warning:any | no unit |
Network Interface Expected Duplex Mode | The expected duplex mode for network interfaces. | host_nic_expected_duplex_mode | Full | no unit |
Network Interface Expected Link Speed | The expected network interface link speed. | host_nic_expected_speed | 1000 | no unit |
Host Swapping
This is a health test that checks that the host has not swapped out more than a certain number of pages over the last fifteen minutes. A failure of this health test may indicate misconfiguration of the host operating system, or too many processes running on the host. Try reducing vm.swappiness, or add more memory to the host. This test can be configured using the Host Memory Swapping Thresholds, Host Memory Swapping Check Window host configuration settings.
Short Name: Swapping
Property Name | Description | Template Name | Default Value | Unit |
---|---|---|---|---|
Host Memory Swapping Check Window | The amount of time over which the memory swapping test checks for pages swapped. | host_memswap_window | 15 | MINUTES |
Host Memory Swapping Thresholds | The health check thresholds of the number of pages swapped out on the host in the last 15 minutes | host_memswap_thresholds | critical:never, warning:any | PAGES |
Categories: Cloudera Manager | Monitoring | Operation | All Categories
- Cloudera Introduction
- CDH Overview
- Cloudera Manager 5 Overview
- Cloudera Navigator 2 Overview
- Frequently Asked Questions About Cloudera Software
- Getting Support
- Cloudera Release Notes
- Cloudera QuickStart
- Cloudera Installation and Upgrade
- Installation Requirements for Cloudera Manager, Cloudera Navigator, and CDH 5
- Cloudera Manager 5 Requirements and Supported Versions
- Permission Requirements for Package-based Installations and Upgrades of CDH
- Cloudera Navigator 2 Requirements and Supported Versions
- CDH 5 Requirements and Supported Versions
- Supported Configurations with Virtualization and Cloud Platforms
- Filesystem Requirements
- Ports
- Managing Software Installation Using Cloudera Manager
- Installation Overview
- Java Development Kit Installation
- Cloudera Manager and Managed Service Data Stores
- Installation Path A - Automated Installation by Cloudera Manager
- Installation Path B - Installation Using Cloudera Manager Parcels or Packages
- Installation Path C - Manual Installation Using Cloudera Manager Tarballs
- Installing Impala
- Installing Search
- Installing Spark
- Installing the GPL Extras Parcel
- Understanding Custom Installation Solutions
- Deploying Clients
- Testing the Installation
- Uninstalling Cloudera Manager and Managed Software
- Uninstalling a CDH Component From a Single Host
- Installing the Cloudera Navigator Data Management Component
- Installing Cloudera Navigator Key Trustee Server
- Installing Cloudera Navigator Key HSM
- Installing Key Trustee KMS
- Installing Cloudera Navigator Encrypt
- Installing and Deploying CDH Using the Command Line
- Before You Install CDH 5 on a Cluster
- Creating a Local Yum Repository
- Installing the Latest CDH 5 Release
- Installing an Earlier CDH 5 Release
- CDH 5 and MapReduce
- Migrating from MapReduce (MRv1) to MapReduce (MRv2)
- Deploying CDH 5 on a Cluster
- Installing CDH 5 Components
- Crunch Installation
- Flume Installation
- HBase Installation
- New Features and Changes for HBase in CDH 5
- Upgrading HBase
- Installing HBase
- Configuration Settings for HBase
- Starting HBase in Standalone Mode
- Configuring HBase in Pseudo-Distributed Mode
- Deploying HBase on a Cluster
- Accessing HBase by using the HBase Shell
- HBase Online Merge
- Using MapReduce with HBase
- Troubleshooting HBase
- Viewing the HBase Documentation
- HCatalog Installation
- HCatalog Prerequisites
- Installing and Upgrading the HCatalog RPM or Debian Packages
- Configuration Change on Hosts Used with HCatalog
- Starting and Stopping the WebHCat REST server
- Accessing Table Information with the HCatalog Command-line API
- Accessing Table Data with MapReduce
- Accessing Table Data with Pig
- Accessing Table Information with REST
- Viewing the HCatalog Documentation
- Impala Installation
- Hive Installation
- About Hive
- Upgrading Hive
- Installing Hive
- Configuring the Hive Metastore
- Configuring HiveServer2
- Starting the Metastore
- File System Permissions
- Starting, Stopping, and Using HiveServer2
- Starting HiveServer1 and the Hive Console
- Using Hive with HBase
- Using the Hive Schema Tool
- Installing the Hive JDBC Driver on Clients
- Setting HADOOP_MAPRED_HOME
- Configuring the Metastore to Use HDFS High Availability
- Troubleshooting Hive
- Viewing the Hive Documentation
- HttpFS Installation
- Hue Installation
- KMS Installation and Upgrade
- Mahout Installation
- Oozie Installation
- Pig Installation
- Search Installation
- Sentry Installation
- Snappy Installation
- Spark Installation
- Sqoop 1 Installation
- Sqoop 2 Installation
- Whirr Installation
- ZooKeeper Installation
- Building RPMs from CDH Source RPMs
- Apache and Third-Party Licenses
- Uninstalling CDH Components
- Viewing the Apache Hadoop Documentation
- Installation and Upgrade with the EMC DSSD D5
- DSSD D5 Installation Path A - Automated Installation by Cloudera Manager Installer
- DSSD D5 Installation Path B - Installation Using Cloudera Manager Parcels
- DSSD D5 Installation Path C - Manual Installation Using Cloudera Manager Tarballs
- DSSD D5 and Short-Circuit Reads
- Tuning the HDFS Block Size for DSSD Mode
- Upgrading with the EMC DSSD D5
- Upgrade
- Upgrading Cloudera Manager
- Upgrading the Cloudera Navigator Data Management Component
- Upgrading Cloudera Navigator Key Trustee Server
- Upgrading Cloudera Navigator Key HSM
- Upgrading Key Trustee KMS
- Upgrading Cloudera Navigator Encrypt
- Upgrading CDH and Managed Services Using Cloudera Manager
- Configuring the CDH Version of a Cluster
- Performing a Rolling Upgrade on a CDH 5 Cluster
- Performing a Rolling Upgrade on a CDH 4 Cluster
- Upgrading to CDH Maintenance Releases
- Upgrading to CDH 5.7
- Upgrading to CDH 5.6
- Upgrading to CDH 5.5
- Upgrading to CDH 5.4
- Upgrading to CDH 5.3
- Upgrading to CDH 5.2
- Upgrading to CDH 5.1
- Upgrading CDH 4 to CDH 5
- Upgrading CDH 4
- Upgrading CDH 3
- Upgrading Unmanaged CDH Using the Command Line
- Upgrading to Oracle JDK 1.7
- Upgrading to Oracle JDK 1.8
- Troubleshooting Installation and Upgrade Problems
- Rolling Back a CDH 4 to CDH 5 Upgrade
- Installation Requirements for Cloudera Manager, Cloudera Navigator, and CDH 5
- Cloudera Administration
- Managing CDH and Managed Services
- Managing CDH and Managed Services Using Cloudera Manager
- Configuration Overview
- Managing Clusters
- Managing Services
- Managing Roles
- Managing Hosts
- Maintenance Mode
- Cloudera Manager Configuration Properties
- CDH 5.7.0 Properties
- Accumulo 1.6 Properties in CDH 5.7.0
- Flume Properties in CDH 5.7.0
- HBase Properties in CDH 5.7.0
- HDFS Properties in CDH 5.7.0
- Hive Properties in CDH 5.7.0
- Hue Properties in CDH 5.7.0
- Impala Properties in CDH 5.7.0
- Isilon Properties in CDH 5.7.0
- Java KeyStore KMS Properties in CDH 5.7.0
- Kafka Properties in CDH 5.7.0
- Key Trustee KMS Properties in CDH 5.7.0
- Key Trustee Server Properties in CDH 5.7.0
- Key-Value Store Indexer Properties in CDH 5.7.0
- MapReduce Properties in CDH 5.7.0
- Oozie Properties in CDH 5.7.0
- Sentry Properties in CDH 5.7.0
- Solr Properties in CDH 5.7.0
- Spark Properties in CDH 5.7.0
- Spark (Standalone) Properties in CDH 5.7.0
- Sqoop 1 Client Properties in CDH 5.7.0
- Sqoop 2 Properties in CDH 5.7.0
- YARN (MR2 Included) Properties in CDH 5.7.0
- ZooKeeper Properties in CDH 5.7.0
- CDH 5.6.0 Properties
- Accumulo 1.6 Properties in CDH 5.6.0
- Flume Properties in CDH 5.6.0
- HBase Properties in CDH 5.6.0
- HDFS Properties in CDH 5.6.0
- Hive Properties in CDH 5.6.0
- Hue Properties in CDH 5.6.0
- Impala Properties in CDH 5.6.0
- Isilon Properties in CDH 5.6.0
- Java KeyStore KMS Properties in CDH 5.6.0
- Kafka Properties in CDH 5.6.0
- Key Trustee KMS Properties in CDH 5.6.0
- Key Trustee Server Properties in CDH 5.6.0
- Key-Value Store Indexer Properties in CDH 5.6.0
- MapReduce Properties in CDH 5.6.0
- Oozie Properties in CDH 5.6.0
- Sentry Properties in CDH 5.6.0
- Solr Properties in CDH 5.6.0
- Spark Properties in CDH 5.6.0
- Spark (Standalone) Properties in CDH 5.6.0
- Sqoop 1 Client Properties in CDH 5.6.0
- Sqoop 2 Properties in CDH 5.6.0
- YARN (MR2 Included) Properties in CDH 5.6.0
- ZooKeeper Properties in CDH 5.6.0
- CDH 5.5.0 Properties
- Accumulo 1.6 Properties in CDH 5.5.0
- Flume Properties in CDH 5.5.0
- HBase Properties in CDH 5.5.0
- HDFS Properties in CDH 5.5.0
- Hive Properties in CDH 5.5.0
- Hue Properties in CDH 5.5.0
- Impala Properties in CDH 5.5.0
- Isilon Properties in CDH 5.5.0
- Java KeyStore KMS Properties in CDH 5.5.0
- Kafka Properties in CDH 5.5.0
- Key Trustee KMS Properties in CDH 5.5.0
- Key Trustee Server Properties in CDH 5.5.0
- Key-Value Store Indexer Properties in CDH 5.5.0
- MapReduce Properties in CDH 5.5.0
- Oozie Properties in CDH 5.5.0
- Sentry Properties in CDH 5.5.0
- Solr Properties in CDH 5.5.0
- Spark Properties in CDH 5.5.0
- Spark (Standalone) Properties in CDH 5.5.0
- Sqoop 1 Client Properties in CDH 5.5.0
- Sqoop 2 Properties in CDH 5.5.0
- YARN (MR2 Included) Properties in CDH 5.5.0
- ZooKeeper Properties in CDH 5.5.0
- CDH 5.4.0 Properties
- Accumulo 1.6 Properties in CDH 5.4.0
- Flume Properties in CDH 5.4.0
- HBase Properties in CDH 5.4.0
- HDFS Properties in CDH 5.4.0
- Hive Properties in CDH 5.4.0
- Hue Properties in CDH 5.4.0
- Impala Properties in CDH 5.4.0
- Isilon Properties in CDH 5.4.0
- Java KeyStore KMS Properties in CDH 5.4.0
- Kafka Properties in CDH 5.4.0
- Key Trustee KMS Properties in CDH 5.4.0
- Key-Value Store Indexer Properties in CDH 5.4.0
- MapReduce Properties in CDH 5.4.0
- Oozie Properties in CDH 5.4.0
- Sentry Properties in CDH 5.4.0
- Solr Properties in CDH 5.4.0
- Spark Properties in CDH 5.4.0
- Spark (Standalone) Properties in CDH 5.4.0
- Sqoop 1 Client Properties in CDH 5.4.0
- Sqoop 2 Properties in CDH 5.4.0
- YARN (MR2 Included) Properties in CDH 5.4.0
- ZooKeeper Properties in CDH 5.4.0
- CDH 5.3.0 Properties
- Accumulo 1.6 Properties in CDH 5.3.0
- Flume Properties in CDH 5.3.0
- HBase Properties in CDH 5.3.0
- HDFS Properties in CDH 5.3.0
- Hive Properties in CDH 5.3.0
- Hue Properties in CDH 5.3.0
- Impala Properties in CDH 5.3.0
- Java KeyStore KMS Properties in CDH 5.3.0
- Kafka Properties in CDH 5.3.0
- Key Trustee KMS Properties in CDH 5.3.0
- Key-Value Store Indexer Properties in CDH 5.3.0
- MapReduce Properties in CDH 5.3.0
- Oozie Properties in CDH 5.3.0
- Sentry Properties in CDH 5.3.0
- Solr Properties in CDH 5.3.0
- Spark Properties in CDH 5.3.0
- Spark (Standalone) Properties in CDH 5.3.0
- Sqoop 1 Client Properties in CDH 5.3.0
- Sqoop 2 Properties in CDH 5.3.0
- YARN (MR2 Included) Properties in CDH 5.3.0
- ZooKeeper Properties in CDH 5.3.0
- CDH 5.2.0 Properties
- Accumulo 1.6 Properties in CDH 5.2.0
- Flume Properties in CDH 5.2.0
- HBase Properties in CDH 5.2.0
- HDFS Properties in CDH 5.2.0
- Hive Properties in CDH 5.2.0
- Hue Properties in CDH 5.2.0
- Impala Properties in CDH 5.2.0
- Java KeyStore KMS Properties in CDH 5.2.0
- Kafka Properties in CDH 5.2.0
- Key Trustee KMS Properties in CDH 5.2.0
- Key-Value Store Indexer Properties in CDH 5.2.0
- MapReduce Properties in CDH 5.2.0
- Oozie Properties in CDH 5.2.0
- Sentry Properties in CDH 5.2.0
- Solr Properties in CDH 5.2.0
- Spark Properties in CDH 5.2.0
- Spark (Standalone) Properties in CDH 5.2.0
- Sqoop 1 Client Properties in CDH 5.2.0
- Sqoop 2 Properties in CDH 5.2.0
- YARN (MR2 Included) Properties in CDH 5.2.0
- ZooKeeper Properties in CDH 5.2.0
- CDH 5.1.0 Properties
- Accumulo 1.6 Properties in CDH 5.1.0
- Flume Properties in CDH 5.1.0
- HBase Properties in CDH 5.1.0
- HDFS Properties in CDH 5.1.0
- Hive Properties in CDH 5.1.0
- Hue Properties in CDH 5.1.0
- Impala Properties in CDH 5.1.0
- Kafka Properties in CDH 5.1.0
- Key-Value Store Indexer Properties in CDH 5.1.0
- MapReduce Properties in CDH 5.1.0
- Oozie Properties in CDH 5.1.0
- Sentry Properties in CDH 5.1.0
- Solr Properties in CDH 5.1.0
- Spark Properties in CDH 5.1.0
- Spark (Standalone) Properties in CDH 5.1.0
- Sqoop 1 Client Properties in CDH 5.1.0
- Sqoop 2 Properties in CDH 5.1.0
- YARN (MR2 Included) Properties in CDH 5.1.0
- ZooKeeper Properties in CDH 5.1.0
- CDH 5.0.0 Properties
- Accumulo 1.6 Properties in CDH 5.0.0
- Flume Properties in CDH 5.0.0
- HBase Properties in CDH 5.0.0
- HDFS Properties in CDH 5.0.0
- Hive Properties in CDH 5.0.0
- Hue Properties in CDH 5.0.0
- Impala Properties in CDH 5.0.0
- Kafka Properties in CDH 5.0.0
- Key-Value Store Indexer Properties in CDH 5.0.0
- MapReduce Properties in CDH 5.0.0
- Oozie Properties in CDH 5.0.0
- Solr Properties in CDH 5.0.0
- Spark Properties in CDH 5.0.0
- Spark (Standalone) Properties in CDH 5.0.0
- Sqoop 1 Client Properties in CDH 5.0.0
- Sqoop 2 Properties in CDH 5.0.0
- YARN (MR2 Included) Properties in CDH 5.0.0
- ZooKeeper Properties in CDH 5.0.0
- CDH 4.7.0 Properties
- Accumulo Properties in CDH 4.7.0
- Accumulo 1.6 Properties in CDH 4.7.0
- Flume Properties in CDH 4.7.0
- HBase Properties in CDH 4.7.0
- HDFS Properties in CDH 4.7.0
- Hive Properties in CDH 4.7.0
- Hue Properties in CDH 4.7.0
- Impala Properties in CDH 4.7.0
- Key-Value Store Indexer Properties in CDH 4.7.0
- MapReduce Properties in CDH 4.7.0
- Oozie Properties in CDH 4.7.0
- Solr Properties in CDH 4.7.0
- Spark (Standalone) Properties in CDH 4.7.0
- Sqoop 2 Properties in CDH 4.7.0
- YARN (MR2 Included) Properties in CDH 4.7.0
- ZooKeeper Properties in CDH 4.7.0
- CDH 4.6.0 Properties
- Accumulo Properties in CDH 4.6.0
- Accumulo 1.6 Properties in CDH 4.6.0
- Flume Properties in CDH 4.6.0
- HBase Properties in CDH 4.6.0
- HDFS Properties in CDH 4.6.0
- Hive Properties in CDH 4.6.0
- Hue Properties in CDH 4.6.0
- Impala Properties in CDH 4.6.0
- Key-Value Store Indexer Properties in CDH 4.6.0
- MapReduce Properties in CDH 4.6.0
- Oozie Properties in CDH 4.6.0
- Solr Properties in CDH 4.6.0
- Spark (Standalone) Properties in CDH 4.6.0
- Sqoop 2 Properties in CDH 4.6.0
- YARN (MR2 Included) Properties in CDH 4.6.0
- ZooKeeper Properties in CDH 4.6.0
- CDH 4.5.0 Properties
- Accumulo Properties in CDH 4.5.0
- Accumulo 1.6 Properties in CDH 4.5.0
- Flume Properties in CDH 4.5.0
- HBase Properties in CDH 4.5.0
- HDFS Properties in CDH 4.5.0
- Hive Properties in CDH 4.5.0
- Hue Properties in CDH 4.5.0
- Impala Properties in CDH 4.5.0
- Key-Value Store Indexer Properties in CDH 4.5.0
- MapReduce Properties in CDH 4.5.0
- Oozie Properties in CDH 4.5.0
- Solr Properties in CDH 4.5.0
- Spark (Standalone) Properties in CDH 4.5.0
- Sqoop 2 Properties in CDH 4.5.0
- YARN (MR2 Included) Properties in CDH 4.5.0
- ZooKeeper Properties in CDH 4.5.0
- CDH 4.4.0 Properties
- Accumulo Properties in CDH 4.4.0
- Accumulo 1.6 Properties in CDH 4.4.0
- Flume Properties in CDH 4.4.0
- HBase Properties in CDH 4.4.0
- HDFS Properties in CDH 4.4.0
- Hive Properties in CDH 4.4.0
- Hue Properties in CDH 4.4.0
- Impala Properties in CDH 4.4.0
- Key-Value Store Indexer Properties in CDH 4.4.0
- MapReduce Properties in CDH 4.4.0
- Oozie Properties in CDH 4.4.0
- Solr Properties in CDH 4.4.0
- Spark (Standalone) Properties in CDH 4.4.0
- Sqoop 2 Properties in CDH 4.4.0
- YARN (MR2 Included) Properties in CDH 4.4.0
- ZooKeeper Properties in CDH 4.4.0
- CDH 4.3.0 Properties
- Accumulo Properties in CDH 4.3.0
- Accumulo 1.6 Properties in CDH 4.3.0
- Flume Properties in CDH 4.3.0
- HBase Properties in CDH 4.3.0
- HDFS Properties in CDH 4.3.0
- Hive Properties in CDH 4.3.0
- Hue Properties in CDH 4.3.0
- Impala Properties in CDH 4.3.0
- Key-Value Store Indexer Properties in CDH 4.3.0
- MapReduce Properties in CDH 4.3.0
- Oozie Properties in CDH 4.3.0
- Solr Properties in CDH 4.3.0
- Sqoop 2 Properties in CDH 4.3.0
- YARN (MR2 Included) Properties in CDH 4.3.0
- ZooKeeper Properties in CDH 4.3.0
- CDH 4.2.0 Properties
- Accumulo Properties in CDH 4.2.0
- Accumulo 1.6 Properties in CDH 4.2.0
- Flume Properties in CDH 4.2.0
- HBase Properties in CDH 4.2.0
- HDFS Properties in CDH 4.2.0
- Hive Properties in CDH 4.2.0
- Hue Properties in CDH 4.2.0
- Impala Properties in CDH 4.2.0
- MapReduce Properties in CDH 4.2.0
- Oozie Properties in CDH 4.2.0
- YARN (MR2 Included) Properties in CDH 4.2.0
- ZooKeeper Properties in CDH 4.2.0
- CDH 4.1.0 Properties
- Accumulo Properties in CDH 4.1.0
- Accumulo 1.6 Properties in CDH 4.1.0
- Flume Properties in CDH 4.1.0
- HBase Properties in CDH 4.1.0
- HDFS Properties in CDH 4.1.0
- Hive Properties in CDH 4.1.0
- Hue Properties in CDH 4.1.0
- Impala Properties in CDH 4.1.0
- MapReduce Properties in CDH 4.1.0
- Oozie Properties in CDH 4.1.0
- YARN (MR2 Included) Properties in CDH 4.1.0
- ZooKeeper Properties in CDH 4.1.0
- CDH 4.0.0 Properties
- Accumulo Properties in CDH 4.0.0
- Accumulo 1.6 Properties in CDH 4.0.0
- Flume Properties in CDH 4.0.0
- HBase Properties in CDH 4.0.0
- HDFS Properties in CDH 4.0.0
- Hive Properties in CDH 4.0.0
- Hue Properties in CDH 4.0.0
- MapReduce Properties in CDH 4.0.0
- Oozie Properties in CDH 4.0.0
- YARN (MR2 Included) Properties in CDH 4.0.0
- ZooKeeper Properties in CDH 4.0.0
- Host Configuration Properties
- Cloudera Manager Server Properties
- Cloudera Management Service Properties
- CDH 5.7.0 Properties
- Managing CDH Using the Command Line
- Managing Individual Services
- Managing Flume
- Managing HBase
- Managing HBase
- Starting and Stopping HBase
- Accessing HBase by using the HBase Shell
- Using HBase Command-Line Utilities
- Configuring HBase Garbage Collection
- Configuring the HBase Canary
- Checking and Repairing HBase Tables
- Hedged Reads
- Configuring the Blocksize for HBase
- Configuring the HBase BlockCache
- Configuring the HBase Scanner Heartbeat
- Limiting the Speed of Compactions
- Reading Data from HBase
- HBase Filtering
- Writing Data to HBase
- Importing Data Into HBase
- Configuring and Using the HBase REST API
- Configuring HBase MultiWAL Support
- Storing Medium Objects (MOBs) in HBase
- Configuring the Storage Policy for the Write-Ahead Log (WAL)
- Exposing HBase Metrics to a Ganglia Server
- Managing HDFS
- Managing Federated Nameservices
- NameNodes
- DataNodes
- JournalNodes
- Configuring Short-Circuit Reads
- Configuring HDFS Trash
- HDFS Balancers
- Enabling WebHDFS
- Adding HttpFS
- Adding and Configuring an NFS Gateway
- Setting HDFS Quotas
- Configuring Mountable HDFS
- Configuring Centralized Cache Management in HDFS
- Using CDH with Isilon Storage
- Configuring Heterogeneous Storage in HDFS
- Managing Hive
- Managing Hue
- Managing Impala
- Managing Key-Value Store Indexer
- Managing Oozie
- Configuring Oozie for High Availability
- Adding the Oozie Service Using Cloudera Manager
- Redeploying the Oozie ShareLib
- Configuring Oozie Data Purge Settings Using Cloudera Manager
- Adding Schema to Oozie Using Cloudera Manager
- Enabling the Oozie Web Console
- Setting the Oozie Database Timezone
- Scheduling in Oozie Using Cron-like Syntax
- Managing Solr
- Managing Spark
- Managing the Sqoop 1 Client
- Managing Sqoop 2
- Managing YARN (MRv2) and MapReduce (MRv1)
- Managing ZooKeeper
- Configuring Services to Use the GPL Extras Parcel
- Managing CDH and Managed Services Using Cloudera Manager
- Performance Management
- Resource Management
- High Availability
- HDFS High Availability
- MapReduce (MRv1) and YARN (MRv2) High Availability
- Cloudera Navigator Key Trustee Server High Availability
- Key Trustee KMS High Availability
- High Availability for Other CDH Components
- Configuring Cloudera Manager for High Availability With a Load Balancer
- Introduction to Cloudera Manager Deployment Architecture
- Prerequisites for Setting up Cloudera Manager High Availability
- Cloudera Manager Failover Protection
- High-Level Steps to Configure Cloudera Manager High Availability
- Database High Availability Configuration
- TLS and Kerberos Configuration for Cloudera Manager High Availability
- Backup and Disaster Recovery
- Cloudera Manager Administration
- Starting, Stopping, and Restarting the Cloudera Manager Server
- Configuring Cloudera Manager Server Ports
- Moving the Cloudera Manager Server to a New Host
- Managing the Cloudera Manager Server Log
- Cloudera Manager Agents
- Changing Hostnames
- Configuring Network Settings
- Alerts
- Managing Licenses
- Sending Usage and Diagnostic Data to Cloudera
- Exporting and Importing Cloudera Manager Configuration
- Backing up Cloudera Manager
- Other Cloudera Manager Tasks and Settings
- Cloudera Management Service
- Cloudera Navigator Data Management Component Administration
- Managing CDH and Managed Services
- Cloudera Data Management
- Cloudera Operation
- Monitoring and Diagnostics
- Introduction to Cloudera Manager Monitoring
- Monitoring Clusters
- Monitoring Multiple CDH Deployments Using the Multi Cloudera Manager Dashboard
- Monitoring Services
- Monitoring Hosts
- Monitoring Activities
- Events
- Triggers
- Lifecycle and Security Auditing
- Charting Time-Series Data
- Logs
- Reports
- Troubleshooting Cluster Configuration and Operation
- Cloudera Manager Entity Types
- Cloudera Manager Entity Type Attributes
- Cloudera Manager Events
- Cloudera Manager Health Tests
- Active Database Health Tests
- Active Key Trustee Server Health Tests
- Activity Monitor Health Tests
- Alert Publisher Health Tests
- Beeswax Server Health Tests
- Cloudera Management Service Health Tests
- DSSD DataNode Health Tests
- DataNode Health Tests
- Event Server Health Tests
- Failover Controller Health Tests
- Flume Health Tests
- Flume Agent Health Tests
- Garbage Collector Health Tests
- HBase Health Tests
- HBase REST Server Health Tests
- HBase Thrift Server Health Tests
- HDFS Health Tests
- History Server Health Tests
- Hive Health Tests
- Hive Metastore Server Health Tests
- HiveServer2 Health Tests
- Host Health Tests
- Host Monitor Health Tests
- HttpFS Health Tests
- Hue Health Tests
- Hue Server Health Tests
- Impala Health Tests
- Impala Catalog Server Health Tests
- Impala Daemon Health Tests
- Impala Llama ApplicationMaster Health Tests
- Impala StateStore Health Tests
- JobHistory Server Health Tests
- JobTracker Health Tests
- JournalNode Health Tests
- Kafka Broker Health Tests
- Kafka MirrorMaker Health Tests
- Kerberos Ticket Renewer Health Tests
- Key Management Server Health Tests
- Key Management Server Proxy Health Tests
- Key-Value Store Indexer Health Tests
- Lily HBase Indexer Health Tests
- Load Balancer Health Tests
- Logger Health Tests
- MapReduce Health Tests
- Master Health Tests
- Monitor Health Tests
- NFS Gateway Health Tests
- NameNode Health Tests
- Navigator Audit Server Health Tests
- Navigator Metadata Server Health Tests
- NodeManager Health Tests
- Oozie Health Tests
- Oozie Server Health Tests
- Passive Database Health Tests
- Passive Key Trustee Server Health Tests
- RegionServer Health Tests
- Reports Manager Health Tests
- ResourceManager Health Tests
- SecondaryNameNode Health Tests
- Sentry Health Tests
- Sentry Server Health Tests
- Service Monitor Health Tests
- Solr Health Tests
- Solr Server Health Tests
- Sqoop 2 Health Tests
- Sqoop 2 Server Health Tests
- Tablet Server Health Tests
- TaskTracker Health Tests
- Tracer Health Tests
- WebHCat Server Health Tests
- Worker Health Tests
- YARN (MR2 Included) Health Tests
- ZooKeeper Health Tests
- ZooKeeper Server Health Tests
- Cloudera Manager Metrics
- Accumulo Metrics
- Accumulo 1.6 Metrics
- Active Database Metrics
- Active Key Trustee Server Metrics
- Activity Metrics
- Activity Monitor Metrics
- Agent Metrics
- Alert Publisher Metrics
- Attempt Metrics
- Beeswax Server Metrics
- Cloudera Management Service Metrics
- Cloudera Manager Server Metrics
- Cluster Metrics
- DSSD DataNode Metrics
- DataNode Metrics
- Directory Metrics
- Disk Metrics
- Event Server Metrics
- Failover Controller Metrics
- Filesystem Metrics
- Flume Metrics
- Flume Channel Metrics
- Flume Sink Metrics
- Flume Source Metrics
- Garbage Collector Metrics
- HBase Metrics
- HBase REST Server Metrics
- HBase RegionServer Replication Peer Metrics
- HBase Thrift Server Metrics
- HDFS Metrics
- HDFS Cache Directive Metrics
- HDFS Cache Pool Metrics
- HRegion Metrics
- HTable Metrics
- History Server Metrics
- Hive Metrics
- Hive Metastore Server Metrics
- HiveServer2 Metrics
- Host Metrics
- Host Monitor Metrics
- HttpFS Metrics
- Hue Metrics
- Hue Server Metrics
- Impala Metrics
- Impala Catalog Server Metrics
- Impala Daemon Metrics
- Impala Daemon Resource Pool Metrics
- Impala Llama ApplicationMaster Metrics
- Impala Pool Metrics
- Impala Pool User Metrics
- Impala Query Metrics
- Impala StateStore Metrics
- Isilon Metrics
- Java KeyStore KMS Metrics
- JobHistory Server Metrics
- JobTracker Metrics
- JournalNode Metrics
- Kafka Metrics
- Kafka Broker Metrics
- Kafka Broker Topic Metrics
- Kafka MirrorMaker Metrics
- Kafka Replica Metrics
- Kerberos Ticket Renewer Metrics
- Key Management Server Metrics
- Key Management Server Proxy Metrics
- Key Trustee KMS Metrics
- Key Trustee Server Metrics
- Key-Value Store Indexer Metrics
- Lily HBase Indexer Metrics
- Load Balancer Metrics
- Logger Metrics
- MapReduce Metrics
- Master Metrics
- Monitor Metrics
- NFS Gateway Metrics
- NameNode Metrics
- Navigator Audit Server Metrics
- Navigator Metadata Server Metrics
- Network Interface Metrics
- NodeManager Metrics
- Oozie Metrics
- Oozie Server Metrics
- Passive Database Metrics
- Passive Key Trustee Server Metrics
- RegionServer Metrics
- Reports Manager Metrics
- ResourceManager Metrics
- SecondaryNameNode Metrics
- Sentry Metrics
- Sentry Server Metrics
- Server Metrics
- Service Monitor Metrics
- Solr Metrics
- Solr Replica Metrics
- Solr Server Metrics
- Solr Shard Metrics
- Spark Metrics
- Spark (Standalone) Metrics
- Sqoop 1 Client Metrics
- Sqoop 2 Metrics
- Sqoop 2 Server Metrics
- Tablet Server Metrics
- TaskTracker Metrics
- Time Series Table Metrics
- Tracer Metrics
- User Metrics
- WebHCat Server Metrics
- Worker Metrics
- YARN (MR2 Included) Metrics
- YARN Pool Metrics
- YARN Pool User Metrics
- ZooKeeper Metrics
- Monitoring and Diagnostics
- Cloudera Security
- Security Overview for an Enterprise Data Hub
- Configuring Authentication
- Configuring Authentication in Cloudera Manager
- Cloudera Manager User Accounts
- Configuring External Authentication for Cloudera Manager
- Kerberos Concepts - Principals, Keytabs and Delegation Tokens
- Enabling Kerberos Authentication Using the Wizard
- Step 1: Install Cloudera Manager and CDH
- Step 2: If You are Using AES-256 Encryption, Install the JCE Policy File
- Step 3: Get or Create a Kerberos Principal for the Cloudera Manager Server
- Step 4: Enabling Kerberos Using the Wizard
- Step 5: Create the HDFS Superuser
- Step 6: Get or Create a Kerberos Principal for Each User Account
- Step 7: Prepare the Cluster for Each User
- Step 8: Verify that Kerberos Security is Working
- Step 9: (Optional) Enable Authentication for HTTP Web Consoles for Hadoop Roles
- Enabling Kerberos Authentication for Single User Mode or Non-Default Users
- Configuring a Cluster with Custom Kerberos Principals
- Viewing and Regenerating Kerberos Principals
- Using a Custom Kerberos Keytab Retrieval Script
- Mapping Kerberos Principals to Short Names
- Moving Kerberos Principals to Another OU Within Active Directory
- Using Auth-to-Local Rules to Isolate Cluster Users
- Enabling Kerberos Authentication Without the Wizard
- Step 1: Install Cloudera Manager and CDH
- Step 2: If You are Using AES-256 Encryption, Install the JCE Policy File
- Step 3: Get or Create a Kerberos Principal for the Cloudera Manager Server
- Step 4: Import KDC Account Manager Credentials
- Step 5: Configure the Kerberos Default Realm in the Cloudera Manager Admin Console
- Step 6: Stop All Services
- Step 7: Enable Hadoop Security
- Step 8: Wait for the Generate Credentials Command to Finish
- Step 9: Enable Hue to Work with Hadoop Security using Cloudera Manager
- Step 10: (Flume Only) Use Substitution Variables for the Kerberos Principal and Keytab
- Step 11: (CDH 4.0 and 4.1 only) Configure Hue to Use a Local Hive Metastore
- Step 12: Start All Services
- Step 13: Deploy Client Configurations
- Step 14: Create the HDFS Superuser Principal
- Step 15: Get or Create a Kerberos Principal for Each User Account
- Step 16: Prepare the Cluster for Each User
- Step 17: Verify that Kerberos Security is Working
- Step 18: (Optional) Enable Authentication for HTTP Web Consoles for Hadoop Roles
- Configuring Authentication in the Cloudera Navigator Data Management Component
- Configuring Authentication in CDH Using the Command Line
- Enabling Kerberos Authentication for Hadoop Using the Command Line
- Step 1: Install CDH 5
- Step 2: Verify User Accounts and Groups in CDH 5 Due to Security
- Step 3: If you are Using AES-256 Encryption, Install the JCE Policy File
- Step 4: Create and Deploy the Kerberos Principals and Keytab Files
- Step 5: Shut Down the Cluster
- Step 6: Enable Hadoop Security
- Step 7: Configure Secure HDFS
- Optional Step 8: Configuring Security for HDFS High Availability
- Optional Step 9: Configure secure WebHDFS
- Optional Step 10: Configuring a secure HDFS NFS Gateway
- Step 11: Set Variables for Secure DataNodes
- Step 12: Start up the NameNode
- Step 12: Start up a DataNode
- Step 14: Set the Sticky Bit on HDFS Directories
- Step 15: Start up the Secondary NameNode (if used)
- Step 16: Configure Either MRv1 Security or YARN Security
- FUSE Kerberos Configuration
- Using kadmin to Create Kerberos Keytab Files
- Configuring the Mapping from Kerberos Principals to Short Names
- Enabling Debugging Output for the Sun Kerberos Classes
- Enabling Kerberos Authentication for Hadoop Using the Command Line
- Flume Authentication
- HBase Authentication
- HCatalog Authentication
- Hive Authentication
- HttpFS Authentication
- Hue Authentication
- Impala Authentication
- Llama Authentication
- Oozie Authentication
- Search Authentication
- Spark Authentication
- Sqoop Authentication
- ZooKeeper Authentication
- Hadoop Users in Cloudera Manager and CDH
- Configuring a Cluster-dedicated MIT KDC with Cross-Realm Trust
- Integrating Hadoop Security with Active Directory
- Integrating Hadoop Security with Alternate Authentication
- Authenticating Kerberos Principals in Java Code
- Using a Web Browser to Access an URL Protected by Kerberos HTTP SPNEGO
- Troubleshooting Authentication Issues
- Configuring Authentication in Cloudera Manager
- Configuring Encryption
- TLS/SSL Certificates Overview
- Configuring TLS Security for Cloudera Manager
- Configuring TLS Encryption Only for Cloudera Manager
- Level 1: Configuring TLS Encryption for Cloudera Manager Agents
- Level 2: Configuring TLS Verification of Cloudera Manager Server by the Agents
- Level 3: Configuring TLS Authentication of Agents to the Cloudera Manager Server
- HTTPS Communication in Cloudera Manager
- Troubleshooting TLS/SSL Connectivity
- Deploying the Cloudera Manager Keystore for Level 1 TLS with Self-Signed Certificates
- Configuring TLS/SSL for the Cloudera Navigator Data Management Component
- Configuring TLS/SSL for Cloudera Management Service Roles
- Configuring TLS/SSL Encryption for CDH Services
- Configuring TLS/SSL for HDFS, YARN and MapReduce
- Configuring TLS/SSL for HBase
- Configuring TLS/SSL for Flume Thrift Source and Sink
- Configuring Encrypted Communication Between HiveServer2 and Client Drivers
- Configuring TLS/SSL for Hue
- Configuring TLS/SSL for Impala
- Configuring TLS/SSL for Oozie
- Configuring TLS/SSL for Solr
- Spark Encryption
- Configuring TLS/SSL for HttpFS
- Encrypted Shuffle and Encrypted Web UIs
- Deployment Planning for Data at Rest Encryption
- HDFS Transparent Encryption
- Optimizing for HDFS Data at Rest Encryption
- Enabling HDFS Encryption Using the Wizard
- Managing Encryption Keys and Zones
- Configuring the Key Management Server (KMS)
- Securing the Key Management Server (KMS)
- Migrating Keys from a Java KeyStore to Navigator Key Trustee Server
- Configuring CDH Services for HDFS Encryption
- Troubleshooting HDFS Encryption
- Cloudera Navigator Key Trustee Server
- Cloudera Navigator Key HSM
- Cloudera Navigator Encrypt
- Configuring Encryption for Data Spills
- Configuring Encrypted HDFS Data Transport
- Configuring Encrypted HBase Data Transport
- Configuring Authorization
- Cloudera Manager User Roles
- Cloudera Navigator Data Management Component User Roles
- HDFS Extended ACLs
- Configuring LDAP Group Mappings
- Authorization With Apache Sentry
- Configuring HBase Authorization
- Sensitive Data Redaction
- Overview of Impala Security
- Miscellaneous Topics
- Impala Guide
- Impala Concepts and Architecture
- Planning for Impala Deployment
- Impala Tutorials
- Impala Administration
- Impala SQL Language Reference
- Comments
- Data Types
- Literals
- SQL Operators
- Schema Objects and Object Names
- SQL Statements
- DDL Statements
- DML Statements
- ALTER TABLE
- ALTER VIEW
- COMPUTE STATS
- CREATE DATABASE
- CREATE FUNCTION
- CREATE ROLE
- CREATE TABLE
- CREATE VIEW
- DESCRIBE
- DROP DATABASE
- DROP FUNCTION
- DROP ROLE
- DROP STATS
- DROP TABLE
- DROP VIEW
- EXPLAIN
- GRANT
- INSERT
- INVALIDATE METADATA
- LOAD DATA
- REFRESH
- REVOKE
- SELECT
- SET
- Query Options for the SET Statement
- ABORT_ON_DEFAULT_LIMIT_EXCEEDED Query Option
- ABORT_ON_ERROR Query Option
- ALLOW_UNSUPPORTED_FORMATS Query Option
- APPX_COUNT_DISTINCT Query Option
- BATCH_SIZE Query Option
- COMPRESSION_CODEC Query Option
- DEBUG_ACTION Query Option
- DEFAULT_ORDER_BY_LIMIT Query Option
- DISABLE_CODEGEN Query Option
- DISABLE_ROW_RUNTIME_FILTERING Query Option
- DISABLE_STREAMING_PREAGGREGATIONS Query Option
- DISABLE_UNSAFE_SPILLS Query Option
- EXEC_SINGLE_NODE_ROWS_THRESHOLD Query Option
- EXPLAIN_LEVEL Query Option
- HBASE_CACHE_BLOCKS Query Option
- HBASE_CACHING Query Option
- LIVE_PROGRESS Query Option
- LIVE_SUMMARY Query Option
- MAX_ERRORS Query Option
- MAX_IO_BUFFERS Query Option
- MAX_SCAN_RANGE_LENGTH Query Option
- MAX_NUM_RUNTIME_FILTERS Query Option
- MEM_LIMIT Query Option
- NUM_NODES Query Option
- NUM_SCANNER_THREADS Query Option
- OPTIMIZE_PARTITION_KEY_SCANS Query Option
- PARQUET_COMPRESSION_CODEC Query Option
- PARQUET_FILE_SIZE Query Option
- QUERY_TIMEOUT_S Query Option
- REQUEST_POOL Query Option
- SCHEDULE_RANDOM_REPLICA Query Option
- RESERVATION_REQUEST_TIMEOUT Query Option (CDH 5 only)
- RUNTIME_BLOOM_FILTER_SIZE Query Option
- RUNTIME_FILTER_MODE Query Option
- RUNTIME_FILTER_WAIT_TIME_MS Query Option
- SCAN_NODE_CODEGEN_THRESHOLD Query Option
- SUPPORT_START_OVER Query Option
- SYNC_DDL Query Option
- V_CPU_CORES Query Option (CDH 5 only)
- Query Options for the SET Statement
- SHOW
- TRUNCATE TABLE
- USE
- Built-In Functions
- SQL Differences Between Impala and Hive
- Porting SQL from Other Database Systems to Impala
- Using the Impala Shell (impala-shell Command)
- Tuning Impala for Performance
- Impala Performance Guidelines and Best Practices
- Performance Considerations for Join Queries
- How Impala Uses Statistics for Query Optimization
- Benchmarking Impala Queries
- Controlling Impala Resource Usage
- Runtime Filtering for Impala Queries
- Using HDFS Caching with Impala (CDH 5.1 or higher only)
- Testing Impala Performance
- Understanding Impala Query Performance - EXPLAIN Plans and Query Profiles
- Detecting and Correcting HDFS Block Skew Conditions
- Scalability Considerations for Impala
- Partitioning for Impala Tables
- How Impala Works with Hadoop File Formats
- Using Impala to Query HBase Tables
- Using Impala to Query the Amazon S3 Filesystem (Unsupported Preview)
- Using Impala with Isilon Storage
- Using Impala Logging
- Troubleshooting Impala
- Ports Used by Impala
- Impala Reserved Words
- Cloudera Search Guide
- Cloudera Search User Guide
- Cloudera Search Tutorial
- Managing Solr Using solrctl
- Spark Indexing Reference (CDH 5.2 and higher only)
- MapReduce Batch Indexing Reference
- Flume Near Real-Time Indexing Reference
- Extracting, Transforming, and Loading Data With Cloudera Morphlines
- Using the Lily HBase Batch Indexer for Indexing
- Configuring the Lily HBase NRT Indexer Service for Use with Cloudera Search
- Schemaless Mode Overview and Best Practices
- Using Search through a Proxy for High Availability
- Migrating Solr Replicas
- Using Custom JAR Files with Search
- Troubleshooting Cloudera Search
- Cloudera Search User Guide
- Spark Guide
- Running Your First Spark Application
- Spark Application Overview
- Developing Spark Applications
- Running Spark Applications
- Spark and Hadoop Integration
- Cloudera Glossary