Master Metrics
In addition to these base metrics, many aggregate metrics are available. If an entity type has parents defined, you can formulate all possible aggregate metrics using the formula base_metric_across_parents.
In addition, metrics for aggregate totals can be formed by adding the prefix total_ to the front of the metric name.
Use the type-ahead feature in the Cloudera Manager chart browser to find the exact aggregate metric name, in case the plural form does not end in "s".
For example, the following metric names may be valid for Master:
- add_column_above_one_sec_avg_time_across_clusters
- total_add_column_above_one_sec_avg_time_across_clusters
Some metrics, such as alerts_rate, apply to nearly every metric context. Others only apply to a certain service or role.
For more information about metrics, see Cloudera Manager Metrics and Metric Aggregation.
Metric Name | Description | Unit | Parents | CDH Version |
---|---|---|---|---|
add_column_above_one_sec_avg_time | Add Column Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
add_column_above_one_sec_rate | Add Column Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
add_column_avg_time | Add Column Average Time | ms | cluster, hbase, rack | CDH 4 |
add_column_rate | Add Column Operations | operations per second | cluster, hbase, rack | CDH 4 |
alerts_rate | The number of alerts. | events per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
assign_above_one_sec_avg_time | Assign Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
assign_above_one_sec_rate | Assign Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
assign_avg_time | Assign Average Time | ms | cluster, hbase, rack | CDH 4 |
assign_rate | Assign Operations | operations per second | cluster, hbase, rack | CDH 4, CDH 5 |
audit_agent_bytes_left_rate | Number of bytes of audit events still to be sent to the Audit Server. | bytes per second | cluster, hbase, rack | CDH 4, CDH 5 |
audit_agent_bytes_sent_rate | Number of bytes of audit events successfully sent by Cloudera Manager Agent to the Audit Server. | bytes per second | cluster, hbase, rack | CDH 4, CDH 5 |
audit_agent_events_sent_rate | Number of audit events successfully sent by Cloudera Manager Agent to the Audit Server. | events per second | cluster, hbase, rack | CDH 4, CDH 5 |
audit_agent_send_failures_rate | Number of time Cloudera Manager Agent encountered failure when sending audit event(s) to the Audit Server. | errors per second | cluster, hbase, rack | CDH 4, CDH 5 |
audit_plugin_coalesced_rate | Number of audit events coalesced | events per second | cluster, hbase, rack | CDH 4, CDH 5 |
audit_plugin_events_rate | Number of audit events persisted to local filesystem | events per second | cluster, hbase, rack | CDH 4, CDH 5 |
audit_plugin_exceptions_rate | Number of exception encountered when processing audit events | errors per second | cluster, hbase, rack | CDH 4, CDH 5 |
audit_plugin_filtered_rate | Number of audit events that were filtered based on user specified criteria | events per second | cluster, hbase, rack | CDH 4, CDH 5 |
audit_plugin_parse_errors_rate | Number of audit events that cannot be parsed | errors per second | cluster, hbase, rack | CDH 4, CDH 5 |
balance_above_one_sec_avg_time | Balance Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
balance_above_one_sec_rate | Balance Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
balance_avg_time | Balance Average Time | ms | cluster, hbase, rack | CDH 4 |
balance_rate | Balance Operations | operations per second | cluster, hbase, rack | CDH 4 |
balance_switch_above_one_sec_avg_time | Balance Switch Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
balance_switch_above_one_sec_rate | Balance Switch Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
balance_switch_avg_time | Balance Switch Average Time | ms | cluster, hbase, rack | CDH 4 |
balance_switch_rate | Balance Switch Operations | operations per second | cluster, hbase, rack | CDH 4 |
canary_duration | Duration of the last or currently running canary job | ms | cluster, hbase, rack | CDH 4, CDH 5 |
cgroup_cpu_system_rate | CPU usage of the role's cgroup | seconds per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
cgroup_cpu_user_rate | User Space CPU usage of the role's cgroup | seconds per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
cgroup_mem_page_cache | Page cache usage of the role's cgroup | bytes | accumulo16, cluster, rack | CDH 4, CDH 5 |
cgroup_mem_rss | Resident memory of the role's cgroup | bytes | accumulo16, cluster, rack | CDH 4, CDH 5 |
cgroup_mem_swap | Swap usage of the role's cgroup | bytes | accumulo16, cluster, rack | CDH 4, CDH 5 |
cgroup_read_bytes_rate | Bytes read from all disks by the role's cgroup | bytes per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
cgroup_read_ios_rate | Number of read I/O operations from all disks by the role's cgroup | ios per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
cgroup_write_bytes_rate | Bytes written to all disks by the role's cgroup | bytes per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
cgroup_write_ios_rate | Number of write I/O operations to all disks by the role's cgroup | ios per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
cluster_requests_rate | Cluster requests | requests per second | cluster, hbase, rack | CDH 4, CDH 5 |
cpu_system_rate | Total System CPU | seconds per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
cpu_user_rate | Total CPU user time | seconds per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
create_table_above_one_sec_avg_time | Create Table Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
create_table_above_one_sec_rate | Create Table Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
create_table_avg_time | Create Table Average Time | ms | cluster, hbase, rack | CDH 4 |
create_table_rate | Create Table Operations | operations per second | cluster, hbase, rack | CDH 4 |
delete_column_above_one_sec_avg_time | Delete Column Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
delete_column_above_one_sec_rate | Delete Column Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
delete_column_avg_time | Delete Column Average Time | ms | cluster, hbase, rack | CDH 4 |
delete_column_rate | Delete Column Operations | operations per second | cluster, hbase, rack | CDH 4 |
delete_table_above_one_sec_avg_time | Delete Table Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
delete_table_above_one_sec_rate | Delete Table Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
delete_table_avg_time | Delete Table Average Time | ms | cluster, hbase, rack | CDH 4 |
delete_table_rate | Delete Table Operations | operations per second | cluster, hbase, rack | CDH 4 |
disable_table_above_one_sec_avg_time | Disable Table Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
disable_table_above_one_sec_rate | Disable Table Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
disable_table_avg_time | Disable Table Average Time | ms | cluster, hbase, rack | CDH 4 |
disable_table_rate | Disable Table Operations | operations per second | cluster, hbase, rack | CDH 4 |
enable_table_above_one_sec_avg_time | Enable Table Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
enable_table_above_one_sec_rate | Enable Table Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
enable_table_avg_time | Enable Table Average Time | ms | cluster, hbase, rack | CDH 4 |
enable_table_rate | Enable Table Operations | operations per second | cluster, hbase, rack | CDH 4 |
events_critical_rate | The number of critical events. | events per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
events_important_rate | The number of important events. | events per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
events_informational_rate | The number of informational events. | events per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
exec_coprocessor_avg_time | Execute Coprocessor Average Time | ms | cluster, hbase, rack | CDH 4 |
exec_coprocessor_rate | Execute Coprocessor Operations | operations per second | cluster, hbase, rack | CDH 4 |
fd_max | Maximum number of file descriptors | file descriptors | accumulo16, cluster, rack | CDH 4, CDH 5 |
fd_open | Open file descriptors. | file descriptors | accumulo16, cluster, rack | CDH 4, CDH 5 |
get_alter_status_above_one_sec_avg_time | Get Alter Status Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
get_alter_status_above_one_sec_rate | Get Alter Status Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
get_alter_status_avg_time | Get Alter Status Average Time | ms | cluster, hbase, rack | CDH 4 |
get_alter_status_rate | Get Alter Status Operations | operations per second | cluster, hbase, rack | CDH 4 |
get_block_cache_column_family_summaries_avg_time | Get Block Cache Column Family Summaries Average Time | ms | cluster, hbase, rack | CDH 4 |
get_block_cache_column_family_summaries_rate | Get Block Cache Column Family Summaries Operations | operations per second | cluster, hbase, rack | CDH 4 |
get_cluster_status_above_one_sec_avg_time | Get Cluster Status Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
get_cluster_status_above_one_sec_rate | Get Cluster Status Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
get_cluster_status_avg_time | Get Cluster Status Average Time | ms | cluster, hbase, rack | CDH 4 |
get_cluster_status_rate | Get Cluster Status Operations | operations per second | cluster, hbase, rack | CDH 4 |
get_h_table_descriptors_above_one_sec_avg_time | Get HTable Descriptors Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
get_h_table_descriptors_above_one_sec_rate | Get HTable Descriptors Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
get_h_table_descriptors_avg_time | Get HTable Descriptors Average Time | ms | cluster, hbase, rack | CDH 4 |
get_h_table_descriptors_rate | Get HTable Descriptors Operations | operations per second | cluster, hbase, rack | CDH 4 |
get_protocol_signature_above_one_sec_avg_time | Get Protocol Signature Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
get_protocol_signature_above_one_sec_rate | Get Protocol Signature Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
get_protocol_signature_avg_time | Get Protocol Signature Average Time | ms | cluster, hbase, rack | CDH 4 |
get_protocol_signature_rate | Get Protocol Signature Operations | operations per second | cluster, hbase, rack | CDH 4 |
get_protocol_version_above_one_sec_avg_time | Get Protocol Version Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
get_protocol_version_above_one_sec_rate | Get Protocol Version Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
get_protocol_version_avg_time | Get Protocol Version Average Time | ms | cluster, hbase, rack | CDH 4 |
get_protocol_version_rate | Get Protocol Version Operations | operations per second | cluster, hbase, rack | CDH 4 |
health_bad_rate | Percentage of Time with Bad Health | seconds per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
health_concerning_rate | Percentage of Time with Concerning Health | seconds per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
health_disabled_rate | Percentage of Time with Disabled Health | seconds per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
health_good_rate | Percentage of Time with Good Health | seconds per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
health_unknown_rate | Percentage of Time with Unknown Health | seconds per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
is_master_running_above_one_sec_avg_time | Is Master Running Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
is_master_running_above_one_sec_rate | Is Master Running Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
is_master_running_avg_time | Is Master Running Average Time | ms | cluster, hbase, rack | CDH 4 |
is_master_running_rate | Is Master Running Operations | operations per second | cluster, hbase, rack | CDH 4 |
jvm_blocked_threads | Blocked threads | threads | cluster, hbase, rack | CDH 4, CDH 5 |
jvm_gc_rate | Number of garbage collections | garbage collections per second | cluster, hbase, rack | CDH 4, CDH 5 |
jvm_gc_time_ms_rate | Total time spent garbage collecting. | ms per second | cluster, hbase, rack | CDH 4, CDH 5 |
jvm_heap_committed_mb | Total amount of committed heap memory. | MB | cluster, hbase, rack | CDH 4, CDH 5 |
jvm_heap_used_mb | Total amount of used heap memory. | MB | cluster, hbase, rack | CDH 4, CDH 5 |
jvm_max_memory_mb | Maximum allowed memory. | MB | cluster, hbase, rack | CDH 4, CDH 5 |
jvm_new_threads | New threads | threads | cluster, hbase, rack | CDH 4, CDH 5 |
jvm_non_heap_committed_mb | Total amount of committed non-heap memory. | MB | cluster, hbase, rack | CDH 4, CDH 5 |
jvm_non_heap_used_mb | Total amount of used non-heap memory. | MB | cluster, hbase, rack | CDH 4, CDH 5 |
jvm_runnable_threads | Runnable threads | threads | cluster, hbase, rack | CDH 4, CDH 5 |
jvm_terminated_threads | Terminated threads | threads | cluster, hbase, rack | CDH 4, CDH 5 |
jvm_timed_waiting_threads | Timed waiting threads | threads | cluster, hbase, rack | CDH 4, CDH 5 |
jvm_total_threads | Total threads | threads | cluster, hbase, rack | CDH 4, CDH 5 |
jvm_waiting_threads | Waiting threads | threads | cluster, hbase, rack | CDH 4, CDH 5 |
kudu_active_scanners | Number of scanners that are currently active | message.units.scanners | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_cache_evictions_rate | Number of blocks evicted from the cache | blocks per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_cache_hits_caching_rate | Number of lookups that were expecting a block that found one.Use this number instead of cache_hits when trying to determine how efficient the cache is | blocks per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_cache_hits_rate | Number of lookups that found a block | blocks per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_cache_inserts_rate | Number of blocks inserted in the cache | blocks per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_cache_lookups_rate | Number of blocks looked up from the cache | blocks per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_cache_misses_caching_rate | Number of lookups that were expecting a block that didn't yield one.Use this number instead of cache_misses when trying to determine how efficient the cache is | blocks per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_cache_misses_rate | Number of lookups that didn't yield a block | blocks per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_cache_usage | Memory consumed by the block cache | bytes | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_manager_blocks_open_reading | Number of data blocks currently open for reading | blocks | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_manager_blocks_open_writing | Number of data blocks currently open for writing | blocks | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_manager_total_bytes_read_rate | Number of bytes of block data read since service start | bytes per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_manager_total_bytes_written_rate | Number of bytes of block data written since service start | bytes per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_manager_total_readable_blocks_rate | Number of data blocks opened for reading since service start | blocks per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_block_manager_total_writable_blocks_rate | Number of data blocks opened for writing since service start | blocks per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_code_cache_hits_rate | Number of codegen cache hits since start | hits per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_code_cache_queries_rate | Number of codegen cache queries (hits + misses) since start | queries per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_cpu_stime_rate | Total system CPU time of the process | message.units.milliseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_cpu_utime_rate | Total user CPU time of the process | message.units.milliseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_data_dirs_full | Number of data directories whose disks are currently full | message.units.data_directories | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_generic_current_allocated_bytes | Number of bytes used by the application. This will not typically match the memory use reported by the OS, because it does not include TCMalloc overhead or memory fragmentation. | bytes | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_generic_heap_size | Bytes of system memory reserved by TCMalloc. | bytes | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_glog_error_messages_rate | Number of ERROR-level log messages emitted by the application. | messages per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_glog_info_messages_rate | Number of INFO-level log messages emitted by the application. | messages per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_glog_warning_messages_rate | Number of WARNING-level log messages emitted by the application. | messages per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_changeconfig_rate | Microseconds spent handling kudu.consensus.ConsensusService.ChangeConfig() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_changeconfig_sum_rate | Microseconds spent handling kudu.consensus.ConsensusService.ChangeConfig() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_getconsensusstate_rate | Microseconds spent handling kudu.consensus.ConsensusService.GetConsensusState() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_getconsensusstate_sum_rate | Microseconds spent handling kudu.consensus.ConsensusService.GetConsensusState() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_getlastopid_rate | Microseconds spent handling kudu.consensus.ConsensusService.GetLastOpId() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_getlastopid_sum_rate | Microseconds spent handling kudu.consensus.ConsensusService.GetLastOpId() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_getnodeinstance_rate | Microseconds spent handling kudu.consensus.ConsensusService.GetNodeInstance() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_getnodeinstance_sum_rate | Microseconds spent handling kudu.consensus.ConsensusService.GetNodeInstance() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_leaderstepdown_rate | Microseconds spent handling kudu.consensus.ConsensusService.LeaderStepDown() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_leaderstepdown_sum_rate | Microseconds spent handling kudu.consensus.ConsensusService.LeaderStepDown() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_requestconsensusvote_rate | Microseconds spent handling kudu.consensus.ConsensusService.RequestConsensusVote() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_requestconsensusvote_sum_rate | Microseconds spent handling kudu.consensus.ConsensusService.RequestConsensusVote() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_runleaderelection_rate | Microseconds spent handling kudu.consensus.ConsensusService.RunLeaderElection() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_runleaderelection_sum_rate | Microseconds spent handling kudu.consensus.ConsensusService.RunLeaderElection() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_startremotebootstrap_rate | Microseconds spent handling kudu.consensus.ConsensusService.StartRemoteBootstrap() RPC requests. This is the total number of recorded samples. This metric is no longer produced in current versions of Kudu. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_startremotebootstrap_sum_rate | Microseconds spent handling kudu.consensus.ConsensusService.StartRemoteBootstrap() RPC requests. This is the total sum of recorded samples. This metric is no longer produced in current versions of Kudu. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_starttabletcopy_rate | Microseconds spent handling kudu.consensus.ConsensusService.StartTabletCopy() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_starttabletcopy_sum_rate | Microseconds spent handling kudu.consensus.ConsensusService.StartTabletCopy() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_unsafechangeconfig_rate | Microseconds spent handling kudu.consensus.ConsensusService.UnsafeChangeConfig() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_unsafechangeconfig_sum_rate | Microseconds spent handling kudu.consensus.ConsensusService.UnsafeChangeConfig() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_updateconsensus_rate | Microseconds spent handling kudu.consensus.ConsensusService.UpdateConsensus() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_consensus_consensusservice_updateconsensus_sum_rate | Microseconds spent handling kudu.consensus.ConsensusService.UpdateConsensus() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_altertable_rate | Microseconds spent handling kudu.master.MasterService.AlterTable() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_altertable_sum_rate | Microseconds spent handling kudu.master.MasterService.AlterTable() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_connecttomaster_rate | Microseconds spent handling kudu.master.MasterService.ConnectToMaster() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_connecttomaster_sum_rate | Microseconds spent handling kudu.master.MasterService.ConnectToMaster() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_createtable_rate | Microseconds spent handling kudu.master.MasterService.CreateTable() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_createtable_sum_rate | Microseconds spent handling kudu.master.MasterService.CreateTable() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_deletetable_rate | Microseconds spent handling kudu.master.MasterService.DeleteTable() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_deletetable_sum_rate | Microseconds spent handling kudu.master.MasterService.DeleteTable() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_getmasterregistration_rate | Microseconds spent handling kudu.master.MasterService.GetMasterRegistration() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_getmasterregistration_sum_rate | Microseconds spent handling kudu.master.MasterService.GetMasterRegistration() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_gettablelocations_rate | Microseconds spent handling kudu.master.MasterService.GetTableLocations() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_gettablelocations_sum_rate | Microseconds spent handling kudu.master.MasterService.GetTableLocations() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_gettableschema_rate | Microseconds spent handling kudu.master.MasterService.GetTableSchema() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_gettableschema_sum_rate | Microseconds spent handling kudu.master.MasterService.GetTableSchema() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_gettabletlocations_rate | Microseconds spent handling kudu.master.MasterService.GetTabletLocations() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_gettabletlocations_sum_rate | Microseconds spent handling kudu.master.MasterService.GetTabletLocations() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_isaltertabledone_rate | Microseconds spent handling kudu.master.MasterService.IsAlterTableDone() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_isaltertabledone_sum_rate | Microseconds spent handling kudu.master.MasterService.IsAlterTableDone() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_iscreatetabledone_rate | Microseconds spent handling kudu.master.MasterService.IsCreateTableDone() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_iscreatetabledone_sum_rate | Microseconds spent handling kudu.master.MasterService.IsCreateTableDone() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_listmasters_rate | Microseconds spent handling kudu.master.MasterService.ListMasters() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_listmasters_sum_rate | Microseconds spent handling kudu.master.MasterService.ListMasters() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_listtables_rate | Microseconds spent handling kudu.master.MasterService.ListTables() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_listtables_sum_rate | Microseconds spent handling kudu.master.MasterService.ListTables() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_listtabletservers_rate | Microseconds spent handling kudu.master.MasterService.ListTabletServers() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_listtabletservers_sum_rate | Microseconds spent handling kudu.master.MasterService.ListTabletServers() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_ping_rate | Microseconds spent handling kudu.master.MasterService.Ping() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_ping_sum_rate | Microseconds spent handling kudu.master.MasterService.Ping() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_tsheartbeat_rate | Microseconds spent handling kudu.master.MasterService.TSHeartbeat() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_master_masterservice_tsheartbeat_sum_rate | Microseconds spent handling kudu.master.MasterService.TSHeartbeat() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_server_genericservice_checkleaks_rate | Microseconds spent handling kudu.server.GenericService.CheckLeaks() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_server_genericservice_checkleaks_sum_rate | Microseconds spent handling kudu.server.GenericService.CheckLeaks() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_server_genericservice_flushcoverage_rate | Microseconds spent handling kudu.server.GenericService.FlushCoverage() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_server_genericservice_flushcoverage_sum_rate | Microseconds spent handling kudu.server.GenericService.FlushCoverage() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_server_genericservice_getstatus_rate | Microseconds spent handling kudu.server.GenericService.GetStatus() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_server_genericservice_getstatus_sum_rate | Microseconds spent handling kudu.server.GenericService.GetStatus() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_server_genericservice_serverclock_rate | Microseconds spent handling kudu.server.GenericService.ServerClock() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_server_genericservice_serverclock_sum_rate | Microseconds spent handling kudu.server.GenericService.ServerClock() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_server_genericservice_setflag_rate | Microseconds spent handling kudu.server.GenericService.SetFlag() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_server_genericservice_setflag_sum_rate | Microseconds spent handling kudu.server.GenericService.SetFlag() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_server_genericservice_setserverwallclockfortests_rate | Microseconds spent handling kudu.server.GenericService.SetServerWallClockForTests() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_server_genericservice_setserverwallclockfortests_sum_rate | Microseconds spent handling kudu.server.GenericService.SetServerWallClockForTests() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_remotebootstrapservice_beginremotebootstrapsession_rate | Microseconds spent handling kudu.tserver.RemoteBootstrapService.BeginRemoteBootstrapSession() RPC requests. This is the total number of recorded samples. This metric is no longer produced in current versions of Kudu. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_remotebootstrapservice_beginremotebootstrapsession_sum_rate | Microseconds spent handling kudu.tserver.RemoteBootstrapService.BeginRemoteBootstrapSession() RPC requests. This is the total sum of recorded samples. This metric is no longer produced in current versions of Kudu. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_remotebootstrapservice_checksessionactive_rate | Microseconds spent handling kudu.tserver.RemoteBootstrapService.CheckSessionActive() RPC requests. This is the total number of recorded samples. This metric is no longer produced in current versions of Kudu. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_remotebootstrapservice_checksessionactive_sum_rate | Microseconds spent handling kudu.tserver.RemoteBootstrapService.CheckSessionActive() RPC requests. This is the total sum of recorded samples. This metric is no longer produced in current versions of Kudu. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_remotebootstrapservice_endremotebootstrapsession_rate | Microseconds spent handling kudu.tserver.RemoteBootstrapService.EndRemoteBootstrapSession() RPC requests. This is the total number of recorded samples. This metric is no longer produced in current versions of Kudu. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_remotebootstrapservice_endremotebootstrapsession_sum_rate | Microseconds spent handling kudu.tserver.RemoteBootstrapService.EndRemoteBootstrapSession() RPC requests. This is the total sum of recorded samples. This metric is no longer produced in current versions of Kudu. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_remotebootstrapservice_fetchdata_rate | Microseconds spent handling kudu.tserver.RemoteBootstrapService.FetchData() RPC requests. This is the total number of recorded samples. This metric is no longer produced in current versions of Kudu. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_remotebootstrapservice_fetchdata_sum_rate | Microseconds spent handling kudu.tserver.RemoteBootstrapService.FetchData() RPC requests. This is the total sum of recorded samples. This metric is no longer produced in current versions of Kudu. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletcopyservice_begintabletcopysession_rate | Microseconds spent handling kudu.tserver.TabletCopyService.BeginTabletCopySession() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletcopyservice_begintabletcopysession_sum_rate | Microseconds spent handling kudu.tserver.TabletCopyService.BeginTabletCopySession() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletcopyservice_checksessionactive_rate | Microseconds spent handling kudu.tserver.TabletCopyService.CheckSessionActive() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletcopyservice_checksessionactive_sum_rate | Microseconds spent handling kudu.tserver.TabletCopyService.CheckSessionActive() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletcopyservice_endtabletcopysession_rate | Microseconds spent handling kudu.tserver.TabletCopyService.EndTabletCopySession() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletcopyservice_endtabletcopysession_sum_rate | Microseconds spent handling kudu.tserver.TabletCopyService.EndTabletCopySession() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletcopyservice_fetchdata_rate | Microseconds spent handling kudu.tserver.TabletCopyService.FetchData() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletcopyservice_fetchdata_sum_rate | Microseconds spent handling kudu.tserver.TabletCopyService.FetchData() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserveradminservice_alterschema_rate | Microseconds spent handling kudu.tserver.TabletServerAdminService.AlterSchema() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserveradminservice_alterschema_sum_rate | Microseconds spent handling kudu.tserver.TabletServerAdminService.AlterSchema() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserveradminservice_createtablet_rate | Microseconds spent handling kudu.tserver.TabletServerAdminService.CreateTablet() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserveradminservice_createtablet_sum_rate | Microseconds spent handling kudu.tserver.TabletServerAdminService.CreateTablet() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserveradminservice_deletetablet_rate | Microseconds spent handling kudu.tserver.TabletServerAdminService.DeleteTablet() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserveradminservice_deletetablet_sum_rate | Microseconds spent handling kudu.tserver.TabletServerAdminService.DeleteTablet() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserverservice_checksum_rate | Microseconds spent handling kudu.tserver.TabletServerService.Checksum() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserverservice_checksum_sum_rate | Microseconds spent handling kudu.tserver.TabletServerService.Checksum() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserverservice_listtablets_rate | Microseconds spent handling kudu.tserver.TabletServerService.ListTablets() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserverservice_listtablets_sum_rate | Microseconds spent handling kudu.tserver.TabletServerService.ListTablets() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserverservice_ping_rate | Microseconds spent handling kudu.tserver.TabletServerService.Ping() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserverservice_ping_sum_rate | Microseconds spent handling kudu.tserver.TabletServerService.Ping() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserverservice_scan_rate | Microseconds spent handling kudu.tserver.TabletServerService.Scan() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserverservice_scan_sum_rate | Microseconds spent handling kudu.tserver.TabletServerService.Scan() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserverservice_scannerkeepalive_rate | Microseconds spent handling kudu.tserver.TabletServerService.ScannerKeepAlive() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserverservice_scannerkeepalive_sum_rate | Microseconds spent handling kudu.tserver.TabletServerService.ScannerKeepAlive() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserverservice_write_rate | Microseconds spent handling kudu.tserver.TabletServerService.Write() RPC requests. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_handler_latency_kudu_tserver_tabletserverservice_write_sum_rate | Microseconds spent handling kudu.tserver.TabletServerService.Write() RPC requests. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_hybrid_clock_error | Server clock maximum error. | message.units.microseconds | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_hybrid_clock_timestamp | Hybrid clock timestamp. | message.units.microseconds | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_involuntary_context_switches_rate | Total involuntary context switches | message.units.context_switches per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_log_block_manager_blocks_under_management | Number of data blocks currently under management | blocks | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_log_block_manager_bytes_under_management | Number of bytes of data blocks currently under management | bytes | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_log_block_manager_containers | Number of log block containers | message.units.log_block_containers | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_log_block_manager_full_containers | Number of full log block containers | message.units.log_block_containers | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_log_block_manager_unavailable_containers_rate | Number of non-full log block containers that are under root paths whose disks are full This metric is no longer produced in current versions of Kudu. | message.units.log_block_containers per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_logical_clock_timestamp | Logical clock timestamp. | message.units.units | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_op_apply_queue_length_rate | Number of operations waiting to be applied to the tablet. High queue lengths indicate that the server is unable to process operations as fast as they are being written to the WAL. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_op_apply_queue_length_sum_rate | Number of operations waiting to be applied to the tablet. High queue lengths indicate that the server is unable to process operations as fast as they are being written to the WAL. This is the total sum of recorded samples. | tasks per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_op_apply_queue_time_rate | Time that operations spent waiting in the apply queue before being processed. High queue times indicate that the server is unable to process operations as fast as they are being written to the WAL. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_op_apply_queue_time_sum_rate | Time that operations spent waiting in the apply queue before being processed. High queue times indicate that the server is unable to process operations as fast as they are being written to the WAL. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_op_apply_run_time_rate | Time that operations spent being applied to the tablet. High values may indicate that the server is under-provisioned or that operations consist of very large batches. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_op_apply_run_time_sum_rate | Time that operations spent being applied to the tablet. High values may indicate that the server is under-provisioned or that operations consist of very large batches. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_rpc_connections_accepted_rate | Number of incoming TCP connections made to the RPC server | connections per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_rpc_incoming_queue_time_rate | Number of microseconds incoming RPC requests spend in the worker queue. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_rpc_incoming_queue_time_sum_rate | Number of microseconds incoming RPC requests spend in the worker queue. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_rpcs_queue_overflow_rate | Number of RPCs dropped because the service queue was full. | requests per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_rpcs_timed_out_in_queue_rate | Number of RPCs whose timeout elapsed while waiting in the service queue, and thus were not processed. | requests per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_scanner_duration_rate | Histogram of the duration of active scanners on this tablet. This is the total number of recorded samples. | samples per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_scanner_duration_sum_rate | Histogram of the duration of active scanners on this tablet. This is the total sum of recorded samples. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_scanners_expired_rate | Number of scanners that have expired since service start | message.units.scanners per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_spinlock_contention_time_rate | Amount of time consumed by contention on internal spinlocks since the server started. If this increases rapidly, it may indicate a performance issue in Kudu internals triggered by a particular workload and warrant investigation. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_tcmalloc_contention_time_rate | Amount of time consumed by contention on tcmalloc's locks since the server started. If this increases rapidly, it may indicate a performance issue in Kudu internals triggered by a particular workload and warrant investigation. | message.units.microseconds per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_tcmalloc_current_total_thread_cache_bytes | A measure of some of the memory TCMalloc is using (for small objects). | bytes | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_tcmalloc_max_total_thread_cache_bytes | A limit to how much memory TCMalloc dedicates for small objects. Higher numbers trade off more memory use for -- in some situations -- improved efficiency. | bytes | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_tcmalloc_pageheap_free_bytes | Number of bytes in free, mapped pages in page heap. These bytes can be used to fulfill allocation requests. They always count towards virtual memory usage, and unless the underlying memory is swapped out by the OS, they also count towards physical memory usage. | bytes | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_tcmalloc_pageheap_unmapped_bytes | Number of bytes in free, unmapped pages in page heap. These are bytes that have been released back to the OS, possibly by one of the MallocExtension "Release" calls. They can be used to fulfill allocation requests, but typically incur a page fault. They always count towards virtual memory usage, and depending on the OS, typically do not count towards physical memory usage. | bytes | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_threads_running | Current number of running threads | threads | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_threads_started_rate | Total number of threads started on this server | threads per second | cluster, kudu, rack | CDH 4, CDH 5 |
kudu_voluntary_context_switches_rate | Total voluntary context switches | message.units.context_switches per second | cluster, kudu, rack | CDH 4, CDH 5 |
log_error_rate | Logged Errors | messages per second | cluster, hbase, rack | CDH 4, CDH 5 |
log_fatal_rate | Logged Fatals | messages per second | cluster, hbase, rack | CDH 4, CDH 5 |
log_info_rate | Logged Infos | messages per second | cluster, hbase, rack | CDH 4, CDH 5 |
log_warn_rate | Logged Warnings | messages per second | cluster, hbase, rack | CDH 4, CDH 5 |
mem_rss | Resident memory used | bytes | accumulo16, cluster, rack | CDH 4, CDH 5 |
mem_swap | Amount of swap memory used by this role's process. | bytes | accumulo16, cluster, rack | CDH 4, CDH 5 |
mem_virtual | Virtual memory used | bytes | accumulo16, cluster, rack | CDH 4, CDH 5 |
modify_column_above_one_sec_avg_time | Modify Column Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
modify_column_above_one_sec_rate | Modify Column Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
modify_column_avg_time | Modify Column Average Time | ms | cluster, hbase, rack | CDH 4 |
modify_column_rate | Modify Column Operations | operations per second | cluster, hbase, rack | CDH 4 |
modify_table_above_one_sec_avg_time | Modify Table Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
modify_table_above_one_sec_rate | Modify Table Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
modify_table_avg_time | Modify Table Average Time | ms | cluster, hbase, rack | CDH 4 |
modify_table_rate | Modify Table Operations | operations per second | cluster, hbase, rack | CDH 4 |
move_above_one_sec_avg_time | Move Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
move_above_one_sec_rate | Move Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
move_avg_time | Move Average Time | ms | cluster, hbase, rack | CDH 4 |
move_rate | Move Operations | operations per second | cluster, hbase, rack | CDH 4 |
offline_above_one_sec_avg_time | Offline Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
offline_above_one_sec_rate | Offline Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
offline_avg_time | Offline Average Time | ms | cluster, hbase, rack | CDH 4 |
offline_rate | Offline Operations | operations per second | cluster, hbase, rack | CDH 4 |
oom_exits_rate | The number of times the role's backing process was killed due to an OutOfMemory error. This counter is only incremented if the Cloudera Manager "Kill When Out of Memory" option is enabled. | exits per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
read_bytes_rate | The number of bytes read from the device | bytes per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
region_server_report_above_one_sec_avg_time | RegionServer Report Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
region_server_report_above_one_sec_rate | RegionServer Report Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
region_server_report_avg_time | RegionServer Report Average Time | ms | cluster, hbase, rack | CDH 4 |
region_server_report_rate | RegionServer Report Operations | operations per second | cluster, hbase, rack | CDH 4 |
region_server_startup_above_one_sec_avg_time | RegionServer Startup Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
region_server_startup_above_one_sec_rate | RegionServer Startup Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
region_server_startup_avg_time | RegionServer Startup Average Time | ms | cluster, hbase, rack | CDH 4 |
region_server_startup_rate | RegionServer Startup Operations | operations per second | cluster, hbase, rack | CDH 4 |
report_rs_fatal_error_above_one_sec_avg_time | Report RegionServer Fatal Error Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
report_rs_fatal_error_above_one_sec_rate | Report RegionServer Fatal Error Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
report_rs_fatal_error_avg_time | Report RegionServer Fatal Error Average Time | ms | cluster, hbase, rack | CDH 4 |
report_rs_fatal_error_rate | Report RegionServer Fatal Error Operations | operations per second | cluster, hbase, rack | CDH 4 |
roll_h_log_writer_avg_time | Roll HLog Writer Average Time | ms | cluster, hbase, rack | CDH 4 |
roll_h_log_writer_rate | Roll HLog Writer Operations | operations per second | cluster, hbase, rack | CDH 4 |
rpc_authentication_failures_rate | RPC Authentication Failures | operations per second | cluster, hbase, rack | CDH 4 |
rpc_authentication_successes_rate | RPC Authentication Successes | operations per second | cluster, hbase, rack | CDH 4 |
rpc_authorization_failures_rate | RPC Authorization Failures | operations per second | cluster, hbase, rack | CDH 4 |
rpc_authorization_successes_rate | RPC Authorization Successes | operations per second | cluster, hbase, rack | CDH 4 |
rpc_call_queue_length | RPC Call Queue Length | items | cluster, hbase, rack | CDH 4 |
rpc_num_open_connections | Open RPC Connections | connections | cluster, hbase, rack | CDH 4 |
rpc_processing_time_avg_time | Average RPC Processing Time | ms | cluster, hbase, rack | CDH 4 |
rpc_processing_time_rate | RPCs Processed | operations per second | cluster, hbase, rack | CDH 4 |
rpc_queue_time_avg_time | Average RPC Queue Time | ms | cluster, hbase, rack | CDH 4 |
rpc_queue_time_rate | RPCs Queued | operations per second | cluster, hbase, rack | CDH 4 |
rpc_received_bytes_rate | RPC Received Bytes | bytes per second | cluster, hbase, rack | CDH 4 |
rpc_sent_bytes_rate | RPC Sent Bytes | bytes per second | cluster, hbase, rack | CDH 4 |
rpc_slow_response_avg_time | RPC Slow Response Average Time | ms | cluster, hbase, rack | CDH 4 |
rpc_slow_response_rate | RPC Slow Response Operations | operations per second | cluster, hbase, rack | CDH 4 |
split_avg_size | HLog Split Average Size | bytes | cluster, hbase, rack | CDH 4 |
split_avg_time | HLog Split Average Time | ms | cluster, hbase, rack | CDH 4 |
split_rate | HLog Split Operations | operations per second | cluster, hbase, rack | CDH 4 |
unassign_above_one_sec_avg_time | Unassign Above One Second Average Time | ms | cluster, hbase, rack | CDH 4 |
unassign_above_one_sec_rate | Unassign Operations Above One Second | operations per second | cluster, hbase, rack | CDH 4 |
unassign_avg_time | Unassign Average Time | ms | cluster, hbase, rack | CDH 4 |
unassign_rate | Unassign Operations | operations per second | cluster, hbase, rack | CDH 4 |
unexpected_exits_rate | The number of times the role's backing process exited unexpectedly. | exits per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
uptime | For a host, the amount of time since the host was booted. For a role, the uptime of the backing process. | seconds | accumulo16, cluster, rack | CDH 4, CDH 5 |
web_metrics_collection_duration | Web Server Responsiveness | ms | cluster, hbase, rack | CDH 4, CDH 5 |
write_bytes_rate | The number of bytes written to the device | bytes per second | accumulo16, cluster, rack | CDH 4, CDH 5 |
assign_75th_percentile | Assign Region Time 75% | ms | cluster, hbase, rack | CDH 5 |
assign_95th_percentile | Assign Region Time 95% | ms | cluster, hbase, rack | CDH 5 |
assign_99th_percentile | Assign Region Time 99% | ms | cluster, hbase, rack | CDH 5 |
assign_max | Assign Region Max | ms | cluster, hbase, rack | CDH 5 |
assign_mean | Assign Region Mean | ms | cluster, hbase, rack | CDH 5 |
assign_median | Assign Region Median | ms | cluster, hbase, rack | CDH 5 |
assign_min | Assign Region Min | ms | cluster, hbase, rack | CDH 5 |
balance_cluster_75th_percentile | Balance Cluster Time 75% | ms | cluster, hbase, rack | CDH 5 |
balance_cluster_95th_percentile | Balance Cluster Time 95% | ms | cluster, hbase, rack | CDH 5 |
balance_cluster_99th_percentile | Balance Cluster Time 99% | ms | cluster, hbase, rack | CDH 5 |
balance_cluster_max | Balance Cluster Time Max | ms | cluster, hbase, rack | CDH 5 |
balance_cluster_mean | Balance Cluster Time Mean | ms | cluster, hbase, rack | CDH 5 |
balance_cluster_median | Balance Cluster Time Median | ms | cluster, hbase, rack | CDH 5 |
balance_cluster_min | Balance Cluster Time Min | ms | cluster, hbase, rack | CDH 5 |
balance_cluster_rate | Balance Cluster Operations | operations per second | cluster, hbase, rack | CDH 5 |
balancer_misc_invocations_rate | The number of times the balancer was invoked outside a balance cluster operation. The balancer might be invoked when a new table is created, for example, to place the newly created regions. | operations per second | cluster, hbase, rack | CDH 5 |
bulk_assign_75th_percentile | Bulk Assign Time 75% | ms | cluster, hbase, rack | CDH 5 |
bulk_assign_95th_percentile | Bulk Assign Time 95% | ms | cluster, hbase, rack | CDH 5 |
bulk_assign_99th_percentile | Bulk Assign Time 99% | ms | cluster, hbase, rack | CDH 5 |
bulk_assign_max | Bulk Assign Time Max | ms | cluster, hbase, rack | CDH 5 |
bulk_assign_mean | Bulk Assign Time Mean | ms | cluster, hbase, rack | CDH 5 |
bulk_assign_median | Bulk Assign Time Median | ms | cluster, hbase, rack | CDH 5 |
bulk_assign_min | Bulk Assign Time Min | ms | cluster, hbase, rack | CDH 5 |
bulk_assign_rate | Bulk Assign Operations | operations per second | cluster, hbase, rack | CDH 5 |
dead_regionservers | The number of dead RegionServers. | RegionServers | cluster, hbase, rack | CDH 5 |
gc_count_concurrent_mark_sweep_rate | The number of garbage collections by the Concurrent Mark Sweep Collector. | garbage collections per second | cluster, hbase, rack | CDH 5 |
gc_count_par_new_rate | The number of garbage collections by the Parallel Collector. | garbage collections per second | cluster, hbase, rack | CDH 5 |
gc_time_ms_concurrent_mark_sweep_rate | The total time spent in garbage collections by the Concurrent Mark Sweep Collector. | ms per second | cluster, hbase, rack | CDH 5 |
gc_time_ms_par_new_rate | The total time spent in garbage collections by the Parallel Collector. | ms per second | cluster, hbase, rack | CDH 5 |
get_hadoop_groups_avg_time | Average Time to get Hadoop group for the user | ms | cluster, hbase, rack | CDH 5 |
get_hadoop_groups_rate | Get Hadoop User Operations | operations per second | cluster, hbase, rack | CDH 5 |
hlog_split_rate | HLog Split Operations | operations per second | cluster, hbase, rack | CDH 5 |
hlog_split_size_75th_percentile | HLog Split Size 75% | bytes | cluster, hbase, rack | CDH 5 |
hlog_split_size_95th_percentile | HLog Split Size 95% | bytes | cluster, hbase, rack | CDH 5 |
hlog_split_size_99th_percentile | HLog Split Size 99% | bytes | cluster, hbase, rack | CDH 5 |
hlog_split_size_max | HLog Split Size Max | bytes | cluster, hbase, rack | CDH 5 |
hlog_split_size_mean | HLog Split Size Mean | bytes | cluster, hbase, rack | CDH 5 |
hlog_split_size_median | HLog Split Size Median | bytes | cluster, hbase, rack | CDH 5 |
hlog_split_size_min | HLog Split Size Min | bytes | cluster, hbase, rack | CDH 5 |
hlog_split_time_75th_percentile | HLog Split Time 75% | ms | cluster, hbase, rack | CDH 5 |
hlog_split_time_95th_percentile | HLog Split Time 95% | ms | cluster, hbase, rack | CDH 5 |
hlog_split_time_99th_percentile | HLog Split Time 99% | ms | cluster, hbase, rack | CDH 5 |
hlog_split_time_max | HLog Split Time Max | ms | cluster, hbase, rack | CDH 5 |
hlog_split_time_mean | HLog Split Time Mean | ms | cluster, hbase, rack | CDH 5 |
hlog_split_time_median | HLog Split Time Median | ms | cluster, hbase, rack | CDH 5 |
hlog_split_time_min | HLog Split Time Min | ms | cluster, hbase, rack | CDH 5 |
ipc_authentication_failures_rate | IPC Authentication Failures | operations per second | cluster, hbase, rack | CDH 5 |
ipc_authentication_successes_rate | IPC Authentication Successes | operations per second | cluster, hbase, rack | CDH 5 |
ipc_authorization_failures_rate | IPC Authorization Failures | operations per second | cluster, hbase, rack | CDH 5 |
ipc_authorization_successes_rate | IPC Authorization Successes | operations per second | cluster, hbase, rack | CDH 5 |
ipc_process_rate | IPCs Processed | operations per second | cluster, hbase, rack | CDH 5 |
ipc_process_time_75th_percentile | IPC Processing Time 75% | ms | cluster, hbase, rack | CDH 5 |
ipc_process_time_95th_percentile | IPC Processing Time 95% | ms | cluster, hbase, rack | CDH 5 |
ipc_process_time_99th_percentile | IPC Processing Time 99% | ms | cluster, hbase, rack | CDH 5 |
ipc_process_time_max | IPC Processing Time Max | ms | cluster, hbase, rack | CDH 5 |
ipc_process_time_mean | IPC Processing Time Mean | ms | cluster, hbase, rack | CDH 5 |
ipc_process_time_median | IPC Processing Time Median | ms | cluster, hbase, rack | CDH 5 |
ipc_process_time_min | IPC Processing Time Min | ms | cluster, hbase, rack | CDH 5 |
ipc_queue_rate | IPCs Queued | operations per second | cluster, hbase, rack | CDH 5 |
ipc_queue_time_75th_percentile | IPC Queue Time 75% | ms | cluster, hbase, rack | CDH 5 |
ipc_queue_time_95th_percentile | IPC Queue Time 95% | ms | cluster, hbase, rack | CDH 5 |
ipc_queue_time_99th_percentile | IPC Queue Time 99% | ms | cluster, hbase, rack | CDH 5 |
ipc_queue_time_max | IPC Queue Time Max | ms | cluster, hbase, rack | CDH 5 |
ipc_queue_time_mean | IPC Queue Time Mean | ms | cluster, hbase, rack | CDH 5 |
ipc_queue_time_median | IPC Queue Time Median | ms | cluster, hbase, rack | CDH 5 |
ipc_queue_time_min | IPC Queue Time Min | ms | cluster, hbase, rack | CDH 5 |
ipc_received_bytes_rate | IPC Received Bytes | bytes per second | cluster, hbase, rack | CDH 5 |
ipc_sent_bytes_rate | IPC Sent Bytes | bytes per second | cluster, hbase, rack | CDH 5 |
login_failure_avg_time | Average Failed Login Time | ms | cluster, hbase, rack | CDH 5 |
login_failure_rate | Login Failures | operations per second | cluster, hbase, rack | CDH 5 |
login_success_avg_time | Average Successful Login Time | ms | cluster, hbase, rack | CDH 5 |
login_success_rate | Login Successes | operations per second | cluster, hbase, rack | CDH 5 |
meta_hlog_split_rate | Meta HLog Split Operations | operations per second | cluster, hbase, rack | CDH 5 |
meta_hlog_split_size_75th_percentile | Meta HLog Split Size 75% | bytes | cluster, hbase, rack | CDH 5 |
meta_hlog_split_size_95th_percentile | Meta HLog Split Size 95% | bytes | cluster, hbase, rack | CDH 5 |
meta_hlog_split_size_99th_percentile | Meta HLog Split Size 99% | bytes | cluster, hbase, rack | CDH 5 |
meta_hlog_split_size_max | Meta HLog Split Size Max | bytes | cluster, hbase, rack | CDH 5 |
meta_hlog_split_size_mean | Meta HLog Split Size Mean | bytes | cluster, hbase, rack | CDH 5 |
meta_hlog_split_size_median | Meta HLog Split Size Median | bytes | cluster, hbase, rack | CDH 5 |
meta_hlog_split_size_min | Meta HLog Split Size Min | bytes | cluster, hbase, rack | CDH 5 |
meta_hlog_split_time_75th_percentile | Meta HLog Split Time 75% | ms | cluster, hbase, rack | CDH 5 |
meta_hlog_split_time_95th_percentile | Meta HLog Split Time 95% | ms | cluster, hbase, rack | CDH 5 |
meta_hlog_split_time_99th_percentile | Meta HLog Split Time 99% | ms | cluster, hbase, rack | CDH 5 |
meta_hlog_split_time_max | Meta HLog Split Time Max | ms | cluster, hbase, rack | CDH 5 |
meta_hlog_split_time_mean | Meta HLog Split Time Mean | ms | cluster, hbase, rack | CDH 5 |
meta_hlog_split_time_median | Meta HLog Split Time Median | ms | cluster, hbase, rack | CDH 5 |
meta_hlog_split_time_min | Meta HLog Split Time Min | ms | cluster, hbase, rack | CDH 5 |
metrics_dropped_pub_all | Dropped Metrics Updates By All Sinks | updates | cluster, hbase, rack | CDH 5 |
metrics_num_active_sinks | Active Metrics Sinks Count | sinks | cluster, hbase, rack | CDH 5 |
metrics_num_active_sources | Active Metrics Sources Count | sources | cluster, hbase, rack | CDH 5 |
metrics_num_all_sinks | All Metrics Sinks Count | sinks | cluster, hbase, rack | CDH 5 |
metrics_num_all_sources | All Metrics Sources Count | sources | cluster, hbase, rack | CDH 5 |
metrics_publish_avg_time | Metrics Publish Average Time | ms | cluster, hbase, rack | CDH 5 |
metrics_publish_rate | Metrics Publish Operations | operations per second | cluster, hbase, rack | CDH 5 |
metrics_snapshot_avg_time | Metrics Snapshot Average Time | ms | cluster, hbase, rack | CDH 5 |
metrics_snapshot_rate | Metrics Snapshot Average Operations | operations per second | cluster, hbase, rack | CDH 5 |
regions_in_transition | Regions in Transition | regions | cluster, hbase, rack | CDH 5 |
regions_in_transition_longest_time | The largest amount of time that any region currently in transition has been in transition. | ms | cluster, hbase, rack | CDH 5 |
regions_in_transition_over_threshold | The number of regions that have been in transition longer than the threshold configured in rit.metrics.threshold.time. The default threshold value is one minute. | regions | cluster, hbase, rack | CDH 5 |
regionservers | The number of RegionServers. | RegionServers | cluster, hbase, rack | CDH 5 |