Documentation
Products
Services & Support
Solutions
List of Pages in Category Data Analysts (229 pages)
Start typing to see matching topic titles in the
Data Analysts
category:
If this category isn't helpful:
List of all categories
|
Back to navigation tree view
A
B
C
D
E
F
G
H
I
J
L
M
N
O
P
Q
R
S
T
U
V
W
*
All Cloudera Documentation Categories
A
ABORT_ON_DEFAULT_LIMIT_EXCEEDED Query Option
(211 words:
)
ABORT_ON_ERROR Query Option
(291 words:
)
(Task)
Accessing Avro Data Files From Spark SQL Applications
(1264 words:
)
(Task)
Accessing Data Stored in Amazon S3 through Spark
(1392 words:
)
(Task)
Accessing Data Stored in Azure Data Lake Store (ADLS) through Spark
(484 words:
)
(Task)
Accessing External Storage from Spark
(403 words:
)
(Task)
Accessing HBase by using the HBase Shell
(577 words:
)
(Task)
Accessing HBase by using the HBase Shell
(576 words:
)
(Task)
Accessing Parquet Files From Spark SQL Applications
(438 words:
)
(Task)
Accessing Table Data with Pig
(206 words:
)
ALTER TABLE Statement
(6162 words:
)
ALTER VIEW Statement
(704 words:
)
Apache Impala - Interactive SQL
(548 words:
)
Apache Impala Overview
(778 words:
)
APPX_COUNT_DISTINCT Query Option (CDH 5.2 or higher only)
(405 words:
)
APPX_MEDIAN Function
(655 words:
)
ARRAY Complex Type (CDH 5.5 or higher only)
(1582 words:
)
AVG Function
(1850 words:
)
B
BATCH_SIZE Query Option
(218 words:
)
(Task)
Benchmarking Impala Queries
(291 words:
)
BIGINT Data Type
(664 words:
)
BOOLEAN Data Type
(759 words:
)
(Task)
Building Spark Applications
(697 words:
)
C
CHAR Data Type (CDH 5.2 or higher only)
(1080 words:
)
Cloudera Search Guide
(444 words:
)
Cloudera Search Overview
(1426 words:
)
Comments
(305 words:
)
Complex Types (CDH 5.5 or higher only)
(13422 words:
)
Components of the Impala Server
(1410 words:
)
COMPRESSION_CODEC Query Option (CDH 5.2 or higher only)
(374 words:
)
COMPUTE STATS Statement
(3630 words:
)
Configuration Settings for HBase
(3481 words:
)
(Task)
Configuring Impala Delegation for Hue and BI Tools
(808 words:
)
(Task)
Connecting to impalad through impala-shell
(970 words:
)
(Task)
Controlling Impala Resource Usage
(391 words:
)
(Task)
Copying Cluster Data Using DistCp
(3496 words:
)
(Task)
Copying Data between a Secure and an Insecure Cluster using DistCp and WebHDFS
(316 words:
)
COUNT Function
(1946 words:
)
CREATE DATABASE Statement
(1048 words:
)
CREATE FUNCTION Statement
(2829 words:
)
CREATE ROLE Statement (CDH 5.2 or higher only)
(371 words:
)
CREATE TABLE Statement
(7052 words:
)
CREATE VIEW Statement
(1043 words:
)
D
Data Types
(278 words:
)
DDL Statements
(692 words:
)
DECIMAL Data Type
(3891 words:
)
DEFAULT_JOIN_DISTRIBUTION_MODE Query Option
(621 words:
)
DEFAULT_ORDER_BY_LIMIT Query Option
(252 words:
)
DELETE Statement (CDH 5.10 or higher only)
(905 words:
)
DESCRIBE Statement
(4503 words:
)
(Task)
Detecting and Correcting HDFS Block Skew Conditions
(1104 words:
)
(Task)
Developing and Running a Spark WordCount Application
(1221 words:
)
(Task)
Developing Impala Applications
(1017 words:
)
DISABLE_ROW_RUNTIME_FILTERING Query Option (CDH 5.7 or higher only)
(383 words:
)
DISABLE_STREAMING_PREAGGREGATIONS Query Option (CDH 5.7 or higher only)
(333 words:
)
DISABLE_UNSAFE_SPILLS Query Option (CDH 5.2 or higher only)
(386 words:
)
DISTINCT Operator
(511 words:
)
DML Statements
(565 words:
)
DOUBLE Data Type
(683 words:
)
DROP DATABASE Statement
(887 words:
)
DROP FUNCTION Statement
(691 words:
)
DROP ROLE Statement (CDH 5.2 or higher only)
(372 words:
)
DROP STATS Statement
(2208 words:
)
DROP TABLE Statement
(1083 words:
)
E
EXEC_SINGLE_NODE_ROWS_THRESHOLD Query Option (CDH 5.3 or higher only)
(550 words:
)
EXPLAIN Statement
(1549 words:
)
EXPLAIN_LEVEL Query Option
(1862 words:
)
F
File Formats and Compression
(346 words:
)
FLOAT Data Type
(676 words:
)
G
GRANT Statement (CDH 5.2 or higher only)
(647 words:
)
GROUP BY Clause
(797 words:
)
GROUP_CONCAT Function
(636 words:
)
(Concept)
Guidelines for Designing Impala Schemas
(1288 words:
)
H
HAVING Clause
(230 words:
)
HBase Filtering
(2438 words:
)
HBASE_CACHE_BLOCKS Query Option
(228 words:
)
HBASE_CACHING Query Option
(203 words:
)
How Impala Fits Into the Hadoop Ecosystem
(873 words:
)
How Impala Works with Hadoop File Formats
(1046 words:
)
I
Impala Aggregate Functions
(267 words:
)
Impala Analytic Functions
(8805 words:
)
Impala Bit Functions
(2871 words:
)
Impala Built-In Functions
(703 words:
)
Impala Concepts and Architecture
(220 words:
)
Impala Conditional Functions
(2042 words:
)
Impala Date and Time Functions
(9233 words:
)
Impala Frequently Asked Questions
(8486 words:
)
Impala Mathematical Functions
(4083 words:
)
Impala Miscellaneous Functions
(640 words:
)
(Concept)
Impala Performance Guidelines and Best Practices
(2241 words:
)
(Release Notes)
Impala Requirements
(1364 words:
)
(Release Notes)
Impala Requirements
(1360 words:
)
Impala Reserved Words
(701 words:
)
Impala Schema Objects and Object Names
(440 words:
)
Impala SQL Language Reference
(417 words:
)
Impala SQL Statements
(424 words:
)
Impala String Functions
(5287 words:
)
Impala Tutorials
(11236 words:
)
Impala Type Conversion Functions
(1416 words:
)
Impala Web User Interface for Debugging
(1388 words:
)
impala-shell Command Reference
(1474 words:
)
impala-shell Configuration Options
(1678 words:
)
(Task)
Importing Data Into HBase
(6191 words:
)
INSERT Statement
(5054 words:
)
INT Data Type
(499 words:
)
INVALIDATE METADATA Statement
(1003 words:
)
J
Joins in Impala SELECT Statements
(3188 words:
)
L
LIMIT Clause
(843 words:
)
Literals
(2175 words:
)
LIVE_PROGRESS Query Option (CDH 5.5 or higher only)
(607 words:
)
LIVE_SUMMARY Query Option (CDH 5.5 or higher only)
(1275 words:
)
LOAD DATA Statement
(1885 words:
)
M
(Task)
Managing Disk Space for Impala Data
(991 words:
)
MAP Complex Type (CDH 5.5 or higher only)
(1618 words:
)
MAX Function
(1752 words:
)
MAX_ERRORS Query Option
(243 words:
)
MAX_IO_BUFFERS Query Option
(161 words:
)
MAX_NUM_RUNTIME_FILTERS Query Option (CDH 5.7 or higher only)
(341 words:
)
MAX_SCAN_RANGE_LENGTH Query Option
(317 words:
)
MEM_LIMIT Query Option
(985 words:
)
MIN Function
(1754 words:
)
MT_DOP Query Option
(707 words:
)
N
NDV Function
(1068 words:
)
NUM_NODES Query Option
(387 words:
)
NUM_SCANNER_THREADS Query Option
(209 words:
)
O
OFFSET Clause
(515 words:
)
OPTIMIZE_PARTITION_KEY_SCANS Query Option (CDH 5.7 or higher only)
(982 words:
)
ORDER BY Clause
(2507 words:
)
(Concept)
Overview of Impala Aliases
(649 words:
)
(Concept)
Overview of Impala Databases
(434 words:
)
(Concept)
Overview of Impala Functions
(636 words:
)
(Concept)
Overview of Impala Identifiers
(615 words:
)
(Concept)
Overview of Impala Tables
(2318 words:
)
(Concept)
Overview of Impala Views
(1874 words:
)
P
PARQUET_ANNOTATE_STRINGS_UTF8 Query Option (CDH 5.8 or higher only)
(325 words:
)
PARQUET_ARRAY_RESOLUTION Query Option (CDH 5.11 or higher only)
(613 words:
)
PARQUET_DICTIONARY_FILTERING Query Option (CDH 5.11 or higher only)
(412 words:
)
PARQUET_FALLBACK_SCHEMA_RESOLUTION Query Option (CDH 5.8 or higher only)
(261 words:
)
PARQUET_FILE_SIZE Query Option
(459 words:
)
PARQUET_READ_STATISTICS Query Option (CDH 5.11 or higher only)
(388 words:
)
(Task)
Partitioning for Impala Tables
(5181 words:
)
Performance Considerations for Join Queries
(2879 words:
)
(Task)
Porting SQL from Other Database Systems to Impala
(3386 words:
)
(Reference)
Ports Used by Impala
(505 words:
)
PREFETCH_MODE Query Option (CDH 5.8 or higher only)
(226 words:
)
Q
Query Hints in Impala SELECT Statements
(2393 words:
)
Query Options for the SET Statement
(320 words:
)
QUERY_TIMEOUT_S Query Option (CDH 5.2 or higher only)
(347 words:
)
R
(Task)
Reading Data from HBase
(1163 words:
)
REAL Data Type
(252 words:
)
REFRESH FUNCTIONS Statement
(255 words:
)
REFRESH Statement
(996 words:
)
REPLICA_PREFERENCE Query Option (CDH 5.9 or higher only)
(346 words:
)
REQUEST_POOL Query Option
(212 words:
)
RESERVATION_REQUEST_TIMEOUT Query Option (CDH 5.0 or higher 5 only)
(206 words:
)
Resource Management for Impala
(460 words:
)
REVOKE Statement (CDH 5.2 or higher only)
(557 words:
)
(Task)
Running Commands and SQL Statements in impala-shell
(613 words:
)
(Task)
Running Spark Applications
(1239 words:
)
(Task)
Running Your First Spark Application
(527 words:
)
Runtime Filtering for Impala Queries (CDH 5.7 or higher only)
(3109 words:
)
RUNTIME_BLOOM_FILTER_SIZE Query Option (CDH 5.7 or higher only)
(517 words:
)
RUNTIME_FILTER_MAX_SIZE Query Option (CDH 5.8 or higher only)
(306 words:
)
RUNTIME_FILTER_MIN_SIZE Query Option (CDH 5.8 or higher only)
(306 words:
)
RUNTIME_FILTER_MODE Query Option (CDH 5.7 or higher only)
(429 words:
)
RUNTIME_FILTER_WAIT_TIME_MS Query Option (CDH 5.7 or higher only)
(266 words:
)
S
S3_SKIP_INSERT_STAGING Query Option (CDH 5.8 or higher only)
(473 words:
)
SCAN_NODE_CODEGEN_THRESHOLD Query Option (CDH 5.7 or higher only)
(447 words:
)
SCHEDULE_RANDOM_REPLICA Query Option (CDH 5.7 or higher only)
(320 words:
)
SCRATCH_LIMIT Query Option
(320 words:
)
SELECT Statement
(1241 words:
)
SET Statement
(862 words:
)
SHOW Statement
(8093 words:
)
SMALLINT Data Type
(535 words:
)
Spark and IPython and Jupyter Notebooks
(274 words:
)
(Task)
Specifying Impala Credentials to Access Data in S3
(281 words:
)
(Task)
Specifying Impala Credentials to Access Data in S3 with Cloudera Manager
(510 words:
)
SQL Differences Between Impala and Hive
(1336 words:
)
SQL Operators
(8511 words:
)
STDDEV, STDDEV_SAMP, STDDEV_POP Functions
(521 words:
)
STRING Data Type
(1130 words:
)
STRUCT Complex Type (CDH 5.5 or higher only)
(2276 words:
)
Subqueries in Impala SELECT Statements
(1611 words:
)
SUM Function
(1926 words:
)
Supported Sources, Sinks, and Channels
(944 words:
)
SYNC_DDL Query Option
(420 words:
)
T
Table and Column Statistics
(7084 words:
)
TABLESAMPLE Clause
(2564 words:
)
(Task)
Testing Impala Performance
(691 words:
)
TIMESTAMP Data Type
(3158 words:
)
TINYINT Data Type
(588 words:
)
(Task)
Troubleshooting Impala
(1588 words:
)
TRUNCATE TABLE Statement (CDH 5.5 or higher only)
(910 words:
)
U
(Task)
Understanding Impala Query Performance - EXPLAIN Plans and Query Profiles
(1554 words:
)
UNION Clause
(692 words:
)
UPDATE Statement (CDH 5.10 or higher only)
(841 words:
)
UPSERT Statement (CDH 5.10 or higher only)
(549 words:
)
USE Statement
(394 words:
)
User-Defined Functions (UDFs)
(7403 words:
)
(Task)
Using Apache Avro Data Files with CDH
(1417 words:
)
(Task)
Using Apache Hive with HBase in CDH
(358 words:
)
(Task)
Using Apache Parquet Data Files with CDH
(3271 words:
)
(Task)
Using Azure Data Lake Store with HBase
(498 words:
)
(Task)
Using CDH with Isilon Storage
(2583 words:
)
(Task)
Using HBase Command-Line Utilities
(2211 words:
)
(Task)
Using HDFS Caching with Impala (CDH 5.3 or higher only)
(3532 words:
)
(Task)
Using Impala Logging
(2002 words:
)
(Task)
Using Impala through a Proxy for High Availability
(2656 words:
)
(Task)
Using Impala to Query HBase Tables
(4040 words:
)
(Task)
Using Impala to Query Kudu Tables
(6802 words:
)
(Task)
Using Impala with Isilon Storage
(825 words:
)
(Task)
Using Impala with the Amazon S3 Filesystem
(4316 words:
)
(Task)
Using Impala with the Azure Data Lake Store (ADLS)
(3441 words:
)
(Task)
Using Pig with HBase
(240 words:
)
(Task)
Using PySpark
(199 words:
)
(Task)
Using Spark MLlib
(521 words:
)
(Task)
Using Spark SQL
(1468 words:
)
(Task)
Using Spark Streaming
(1088 words:
)
(Task)
Using Text Data Files with Impala Tables
(4023 words:
)
(Task)
Using the Avro File Format with Impala Tables
(2652 words:
)
(Task)
Using the Impala Shell (impala-shell Command)
(620 words:
)
(Task)
Using the Parquet File Format with Impala Tables
(7068 words:
)
(Task)
Using the RCFile File Format with Impala Tables
(1095 words:
)
(Task)
Using the SequenceFile File Format with Impala Tables
(1070 words:
)
V
VARCHAR Data Type (CDH 5.2 or higher only)
(1081 words:
)
VARIANCE, VARIANCE_SAMP, VARIANCE_POP, VAR_SAMP, VAR_POP Functions
(591 words:
)
V_CPU_CORES Query Option (CDH 5.0 or higher only)
(205 words:
)
W
WITH Clause
(428 words:
)
(Task)
Writing Data to HBase
(1129 words:
)