Searching Metadata

Required Role: Metadata Viewer (or Metadata Administrator, Full Administrator)

You can access metadata through the Cloudera Navigator console or by using the Navigator API.

Using the Cloudera Navigator Console

Searching Metadata

  1. Open your browser.
  2. Navigate to the host within the cluster running the Cloudera Navigator Metadata Server role.
    http://fqdn-1.example.com:7187/login.html
    In this example, node 1 of the cluster is running the Navigator Metadata Server role, hosted on the default port 7187. The login page displays.
  3. Log in to the Cloudera Navigator console using the credentials assigned by your administrator. The Cloudera Navigator console opens to the Search tab.
  4. To display all entities, click Clear all filters.
  5. Filter the search results by specifying filters or typing search strings in the Search box.

Filter Example

The following filter example demonstrates how to narrow search results by selecting a built-in Source Type filter set to HDFS and the managed property BLOCK SIZE (MIB) with the value >=1024. This example also shows the full query, which you can see by clicking Show full query in the results pane.

Viewing Search Results

The Search Results pane displays the number of matching entries in pages listing 25 entities per page. You can view the pages using the page control at the bottom of each page.

Each entry in the result list contains:
  • Source type
  • Name - A link to a page that displays the entity details and lineage diagram
  • Properties
  • If Hue is running, a link at the far right labeled View in Hue that opens the Hue browser for the entity. For example:

Displaying Entity Details

The entity Details page displays all three types of metadata for an entity—technical, managed, and custom—and entity type-specific information:
  • HDFS directories - Directory contents
  • HDFS datasets and fields - Schema
  • Hive and Impala databases - Tables and views
  • Hive tables - Extended attributes, table schema, partitions
  • Impala tables - Table schema
  • MapReduce, YARN, and Spark operations - Instances
  • Pig operation executions - Tables
  • S3 buckets and folders - Folder contents
All entity types can display inputs and outputs. See Configuring Display of Inputs and Outputs.

If managed properties have been defined for a particular entity type, the Show All checkbox in the Managed Metadata pane displays all properties that can be assigned values for the selected entity. To display only those properties that have values, clear the checkbox. If all properties have values, the checkbox has no effect.

To display entity details:

  1. Perform a search.
  2. In the search results, click an entity name link. The Details tab displays.

Hive Table Entity Details

For example, if you click the Hive table weblogs link in the search result displayed in preceding example, you see the following details:


The caret > indicates fields with content; click to expand them. In addition to the technical metadata, you can view the names of the columns and the inputs and outputs.

Managed Metadata Example

The following account table entity has two managed properties in the realestate namespace: Department and AccessAttempts.

Filtering Search Results

To filter search results, specify filters in the Filters pane or type search strings in the Search box.

The Filters pane lists default properties (source type, type, owner, cluster, and tags) and property values. Add a filter by clicking Add Filters... and scrolling, or by typing in the filter combo box to search for it. To remove non-default filter properties, click the x in the filter.

Specify a property value as follows:

  • Boolean - Select the option to respectively not display, or display only those entries, with the value set to true: Do not show XXX (the default) or Show XXX only, where XXX is the Boolean property.
  • Enumerated or freeform string
    • Select the checkbox next to a value or click a value link.
    • If a property has no values, click add a new value, click the text box, and select from the populated values in the drop-down list or type a value.
  • Timestamp - Timestamps are used for started, ended, created, last accessed, and last modified properties. The server stores the timestamp in UTC, and the UI displays the timestamp converted to the local timezone. Select one of the timestamp options:
    • A Last XXX day(s) link.
    • The Last checkbox, type or specify the value using the spinner control and select the unit minutes, hours, or days.
    • The Custom period checkbox and specify the start and end date.
      • Date - Click the down arrow to display a calendar and select a date, or click a field and click the spinner arrows or press the up and down arrow keys.
      • Time - Click the hour, minute, and AM/PM fields and click the spinner arrows or press the up and down arrow keys to specify the value.
      • Move between fields by clicking fields or by using the right and left arrow keys.

To remove filter values, clear the checkbox.

When you select a specific source type value, additional properties that apply to that source type display. For example, HDFS has size, created, and group properties:


The number to the right a property value is the number of extracted entities that have that property value:


Facet values with the count 0 are not displayed.

When you type values, the value is enclosed in quotes; the value inside the quotes must match the metadata exactly. For example:
  • Typing "sample_*" in the originalName property returns only entities whose names match that exact string.
  • To perform a wildcard search, type the wildcard string in the Search box. For example, typing the string "sample_*" in the Search box returns all entities with "sample_" at the beginning of their original name.
When you construct search strings with filters, use parentheses () to specify multiple values of a property. Add multiple properties by using the + operator. For example, entities of type HDFS or Hive that are of type file or directory:
+(sourceType:hdfs sourceType:hive) +(type:file type:directory)
and:
 ((+sourceType:hdfs +created:[NOW/DAY-30DAYS TO NOW/DAY+1DAY]) sourceType:hive)

Saving Searches

  1. Specify a search string or set of filters.
  2. To the right of the Search box, select Actions > Save, Actions > Save Search_name, or Actions > Save As....
  3. If you have not previously saved the search, specify a name and click Save.

Reusing a Saved Search

  1. To the right of the Search box, select Actions > View saved searches.... A label with the saved search name is added under the Search box.
  2. Click the saved search name. The breadcrumbs and full query if displayed are updated to reflect the saved search, and the search results are refreshed immediately.