Overview

Alation Cloud Service Applies to Alation Cloud Service instances of Alation

Available from Alation version 2024.3.4

The OCF Connector for dbt Gen2 is developed by Alation and is available for download from the Connector Hub on Alation Customer Portal. Ask an Alation admin with access to Customer Portal to download the connector from the Connectors section (Customer Portal > Connectors).

Use the connector to catalog dbt as a data source on Alation Cloud Service instances. It extracts projects, models, model columns, and job runs (jobs executed for a given model in an instance). After the metadata is extracted, it is represented in the data catalog as a hierarchy of catalog pages. Alation users can leverage the full catalog functionality to search for and find the extracted metadata, curate the corresponding catalog pages, create documentation about the data source, and exchange information about it.

Team

You may require assistance from the following administrators to install and configure this connector in Alation:

Alation Server administrator:

  • Installs the connector.

  • Enables and configures extraction from dbt ELT source.

dbt administrator:

  • Provides the API URLs required to access dbt and extract metadata using dbt Admin and Discovery API

  • Provides Account ID required for the API URL

  • Provide Access Token for use with dbt APIs

Scope

The table below shows which metadata objects are extracted by this connector and which operations are supported.

Feature

Scope

Availability

Authentication

dbt access token

Authentication using the access token obtained from dbt account settings

Yes

Metadata Extraction (MDE)

Default MDE

Extraction of metadata using dbt APIs.

Yes

Filter MDE

Extraction of selected metadata based on filters.

Yes

Extracted metadata objects

Model descriptions

Description of the models within a dbt project. The model description is available in the model catalog page.

Yes

Model column descriptions

Description of the model columns. Model column descriptions is available in the model catalog page.

Yes

Test Status

Included as Data Quality for impacted tables.

Yes

Lineage

Jinja code extraction

Extraction of Jinja code from dbt into dataflow content

Yes

Automatic lineage generation

Auto-calculation of source lineage based on the metadata extracted from dbt

Yes

Column-level lineage

Calculation of lineage data at the column level.

Yes

Table-level lineage

Calculation of lineage data at the table level.

Yes

dbt Gen2 Data Architecture

The dbt source architecture contains Projects from the associated RDBMS data sources. Each Project contains models and models in turn have model columns and list of jobs run.

../../../_images/dbt-data-model.png

Use the Alation UI to understand and navigate through dbt metadata hierarchy.

Catalog Page

After extraction, the catalog page for a Project, model, model columns, and jobs are displayed as shown here:

Here is an example of a project catalog and model catalog page.

../../../_images/dbt-project-catalog.png

On the catalog page for a project, Alation displays the RDBMS data sources associated with Alation that are involved in extraction.

The catalog page for a project displays the list of models and the Job Runs table. The Source System information section on the catalog page displays the linked RDBMS data sources associated with an environment.

../../../_images/dbt-model-catalog-page.png

Based on the model execution, Alation displays the linked tables in an environment associated with the extracted RDBMS data source on the respective model catalog page. The Job Status table. For a given model, this table displays the latest job execution summary for an environment.

Similarly, the catalog page for model columns display the associated attributes from the RDBMS data source and the Source System information section displays the columns extracted from the respective tables.

Lineage

To view the lineage for the tables associated with a model, select the table from the Lineage tab on the model catalog page.

By default, the catalog page displays the lineage for the first table listed in the Source System information section.

../../../_images/dbt-model-lineage-tab.png

To learn more about viewing lineage in the Alation catalog, see Discover Lineage.

For information on how to configure Lineage using Alation user interface, see Configure Lineage.

Data Health

To view the health information for any table with associated Data Health rule, navigate to the source table using the link on the Source System information section on the model catalog page and click Data Health. Any failure in model execution gets propagated to downstream objects in the lineage graph making it easy to find the root cause of the failure. Here’s an example:

../../../_images/dbt-data-health.png

To understand more about viewing data health within the Alation data catalog, see View Data Health.

Supported Data Sources

The dbt Gen2 OCF connector supports the extraction of metadata and lineage of following data sources:

  • Snowflake

  • Redshift

  • PostgreSQL

  • Google BigQuery

  • Databricks Unity Catalog

  • Databricks on AWS

  • Azure Databricks

  • Databricks on Google Cloud