The dbt-spark-cde adapter allows you to use dbt with Cloudera Data Platform, with CDE API support.


cloudera/dbt-spark-cde

 
 


dbt-spark-cde

The dbt-spark-cde adapter allows you to use dbt with Cloudera Data Platform through the CDE API server. The code base builds on the dbt-spark project (https://github.com/dbt-labs/dbt-spark) and adds CDE API based connectivity on top of it.

Getting started

Requirements

Python >= 3.9

dbt-core ~= 1.3.0

requests >= 2.28.1

requests-toolbelt >= 0.9.1

pyspark

requests_kerberos

python-decouple
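
The version constraints above can be checked against a local environment before installing the adapter. The following is a minimal sketch, not part of dbt-spark-cde: the helper names (parse, at_least, compatible, check_environment) are hypothetical, and the parser only handles plain numeric versions such as 1.3.4. It treats "~= 1.3.0" as the compatible-release rule (at least 1.3.0, and still within 1.3.x).

```python
import sys
from importlib import metadata


def parse(version: str) -> tuple:
    """Turn '1.3.4' into (1, 3, 4); stop at the first non-numeric component."""
    parts = []
    for piece in version.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def _pad(a: tuple, b: tuple) -> tuple:
    """Pad both tuples with zeros to the same length so (1, 3) equals (1, 3, 0)."""
    width = max(len(a), len(b))
    return a + (0,) * (width - len(a)), b + (0,) * (width - len(b))


def at_least(installed: str, minimum: str) -> bool:
    """Rule for '>=' constraints."""
    i, m = _pad(parse(installed), parse(minimum))
    return i >= m


def compatible(installed: str, base: str) -> bool:
    """Rule for '~=': at least `base`, and equal on all but its last component."""
    i, b = _pad(parse(installed), parse(base))
    prefix = len(parse(base)) - 1
    return i >= b and i[:prefix] == b[:prefix]


def check_environment() -> dict:
    """Return True/False per requirement, or None if the package is not installed."""
    report = {"python": sys.version_info >= (3, 9)}
    checks = {
        "dbt-core": (compatible, "1.3.0"),
        "requests": (at_least, "2.28.1"),
        "requests-toolbelt": (at_least, "0.9.1"),
    }
    for name, (rule, bound) in checks.items():
        try:
            report[name] = rule(metadata.version(name), bound)
        except metadata.PackageNotFoundError:
            report[name] = None
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {ok}")
```

For anything beyond a quick check, the packaging library's specifier support is a more complete way to evaluate these constraints, including pre-release versions that this sketch ignores.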

Installing dbt-spark-cde

pip install dbt-spark-cde

Profile Setup

demo_project:
  target: dev
  outputs:
    dev:
      type: spark_cde
      method: cde
      schema: my_db
      auth_endpoint: https://service.spark-cde-gateway.my.org.com/
      host: https://spark-cde-gateway.my.org.com/dex/api/v1/
      user: my_user
      password: my_pass
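
A profile like the one above can be sanity-checked before running dbt. The sketch below is illustrative only and is not the adapter's own validation: the function name validate_output is hypothetical, the list of required keys is an assumption taken from the sample profile rather than from a documented schema, and requiring https for both endpoints is likewise an assumption. The profile is written as a plain Python dict to keep the example free of third-party dependencies.

```python
from urllib.parse import urlparse

# Assumption: these are the keys shown in the sample profile, not an official schema.
REQUIRED_KEYS = ("type", "method", "schema", "auth_endpoint", "host", "user", "password")


def validate_output(output: dict) -> list:
    """Return a list of human-readable problems; an empty list means none were found."""
    problems = []
    for key in REQUIRED_KEYS:
        if not output.get(key):
            problems.append(f"missing or empty key: {key}")
    # Assumption: both endpoints are expected to be https URLs, as in the sample.
    for key in ("auth_endpoint", "host"):
        value = output.get(key)
        if value:
            parsed = urlparse(str(value))
            if parsed.scheme != "https" or not parsed.netloc:
                problems.append(f"{key} is not an https URL: {value}")
    return problems


# The "dev" output from the sample profile, expressed as a dict.
sample_output = {
    "type": "spark_cde",
    "method": "cde",
    "schema": "my_db",
    "auth_endpoint": "https://service.spark-cde-gateway.my.org.com/",
    "host": "https://spark-cde-gateway.my.org.com/dex/api/v1/",
    "user": "my_user",
    "password": "my_pass",
}

if __name__ == "__main__":
    issues = validate_output(sample_output)
    print("profile looks complete" if not issues else "\n".join(issues))
```

A check of this kind only catches missing or malformed fields. It does not confirm that the endpoints are reachable or that the credentials are accepted.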

Caveats

  • When using the cde method, if sessions in the Livy UI go from starting to dead instead of idle, make sure the user has a proper mapping in the IDBroker mappings section (Actions > Manage Access > IDBroker Mappings).
  • Also make sure the workload password is set, either through the UI or the CLI.

Supported features

Please see the original adapter documentation: https://github.com/dbt-labs/dbt-spark and https://docs.getdbt.com/reference/warehouse-profiles/spark-profile
