For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Log inBook a demo
  • Getting started
    • Introduction
    • Obtaining API keys
    • Quickstart
    • Concepts
    • Embedding authentication
    • Versioning
    • Idempotent requests
  • Configuring your connections
    • CDC streaming from databases
      • Affinity
      • Airtable
      • Amplitude
      • API
      • Apollo.io
      • Asana
      • Ascend
      • Attio
      • AWS S3
      • AWS Athena
      • AWS OpenSearch
      • Azure Blob Storage
      • Azure SQL
      • BiqQuery
      • CloudSQL (MySQL)
      • CloudSQL (Postgres)
      • Chargebee
      • ConstructionWire
      • CosmosDB
      • Customer.io
      • Databricks
      • Datadog
      • dbt Cloud
      • Delighted
      • Dialpad
      • Dittofeed
      • DynamoDB
      • Facebook Ads
      • Freshdesk
      • Front
      • FullStory
      • Google Cloud Storage
      • Github
      • Google Ads
      • Google Workspace
      • Google Sheets
      • Harmonic
      • Heap
      • Honeycomb
      • HubSpot
      • Intercom
      • Iterable
      • Jira
      • Klaviyo
      • Kustomer
      • Linear
      • Linkedin Ads
      • Lob
      • Mailercheck
      • Marketo
      • MongoDB
      • Microsoft Ads
      • MySQL
      • Netsuite
      • Outreach
      • Pardot
      • Pipedrive
      • Postgres
      • PredictLeads
      • Quickbooks
      • Recharge
      • Redshift
      • Redshift Serverless
      • Reply.io
      • Salesforce
      • Segment
      • SFTP
      • Shopify
      • Short.io
      • Slack
      • Snowflake
      • Statsig
      • Stripe
      • Survicate
      • Synapse
      • Tixr
      • Unbounce
      • Uservoice
      • Vanilla Forums
      • Webhooks
      • Zendesk Chat
      • Zendesk Support
      • ZoomInfo
  • API Reference
Logo
Log inBook a demo
On this page
  • Connection
  • Configuration
  • Example
  • Read-only properties
  • Model Sync
  • Source
  • Configuration
  • Example
  • Target
  • All targets
  • Configuration
  • Example
  • Target creation
  • Bulk Sync
  • Source
  • Destination
  • Configuration
  • Example
  • Type handling
  • Destination types
  • Source types
Configuring your connectionsConnection configurations

Databricks

Was this page helpful?
Previous

Datadog

Next

Supports:

  • ✅ Models
  • ✅ Model sync destination
  • ✅ Bulk sync source
  • ✅ Bulk sync destination

Connection

Configuration

NameTypeDescriptionRequired
cloud_providerstringCloud Provider (destination support only)

Accepted values: aws ↓, azure ↓
false
databricks_auth_modestringAuthentication Method

Accepted values: access_token ↓, oauth_service_principal ↓
true
enable_delta_uniformbooleanEnable Delta UniForm tablesfalse
enforce_query_limitbooleanLimit concurrent queries ↓false
http_pathstringHTTP Pathtrue
portintegerPorttrue
server_hostnamestringServer Hostnametrue
sshbooleanConnect over SSH tunnel ↓false
unity_catalog_enabledbooleanUnity Catalog enabledfalse
use_bulk_sync_staging_schemabooleanUse custom bulk sync staging schema ↓false

databricks_auth_mode

When databricks_auth_mode is access_token
NameTypeDescriptionRequired
access_tokenstringAccess Tokentrue
When databricks_auth_mode is oauth_service_principal
NameTypeDescriptionRequired
service_principal_idstringService Principal IDtrue
service_principal_secretstringService Principal Secrettrue

cloud_provider

When cloud_provider is aws
NameTypeDescriptionRequired
auth_modestringAWS Authentication Method

How to authenticate with AWS. Defaults to Access Key and Secret. Accepted values: access_key_and_secret, iam_role
true
s3_bucket_namestringS3 Bucket Name (destinations only)

Name of bucket used for staging data load files
false
s3_bucket_regionstringS3 Bucket Region (destinations only)

Region of bucket
false
set_retention_propertiesbooleanConfigure data retention for tablesfalse
When cloud_provider is azure
NameTypeDescriptionRequired
azure_access_keystringStorage Account Access Key (destination support only)

The access key associated with this storage account
false
azure_account_namestringStorage Account Name (destination support only)

The account name of the storage account
false
container_namestringStorage Container Name (destination support only)

The container which we will stage files in
false

use_bulk_sync_staging_schema

When use_bulk_sync_staging_schema is true
NameTypeDescriptionRequired
bulk_sync_staging_schemastringStaging schema namefalse

enforce_query_limit

When enforce_query_limit is true
NameTypeDescriptionRequired
concurrent_queriesintegerConcurrent query limitfalse

ssh

When ssh is true
NameTypeDescriptionRequired
ssh_blob_storagebooleanUse SSH for cloud storage bucketfalse
ssh_hoststringSSH hostfalse
ssh_portintegerSSH portfalse
ssh_private_keystringPrivate keyfalse
ssh_userstringSSH userfalse

set_retention_properties

When set_retention_properties is true
NameTypeDescriptionRequired
deleted_file_retention_daysintegerDeleted file retentionfalse
log_file_retention_daysintegerLog retentionfalse

auth_mode

When auth_mode is access_key_and_secret
NameTypeDescriptionRequired
aws_access_key_idstringAWS Access Key ID (destinations only)

See https://docs.polytomic.com/docs/databricks-connections#writing-to-databricks
true
aws_secret_access_keystringAWS Secret Access Key (destinations only)true
When auth_mode is iam_role
NameTypeDescriptionRequired
iam_role_arnstringIAM Role ARNtrue
storage_credential_namestringStorage credential namefalse

Example

1{
2 "name": "Databricks connection",
3 "type": "databricks",
4 "configuration": {
5 "access_token": "isoz8af6zvp8067gu68gvrp0oftevn",
6 "auth_mode": "access_key_and_secret",
7 "aws_access_key_id": "AKIAIOSFODNN7EXAMPLE",
8 "aws_secret_access_key": "wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY",
9 "cloud_provider": "aws",
10 "databricks_auth_mode": "access_token",
11 "enable_delta_uniform": false,
12 "enforce_query_limit": false,
13 "http_path": "/sql",
14 "port": 443,
15 "s3_bucket_name": "s3://polytomic-databricks-results/customer-dataset",
16 "s3_bucket_region": "us-east-1",
17 "server_hostname": "dbc-1234dsafas-d0001.cloud.databricks.com",
18 "set_retention_properties": false,
19 "ssh": false,
20 "unity_catalog_enabled": false,
21 "use_bulk_sync_staging_schema": false
22 }
23}

Read-only properties

NameTypeDescriptionRequired

auth_mode

When auth_mode is access_key_and_secret
NameTypeDescriptionRequired
aws_userstringUser ARN (destinations only)false
When auth_mode is iam_role
NameTypeDescriptionRequired
external_idstringExternal ID for the IAM rolefalse

Model Sync

Source

Configuration

NameTypeDescriptionRequired
catalogstringCatalogfalse
querystringQueryfalse
schemastringSchemafalse
tablestringTablefalse

Example

1{
2 ...
3 "configuration": {
4 "catalog": "samples",
5 "query": "SELECT * FROM samples.nyctaxi.trips",
6 "schema": "nyctaxi",
7 "table": "trips"
8 }
9}

Target

Databricks connections may be used as the destination in a model sync.

All targets

Configuration
NameTypeDescriptionRequired
created_columnstring’Created at’ timestamp columnfalse
preserve_table_on_resyncbooleanPreserve destination table when resyncingfalse
updated_columnstring’Updated at’ timestamp columnfalse
write_record_timestampsbooleanWrite row timestamp metadatafalse
Example
1{
2 ...
3 "target": {
4 "configuration": {
5 "created_column": "",
6 "preserve_table_on_resync": false,
7 "updated_column": "",
8 "write_record_timestamps": false
9 }
10 }
11}

Target creation

Databricks connections may be used to create a new target for a model sync. The following parameters are required to create a new target:

NAMEDESCRIPTIONENUM
nameTable namefalse

Bulk Sync

Source

Databricks connections may be used as a bulk sync source. No additional configuration options are required.

Destination

Configuration

NameTypeDescriptionRequired
advancedobjectfalse
catalogstringCatalogfalse
external_location_namestringExternal locationfalse
mirror_schemasbooleanMirror schemasfalse
schemastringOutput schemafalse

Example

1{
2 ...
3 "destination_configuration": {
4 "advanced": {
5 "deleted_file_retention_days": 0,
6 "empty_strings_null": false,
7 "enable_change_data_feed": false,
8 "hard_deletes": false,
9 "log_file_retention_days": 0,
10 "set_retention_properties": false,
11 "table_prefix": "",
12 "truncate_existing": false
13 },
14 "catalog": "samples",
15 "external_location_name": "",
16 "mirror_schemas": false,
17 "schema": "nyctaxi"
18 }
19}

Type handling

Destination types

POLYTOMIC TYPEDATABRICKS TYPE
array<>ARRAY<>
bigintBIGINT
booleanBOOLEAN
dateDATE
datetimeTIMESTAMP
decimal(precision, scale)DECIMAL(precision,scale)
doubleDOUBLE
intINT
jsonSTRING
jsonarraySTRING
numberDECIMAL(38,18)
object{}STRING
singleFLOAT
smallintSMALLINT
stringSTRING
timeTIMESTAMP

Source types

DATABRICKS TYPEPOLYTOMIC TYPE
ARRAY<>array<>
BIGINTbigint
DATEdate
DECIMAL(precision, scale)decimal(precision, scale)
DOUBLEdouble
FLOATsingle
INTint
INTERVALstring
MAP<>object{}
SMALLINTsmallint
STRUCT<>object{}
TIMESTAMPdatetime_tz
TIMESTAMP_NTZdatetime
TINYINTsmallint
VARCHARstring