NEWNative Support for Conversational Data in Label Studio Enterprise
Back to integrations

Databricks Unity Catalog Integration Enterprise integration

Databricks (Enterprise) Overview

Databricks helps teams manage data and AI workloads at scale with Unity Catalog (UC) providing centralized governance and access control. For AI teams using Label Studio Enterprise, this integration connects directly to Databricks UC Volumes so you can move data into annotation workflows and send results back to your governed storage.

  • With Label Studio Enterprise, you can import files from UC Volumes as tasks and export completed annotations as JSON to the same volumes. This keeps your datasets, labels, and review history inside your Databricks environment for consistent governance and lineage.

What you can do

  • Import files from Databricks Unity Catalog Volumes as labeling tasks
  • Export annotations as JSON back to your UC Volumes
  • Keep data flows inside your governed Databricks environment
  • Pair with Label Studio’s review and QA to improve dataset quality before training

How it works

This connector uses the Databricks Files API to read from and write to UC Volumes. It operates only in proxy mode. Presigned URLs are not supported by Databricks, so all access runs through the proxy for secure transfer.

Availability

This integration is available in Label Studio Enterprise only.

Related Integrations

Azure Blob Storage

Azure cloud storage for data labeling

Pachyderm

Automated versioning for data labeling

Google Cloud Storage

Google cloud storage for data labeling

S3

AWS cloud storage for data labeling