Frequently Asked Questions (FAQs)

Preview

Warehouse Optimization for Databricks is currently in preview. Reach out to Keebo support for access and onboarding.

How Does Warehouse Optimization Connect to a Databricks Environment?

Warehouse Optimization connects using OAuth Machine-to-Machine (M2M) authentication via a service principal. It uses the Databricks REST API to retrieve warehouse metadata and apply real-time optimizations, and queries Databricks system tables to calculate savings. Keebo never accesses organization data directly.
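As a rough illustration of what OAuth M2M authentication with a service principal involves, the sketch below builds the client-credentials token request that Databricks documents for workspace-level access (the `/oidc/v1/token` endpoint with the `all-apis` scope). The host name, client ID, and secret are placeholders, and `build_token_request` is a hypothetical helper, not part of Keebo or any Databricks SDK. The request is only constructed here, not sent.

```python
import base64
import urllib.parse
import urllib.request


def build_token_request(workspace_host: str, client_id: str, client_secret: str) -> urllib.request.Request:
    """Build (but do not send) an OAuth M2M token request for one workspace."""
    # Client-credentials grant; "all-apis" is the scope Databricks documents for M2M.
    body = urllib.parse.urlencode(
        {"grant_type": "client_credentials", "scope": "all-apis"}
    ).encode("ascii")
    # The service principal's client ID and secret travel in an HTTP Basic header.
    basic = base64.b64encode(f"{client_id}:{client_secret}".encode("utf-8")).decode("ascii")
    return urllib.request.Request(
        url=f"https://{workspace_host}/oidc/v1/token",
        data=body,
        headers={
            "Authorization": f"Basic {basic}",
            "Content-Type": "application/x-www-form-urlencoded",
        },
        method="POST",
    )


# Placeholder values (assumptions). A real call would pass `request` to
# urllib.request.urlopen and read "access_token" from the JSON response.
request = build_token_request("example.cloud.databricks.com", "my-client-id", "my-secret")
print(request.get_method(), request.full_url)
```

The returned access token is then sent as a bearer token on subsequent REST API calls. Keebo performs this exchange itself once the service principal credentials are provided during onboarding.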

What Data Does Warehouse Optimization Access?

Warehouse Optimization accesses only the metadata necessary for optimization. Via the REST API, it retrieves warehouse configuration, applies warehouse configuration changes, and reads recent query history as input to its algorithms. Via system tables, it reads warehouse events, warehouse configuration changes, billable usage data, and query history. Every piece of data retrieved serves a specific role in the optimization process.
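To give a sense of the kind of metadata involved, the sketch below builds a query against the Databricks `system.billing.usage` system table that sums daily billable usage for one warehouse. It is an illustrative assumption about what a billable-usage read can look like, not the query Keebo runs. `billable_usage_query` is a hypothetical helper, and the warehouse ID shown is made up.

```python
import re


def billable_usage_query(warehouse_id: str, days: int = 7) -> str:
    """Return SQL that sums daily billable usage for one SQL warehouse."""
    # The ID is interpolated into SQL text, so accept only plain alphanumerics.
    if not re.fullmatch(r"[0-9A-Za-z]+", warehouse_id):
        raise ValueError(f"unexpected warehouse id: {warehouse_id!r}")
    if days < 1:
        raise ValueError("days must be at least 1")
    return (
        "SELECT usage_date, SUM(usage_quantity) AS total_usage\n"
        "FROM system.billing.usage\n"
        f"WHERE usage_metadata.warehouse_id = '{warehouse_id}'\n"
        f"  AND usage_date >= date_sub(current_date(), {int(days)})\n"
        "GROUP BY usage_date\n"
        "ORDER BY usage_date"
    )


# Made-up warehouse ID, for illustration only.
print(billable_usage_query("abc123def456", days=30))
```

Note that the query touches only usage metadata (dates, quantities, warehouse IDs), which is consistent with the statement that no organization data is read.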

How Long Does It Take to See Savings After Connecting Warehouses?

After connecting a Databricks workspace and SQL warehouses, Warehouse Optimization begins optimizing immediately. Savings are typically reflected in the dashboard within 24 hours.

Can Warehouse Optimization Manage Multiple Databricks Workspaces?

Warehouse Optimization supports multiple workspaces and by default uses a single service principal across all connected workspaces. Contact Keebo support if separate service principals per workspace are needed.

What Permissions Does the Keebo Service Principal Need?

The service principal requires:

  • User-level access to each Databricks workspace
  • Access to Databricks system tables (granted via SQL)
  • Read/write access to the Keebo catalog and schema
  • CAN_MANAGE permission on each SQL warehouse to be optimized
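The system-table access in the list above is granted with ordinary Unity Catalog `GRANT` statements. The sketch below generates such statements for a service principal. The exact set of tables Keebo requires is not listed here, so the table names in the example are assumptions based on the data categories this FAQ mentions, and `system_table_grants` is a hypothetical helper. Follow the onboarding instructions from Keebo for the authoritative list.

```python
def system_table_grants(principal: str, tables: list[str]) -> list[str]:
    """Return GRANT statements giving `principal` read access to system tables."""
    # The principal is quoted with backticks, so it must not contain one itself.
    if "`" in principal:
        raise ValueError("principal must not contain a backtick")
    for table in tables:
        parts = table.split(".")
        if len(parts) != 3 or parts[0] != "system":
            raise ValueError(f"expected system.<schema>.<table>, got {table!r}")

    statements = [f"GRANT USE CATALOG ON CATALOG system TO `{principal}`"]
    # One USE SCHEMA grant per distinct schema, then SELECT on each table.
    for schema in sorted({t.rsplit(".", 1)[0] for t in tables}):
        statements.append(f"GRANT USE SCHEMA ON SCHEMA {schema} TO `{principal}`")
    for table in tables:
        statements.append(f"GRANT SELECT ON TABLE {table} TO `{principal}`")
    return statements


# Example table list (an assumption, not Keebo's official requirement).
example_tables = [
    "system.billing.usage",
    "system.compute.warehouses",
    "system.compute.warehouse_events",
    "system.query.history",
]
# The principal is a placeholder for the service principal's application ID.
for statement in system_table_grants("00000000-0000-0000-0000-000000000000", example_tables):
    print(statement + ";")
```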

Can Warehouse Optimization Be Disabled for a Specific Warehouse?

All optimizations can be turned off for any warehouse at any time using the toggle in the "Keebo status" column on the Settings page.

What Happens if a Warehouse's Settings Are Changed Outside of Warehouse Optimization?

Warehouse Optimization automatically detects changes made outside of Keebo to connected warehouses' size, auto-stop, and cluster values. By default, Warehouse Optimization continues optimizing after a change is detected, treating the new setting as the default. This behavior can be configured in the warehouse settings.
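Conceptually, detecting an outside change amounts to comparing the configuration that was last applied with the configuration currently reported for the warehouse. The sketch below is a minimal illustration of that idea and of the default "treat the new setting as the default" behavior. It is not Keebo's implementation. The field names follow the Databricks SQL Warehouses API (`cluster_size`, `auto_stop_mins`, `min_num_clusters`, `max_num_clusters`), while the function names and sample values are hypothetical.

```python
# Settings this FAQ says are monitored: size, auto-stop, and cluster counts.
TRACKED_FIELDS = ("cluster_size", "auto_stop_mins", "min_num_clusters", "max_num_clusters")


def detect_external_changes(last_applied: dict, observed: dict) -> dict:
    """Return {field: (expected, actual)} for tracked fields that differ."""
    return {
        field: (last_applied.get(field), observed.get(field))
        for field in TRACKED_FIELDS
        if last_applied.get(field) != observed.get(field)
    }


def adopt_as_default(defaults: dict, observed: dict, changes: dict) -> dict:
    """Return new defaults in which each externally changed field takes its observed value."""
    updated = dict(defaults)
    for field in changes:
        updated[field] = observed.get(field)
    return updated


# Sample values, for illustration only.
last_applied = {"cluster_size": "Small", "auto_stop_mins": 5, "min_num_clusters": 1, "max_num_clusters": 2}
observed = {"cluster_size": "Medium", "auto_stop_mins": 10, "min_num_clusters": 1, "max_num_clusters": 2}

changes = detect_external_changes(last_applied, observed)
print(changes)  # {'cluster_size': ('Small', 'Medium'), 'auto_stop_mins': (5, 10)}
print(adopt_as_default(last_applied, observed, changes))
```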

Does Warehouse Optimization Support Classic and Pro SQL Warehouses?

Warehouse Optimization currently works only with Databricks Serverless SQL Warehouses. Contact Keebo support with questions about other warehouse types.
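To check ahead of time which warehouses are serverless, the warehouse objects returned by the Databricks SQL Warehouses API include an `enable_serverless_compute` flag. The sketch below filters a list of such objects on that flag. The sample warehouses are made up, `eligible_warehouses` is a hypothetical helper, and how Keebo itself determines eligibility is not documented here.

```python
def is_serverless(warehouse: dict) -> bool:
    """True if the warehouse object reports serverless compute enabled."""
    return bool(warehouse.get("enable_serverless_compute", False))


def eligible_warehouses(warehouses: list[dict]) -> list[dict]:
    """Keep only the serverless warehouses, preserving input order."""
    return [w for w in warehouses if is_serverless(w)]


# Made-up sample data shaped like entries from a warehouse listing.
sample = [
    {"name": "bi-serverless", "warehouse_type": "PRO", "enable_serverless_compute": True},
    {"name": "etl-pro", "warehouse_type": "PRO", "enable_serverless_compute": False},
    {"name": "legacy-classic", "warehouse_type": "CLASSIC"},
]

print([w["name"] for w in eligible_warehouses(sample)])  # ['bi-serverless']
```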