Databricks_Logo

Your Models + Databricks

An architectural game-changer for spreadsheet-driven solutions

 

Databricks-technology-partner-badge-validate-2xClearFactr offers seamless support for driving your models with limitless amounts of data stored in Databricks.

And while we know you can do this completely on your own, we'd welcome the chance to bring our decades of application and solution building experiences surrounding data modeling and design into a discussion, all to help assure your maximum success. Driving arbitrarily-complex spreadsheet compute models with external data might be new to your enterprise. But once you get started, you'll wonder how you did it any other way.

In fact, we're so convinced this can be such a revolutionary approach for solution building, empowering teams across the enterprise to collaborate like never before, we wrote a short white paper about it.

You'll want to check out this page, too, for more information about this important aspect of ClearFactr and how it can transform your data and compute solutions.

It's important to note that unlike BI tools that are designed to scroll through tons of data, ClearFactr is a spreadsheet in the traditional compute-model sense, one with arbitrary complexity and flexibility designed entirely by you and your team.

For example, here's a model that contains 4 queries (denoted by the purple cells, automatically highlighted by the External Cell Type button in the toolbar):


DatabricksExample-ExternalCellTypes

The two query cells on the left are driven by a Buy Side and Sell Side State, "NH" and "AR" in this case. When the summary data for those states is retrieved, a checksum value is computed by the model in the middle of the screen. The Cell Ancestry tool shows the downstream effects of one of those:

DatabricksExamples-CellAncestry

For both the Buy Side and Sell Side states, some mortgage detail data is brought in on the right side of the screen, qualified by the checksum values. Some differences between the Buy and Sell sides are shown in row 25.

When we select a different Sell Side state, that data changes as well as everything downstream:

DatabricksExample-NewData

Now we're looking at NH versus CT. Note that all of the Derivative calculations in the lower right have changed, too.

Using the Sensitivity Analyzer, we can better understand how Factor A controls the Derivative calculations:

DatabricksExample-SensitivityAnalyzer

Note that most cells aren't affected... they're the ones that have been grayed out.

All of this can be called and manipulated programmatically via our REST-API, too.

databricks-2

Getting Started

Enabling all of the above is as simple as configure a Databricks-specific DataSource, via this panel, accessible from the Developers menu while viewing any model. Note this is a particularly powerful, and thus, ClearFactr Administrator enabled feature of the platform. We'll discuss all of this as we get you and your company onboarded.

ClearFactrDataSourceEditor-1

The Details

When logged into your Databricks account, here's how to find the specific items you'll need for your DataSource:

  1. Deployment Name: use the GUID at the far left of your browser URL, such as 'abc-123ab4de-1b9a'. You don't need the '.cloud.databricks.com'
  2. Warehouse Name: use the GUID found under 'SQL Warehouses' in the 'Compute' section.
  3. Depending on your chosen Auth Type, you'll need either the username/password of your Personal Access Token, or the OAuth ClientID and Client Secret for your OAuth-enabled connection.
    1. For PAT, look under User icon / Settings / User / Developer / Access Tokens
    2. For OAuth U2M, look under Organization menu / Manage account / Settings / App Connections
  4. For OAuth M2M, look under Organization menu / Manage account / User Management / Service Principals
  5. For all Auth Types, you'll use the 'Test' button to validate your credentials.
  6. For OAuth U2M connections, please specify 'https://cfdemoapp.com/oauth/cdw_redirect' as the Redirect URL


Note: When testing, if you're already logged into Databricks in a browser window, you'll immediately see a JSON Access Token upon test completion. If you're not logged in, the testing process will have you authenticate yourself via Databricks' site.

Driving your Models...

Once your Databricks DataSource is configured, you'll be able to reference it in one or more CFSQL functions to bring data into your model on the fly. Click here to learn more about the Query Builder tool that facilitates this.

Please contact the ClearFactr team with any and all comments, suggestions and requests.