Warning: The Data Prep (Paxata) documentation is now available on the DataRobot public documentation site. See the Data Prep section for user documentation and connector information. After the 2021.2 SP1 release, the content on this site will be removed and replaced with a link to the DataRobot public documentation site.

MS Windows Azure Blob Storage (WASB) Connector Documentation

User Persona: Paxata User - Paxata Admin - Data Source Admin - IT/DevOps

Note: This document covers all configuration fields available during Connector setup. Your Admin may have already filled in some fields at an earlier configuration step, so those fields may not be visible to you. For more information on Paxata’s Connector Framework, see the Connector Framework documentation.

Also: Your Admin may have named this Connector something else in the list of Data Sources.

Configuring Paxata

This connector allows you to connect to an Azure Blob Storage account for Library imports and exports. The following fields define the connection parameters.

General

  • Name: Name of the data source as it will appear to users in the UI.

  • Description: Description of the data source as it will appear to users in the UI.

Something to consider: You may connect Paxata to multiple Azure Blob Storage accounts, and a descriptive name can help users identify the appropriate data source.

Azure Blob Storage Configuration

  • Data Store Root Directory: The root path on the Data Store accessible by this connector. Use "/" to access all files in a Container.
  • Azure Storage Account Name: The name of the storage account to connect to. A storage account name is 3 to 24 characters long and may contain only lowercase letters and numbers.
  • Blob Service Container Name: A container organizes a set of blobs, similar to a folder in a file system. All blobs reside within a container.
  • Map INT96 to Datetime: Converts INT96 fields to Datetime values during import. Specifically, this allows Paxata to read timestamps in Parquet files written by Impala, which stores them as INT96.
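
To illustrate what the Map INT96 to Datetime option involves, the following Python sketch decodes a single INT96 value. It assumes the commonly used Parquet layout: 8 bytes of nanoseconds within the day followed by a 4-byte Julian day number, both little-endian. The function name int96_to_datetime is hypothetical, the result carries no time zone, and this is not a description of how Paxata performs the conversion internally.

```python
import struct
from datetime import datetime, timedelta

# Julian day number that corresponds to the Unix epoch (1970-01-01).
JULIAN_DAY_OF_UNIX_EPOCH = 2440588


def int96_to_datetime(raw: bytes) -> datetime:
    """Decode a 12-byte Parquet INT96 timestamp into a naive datetime.

    Hypothetical helper for illustration only.
    """
    if len(raw) != 12:
        raise ValueError("an INT96 value is exactly 12 bytes")
    # "<qi": little-endian 8-byte nanoseconds-of-day, then 4-byte Julian day.
    nanos_of_day, julian_day = struct.unpack("<qi", raw)
    days_since_epoch = julian_day - JULIAN_DAY_OF_UNIX_EPOCH
    # datetime has microsecond resolution, so finer digits are dropped.
    return datetime(1970, 1, 1) + timedelta(
        days=days_since_epoch, microseconds=nanos_of_day // 1000
    )


if __name__ == "__main__":
    # One day and one hour after the epoch, packed the way a writer would.
    sample = struct.pack("<qi", 3600 * 10**9, JULIAN_DAY_OF_UNIX_EPOCH + 1)
    print(int96_to_datetime(sample))
```

Without this mapping, a reader sees only an opaque 12-byte field, which is why the option matters for Parquet files produced by Impala.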

Azure Blob Storage Authentication Settings

  • Authentication Type: Two authentication methods are supported: Shared Key and Shared Access Signatures. See Azure Storage Services Authentication Types for more details.
    • Shared Key: Authenticate with your storage account access key.
    • Shared Access Signatures (SAS): Authenticate with a shared access signature (SAS) token.
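
The two authentication types differ in what the client sends to Azure. The Python sketch below shows that difference and how the account and container fields above map onto a blob address. It is a minimal illustration, not the connector's implementation. The helper names blob_url and shared_key_header are hypothetical, the blob.core.windows.net host assumes the public Azure cloud, and the string-to-sign for Shared Key is passed in by the caller because building it is outside the scope of this page.

```python
import base64
import hashlib
import hmac
import re
from urllib.parse import quote

# Storage account names: 3 to 24 characters, lowercase letters and numbers only.
ACCOUNT_NAME = re.compile(r"[a-z0-9]{3,24}")


def blob_url(account: str, container: str, blob_path: str, sas_token: str = "") -> str:
    """Build a blob address; append the SAS token as the query string if given."""
    if not ACCOUNT_NAME.fullmatch(account):
        raise ValueError(f"invalid storage account name: {account!r}")
    # Percent-encode the path but keep "/" so virtual folders survive.
    path = quote(blob_path.lstrip("/"))
    url = f"https://{account}.blob.core.windows.net/{container}/{path}"
    if sas_token:
        # SAS: the token itself carries the permissions and the signature.
        url += "?" + sas_token.lstrip("?")
    return url


def shared_key_header(account: str, account_key_b64: str, string_to_sign: str) -> str:
    """Shared Key: sign the request with the account access key (HMAC-SHA256)."""
    key = base64.b64decode(account_key_b64)
    digest = hmac.new(key, string_to_sign.encode("utf-8"), hashlib.sha256).digest()
    signature = base64.b64encode(digest).decode("ascii")
    # Value for the Authorization request header.
    return f"SharedKey {account}:{signature}"


if __name__ == "__main__":
    # Placeholder values only; none of these are real credentials.
    print(blob_url("mystorage", "data", "/sales/2021.csv", "sv=placeholder&sig=placeholder"))
    fake_key = base64.b64encode(b"not-a-real-key").decode("ascii")
    print(shared_key_header("mystorage", fake_key, "GET\n"))
```

In practice, a SAS token can be scoped to a container and given an expiry, whereas the account access key grants access to the whole storage account, which is often the deciding factor between the two types.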

Data Import Information

  • Via Browsing: Supported
  • Via SQL Query: Not Supported