Unity Catalog in Empower
What is Unity Catalog?
Unity Catalog is a unified security and governance feature set built by Databricks. It provides several critical features that make Empower a more robust data estate solution.
UC enables Delta Sharing, providing the ability to share read-only copies of your data securely both internally and with your partners and associates without requiring them to be on the Databricks platform. UC also provides the ability to create row-filters and column masks for different users and groups in SQL, giving you fine-grained control over your data security and access.
Unity Catalog comes with the 1.14 Empower release to all customers, and as such there are a few required steps necessary to ready the foundation for 1.14's deployment.
Initial Configuration Values
Steps
-
Provide the Empower-Service service principal with Databricks Account Admin access
- Note: You will need an account with either Databricks Account Admin access or Azure Global Access Administrator permissions.
- Login to the account console at https://accounts.azuredatabricks.net.
- Perform this action through the Databricks Account Console:
- In a Databricks workspace, navigate to the top right drop-down menu and select "Manage Account."
- Under the "User Management" tab, click on the "Service Principals" tab and click the blue button on the right-hand side of the screen to add a new service principal.
- Name the service principal "Empower-Service" and use the service principal's Object ID for the UUID field. (this will be different for each directory, please look it up in AAD).
- Once the service principal is added, click on its name and navigate to the "Roles" tab. Then enable "Account Admin."
-
Create a resource group for the Databricks Metastore
This step is optional; you may ask Hitachi Solutions to create this on your behalf.
- This step must be repeated for every Azure region where your company has an Empower solution deployment.
- Keep in mind the following considerations:
- The metastore will be global for all Databricks workspaces in a given Azure region. Therefore, the resource group name need not be Empower-specific e.g: [company name]-[region]-unitycatalog.
- The name will be immutable. Once deployed, the Unity Catalog cannot be moved.
- The name should conform to your company's resource group naming conventions.
- Please provide a hitachi team member with at least reader on this resource group.
- Please provide Empower-Service with Owner over this resource group.
Required Values for Deployment
Please provide the following values to your Hitachi Empower contact for each metastore deployment:
- Resource group name created in Step 2
- Subscription ID that holds the resource group
- Metastore naming: The metastore will require three names provided, which will be deployed by our team.
- Metastore name (lowercase alphanumeric characters and hyphens). Suggested name: {location}-metastore
- Metastore storage account name (lowercase alphanumeric characters only) Suggested name: {location}metastore
- Metastore storage account container name (lowercase alphanumeric characters and hyphens) Suggested name: {location}-metastore
- A /24 block of IP addresses for the unity catalog Vnet. The Vnet itself does not need to be deployed. We require an unused block of IP addresses to use in our automated deployment.
Updated 9 days ago