Connecting Your Data Warehouse

Step-by-step guide to connect your data warehouse to DataSentry

Connecting your data warehouse to DataSentry is a straightforward process that enables our optimization engine to monitor and manage your costs effectively. This guide covers the connection process for the most popular data warehouse platforms.

Supported Platforms

DataSentry currently supports the following data warehouse platforms:

  • Snowflake
  • Google BigQuery
  • Amazon Redshift
  • Databricks
  • Azure Synapse Analytics

Connection Process

The connection process follows the same general pattern across all platforms:

  1. Access the Connection Wizard
    Navigate to the "Connect" section of your DataSentry dashboard and click "New Connection".
  2. Select Your Platform
    Choose your data warehouse platform from the list of supported options.
  3. Provide Connection Details
    Enter the required connection parameters for your platform. These typically include:
    • Account identifier or connection string
    • Database and warehouse names
    • Authentication credentials
  4. Configure Permissions
    Grant DataSentry the necessary permissions to monitor and optimize your warehouse. This typically requires:
    • Read access to system tables and usage data
    • Permissions to suspend and resume warehouses (for optimization)
    • Access to cost and performance metrics
  5. Test the Connection
    Verify that DataSentry can successfully connect to your warehouse and access the required information.
  6. Complete Setup
    Configure initial optimization settings and complete the connection process.

Platform-Specific Instructions

Snowflake

For Snowflake connections, you'll need to provide:

  • Account identifier (e.g., xy12345.us-east-1)
  • Username and password, or key-based authentication
  • Database, schema, and warehouse names
  • Role with appropriate permissions

DataSentry will create a service user with the minimum required permissions to monitor and optimize your Snowflake warehouse.

Google BigQuery

For BigQuery connections, you'll need:

  • Project ID
  • Service account credentials (JSON key file)
  • Dataset names to monitor
  • Required IAM roles

DataSentry requires specific IAM roles to access BigQuery usage data and cost information.

Amazon Redshift

For Redshift connections, you'll need:

  • Cluster endpoint and port
  • Database name and username
  • Connection password
  • VPC and security group information

Ensure that your Redshift cluster is configured to allow connections from DataSentry's IP addresses.

Security Considerations

When connecting your data warehouse to DataSentry, consider the following security best practices:

  • Use dedicated service accounts with minimal required permissions
  • Implement network-level security (VPC peering, firewall rules)
  • Use encrypted connections (SSL/TLS)
  • Regularly rotate credentials
  • Monitor access logs for unusual activity

Troubleshooting Connection Issues

If you encounter issues connecting your data warehouse, check the following:

  • Verify that all connection parameters are correct
  • Ensure that the provided credentials have the necessary permissions
  • Check network connectivity and firewall settings
  • Confirm that the data warehouse is running and accessible
  • Review any error messages for specific details

Need Help?

If you're having trouble connecting your data warehouse, please visit our troubleshooting guide or contact our support team.