Snowflake is a cloud-based Data Warehouse solution provided as a Saas (Software-as-a-Service) with full support for ANSI SQL. It also has a unique architecture that enables users to just create tables and start querying data with very less administration or DBA activities needed.
It enables data storage, processing, and analytic solutions that are faster, easier to use, and far more flexible than traditional offerings. The Snowflake data platform is not built on any existing database technology or “big data” software platforms such as Hadoop.
Connecting to Snowflake¶
Mammoth allows you to connect to your Snowflake Database and get the data into Mammoth.
Select API & Databases from the ‘Add Data’ menu and click on Snowflake.
Create a new connection and add your database credentials - Host URL, Username, Password, Database and Warehouse.
Select the Schema you want to work on and click on Next.
Once the connection is established, you will be presented with a list of tables and views in that database.
Select the desired table to get a preview.
Write a SQL query or run a test query and preview the result.
Click on Next.
After you have selected the table you want to work on, you get options to schedulw data imports as discussed in the next section.
Scheduling your Data Pulls¶
You can start retrieving the data now or at a specific time. Further schedule the data imports to get the latest data from your Database at a certain time interval - just once, daily, weekly or monthly.
On every data pull from your Database, you also have an option to either replace the older data or combine with older data.
On choosing Combine with older data option, you will get an option to choose a unique sequence column. Using this column, on refresh, Mammoth will pick up all the rows that have greater value in this column than the previous data pull.
Make sure that Mammoth’s public IP address is added to your whitelist.
Mammoth’s public IP is displayed on the create connection window.