Unmatched Performance
Here is the performance benchmark using snowflake warehouse.
How we achieved
- DQ rules are push down. We are not sucking the data out of warehouse for validating rules.
- DQ rules are specially fine tune for snowflake.
- All the DQ rules execute natively on snowflake.
- DQ rules are execute in parallel depending on the size of the hardware.
Data Sample
Item |
Description |
Profiling |
16 columns |
Data Quality rules |
18 rules |
Execution Timings
Row count |
Size |
Compute Warehouse |
Time taken |
6 M |
160MB |
Snowflake x-small |
20 secs |
60 M |
1.6GB |
Snowflake medium |
1 min |
600 M |
16GB |
Snowflake large |
4 mins |