Snowflake Sample Data
Sample data for Snowflake
RudderStack provides a sample data set for the Snowflake warehouse, available in the Snowflake marketplace. You can use this data to run the Profiles project and Propensity Scores through the UI or the CLI.
The number of columns in this data set are intentionally limited to make the data set easily understandable. Also, all email addresses are generated randomly and no PII is used in the generation of this data set.
The following tables, properties, and user information is included in the data set:
Tables
This data set includes below-mentioned RudderStack event data tables:
PAGES
- Page view events from anonymous and known users.TRACKS
- Summarized tracked user actions (like login
, signup
, order_completed
, etc.).IDENTIFIES
- Identify calls run when a user provides a unique identifier (i.e., upon signup
).ORDER_COMPLETED
- Detailed payloads from tracked order_completed
events.
As of January 2023, here are the approximate number of rows in each table:
PAGES
: ~43kTRACKS
: ~14kIDENTIFIES
: ~4.8kORDER_COMPLETED
: ~2.2k
These volumes follow the pattern of a normal eCommerce conversion funnel (pageview, signup, order). Specifically, here’s a rough breakdown of the user journey by volume:
- 30% - Never sign in
- 10% - Sign in but never add an item to cart
- 40% - Add to cart and abandon
- 20% - Make purchases
Note that this data includes future data until Apr 2024, and starts in June 2023. This is to ensure that future users can still run the project with ‘current’ data. RudderStack team will refresh the data periodically throughout the year.
Properties
This data set includes a subset of the standard properties found in the Warehouse schema spec for each table. The required columns for running Profiles and Predictions projects are also present.
The user data includes a subset of our standard properties for identify
calls.
This data set contains a total of ~10k unique users by anonymousId
. About half of these unique users (~4.8k) are known users (with an associated identify
call).
Questions? Contact us by email or on
Slack