Manage your event data with our data retention options.
This guide explains RudderStack’s data retention policy and your options for opting in or out of data storage.
RudderStack’s default policy is not to store your event data, but there are times when it may be useful for you to access or replay recent event data. Refer to the Event Replay guide for more details on this feature.
RudderStack provides 3 options for retaining your event data:
The following sections define different types of RudderStack data and provide steps on opting in to the right setup for your needs.
Data definitions
RudderStack does not permanently store any customer data except the following:
Aggregate “Count” data on Event Name, Event Type, Source ID, Destination ID.
Error codes.
Customer user records (for example, usernames, billing-related details).
The storage options vary with the nature of the data and your RudderStack plan.
All other customer data can be classified as either transient or non-transient and it may either be stored in your location, for example, AWS, or by RudderStack for upto 30 days.
RudderStack’s data retention policy defines data as they pertain to the primary components of its service - the data plane and control plane.
Transient customer data
Transient customer data can be defined as follows:
All data that is in transit, that is, stored for less than 3 hours, as an essential part of delivering the RudderStack product experience.
Data plane: Events that hit the RudderStack gateway. Refer to the data plane architecture for more details.
Control plane: The in-transit data captured in the Live Events tab of the RudderStack dashboard.
Non-transient customer data
Non-transient customer data can be defined as follows:
Data that can persist for more than 3 hours only if configured by the RudderStack user.
Data plane: Processing errors, gateway dumps.
Control plane: Data in the reporting service (sample events, sample responses).
Data retention options
RudderStack provides 3 options for your event data storage. To choose how you want to store the event data, follow these steps:
Choose one of the 3 data storage options in the Data retention section:
The following sections explain the data retention options in detail.
1. Do not store event data
If you choose this option, RudderStack will not store any of your event data. This is the default setting.
2. Store event data in your own cloud storage (Recommended)
This is the recommended event storage option, and available in the Starter, Growth, and Enterprise plans. Selecting this option will bring up a modal allowing you to connect a storage bucket with your RudderStack data.
RudderStack supports storage via AWS, GCS, Azure, and MinIO if you select this option.
When connecting your cloud storage provider to RudderStack, you will first need to create a storage bucket and configure the credentials for RudderStack to access the datastore. Follow the steps listed below depending on your cloud provider:
Connect your storage provider in the RudderStack dashboard.
3. Store event data in RudderStack cloud storage
Choosing this option allows RudderStack to store and delete your event data on a rolling 30-day basis.
Sample event data
When the Sample event data setting is enabled, RudderStack stores and deletes sample events and responses on a rolling 30-day basis. This data may be helpful for debugging your events.
RudderStack does not consider the event name or event type to be Personally Identifiable Information (PII).
Plan-based options
Based on your plan, RudderStack provides different options for event storage, giving you the ability to enable or disable retention for the following kinds of data:
Sample events and responses: As mentioned above, RudderStack will store and delete sample events and responses on a rolling 30-day basis.
Processing errors: These correspond to the events that are rejected at various stages of the data pipeline, including errors from user transformation, destination transformation (internal to RudderStack), and events rejected by the destination after three hours of retry attempts.
Gateway dumps: These correspond to the raw data for every successfully-ingested event.
This site uses cookies to improve your experience while you navigate through the website. Out of
these
cookies, the cookies that are categorized as necessary are stored on your browser as they are as
essential
for the working of basic functionalities of the website. We also use third-party cookies that
help
us
analyze and understand how you use this website. These cookies will be stored in your browser
only
with
your
consent. You also have the option to opt-out of these cookies. But opting out of some of these
cookies
may
have an effect on your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This
category only includes cookies that ensures basic functionalities and security
features of the website. These cookies do not store any personal information.
This site uses cookies to improve your experience. If you want to
learn more about cookies and why we use them, visit our cookie
policy. We'll assume you're ok with this, but you can opt-out if you wish Cookie Settings.