RudderStack Glossary

Familiarize yourself with the RudderStack-specific features and terminology.

This guide lists the definitions of the RudderStack-related terms that you are likely to encounter throughout the documentation and while using RudderStack.

Airflow Provider

Airflow Provider is a tool that lets you programmatically schedule and trigger your Reverse ETL syncs from outside RudderStack and integrate them with your existing Airflow workflows.

Anonymous ID

An anonymous ID is an auto-generated UUID (Universally Unique Identifier) that gets assigned to each unique and unidentified visitor to your website.

Audit logs

Audit Logs give you a detailed log of all user activities happening within your RudderStack. These include various operations related to sources, destinations, transformations, user management, and more.

Cloud Extract

Cloud Extract is RudderStack’s ELT feature. It lets you collect data from various third-party cloud platforms and send it to your data warehouse.

Cloud mode

In this connection mode, the RudderStack SDKs track and send the event data to RudderStack for processing. RudderStack then routes this data to the specified destination. Use this mode when you want to use RudderStack’s Transformations feature to transform your events before sending them to your destination.

Connection

A connection is a one-to-one directional event flow between a RudderStack source and a destination. You can set up different types of connections in RudderStack to send your events:

Control plane

The control plane manages the configuration of your sources and destinations. The interface for the control plane is the RudderStack dashboard (web app).

Control Plane Lite

The Control Plane Lite utility lets you self-host your source and destination configurations. You can manage these configurations by locally exporting to or importing them from a JSON file.

Customer Data Platform (CDP)

A Customer Data Platform (CDP) is software or a collection of tools that unifies and persists all customer records across multiple data sources in a centralized location. It is accessible to other tools in your tech stack and lets you build a comprehensive customer profile for a variety of use cases.

Data governance

RudderStack’s Data Governance capabilities let you access all your events and their metadata programmatically and identify any inconsistencies in them. This includes vital information related to the event schema, event payload versions, data types, and more.

Data plane

RudderStack’s core engine responsible for receiving, processing, and relaying your event data to the specified destination. For more information, refer to the Architecture guide.

Data plane URL

RudderStack requires the data plane URL for routing and processing events in the backend. You can find the URL in the home page of your RudderStack dashboard. For more information, see the Dashboard Overview guide.

Data Plane URL

Data retention

RudderStack’s data retention settings let you define and manage your event data storage. You can disable event data storage completely, store the events in your own cloud storage, or store them in the RudderStack-hosted cloud storage. For more information, refer to the Data Management guide.

Destination

A destination is a tool or platform where you want to send the event data via RudderStack. RudderStack currently supports over 150 Cloud destinations and Warehouse destinations.

Device mode

In this connection mode, you can send the source events to the destinations using the native client-specific libraries present on your website/mobile app. These libraries allow RudderStack to use the data you collect on your device to call the destination APIs without sending it to the RudderStack server first. Use device mode when you want to send events to a destination directly, without any transformation. For more information, see the Connection Modes guide.

ELT

The ELT (Extract, Load, Transform) process involves obtaining the data from the source, replicating it into the target system (typically data warehouse or data lake), and transforming it depending on the use case.

Event

Events are the fundamental components of clickstream data. They correspond to the user actions on websites or mobile apps such as clicks, page or screen views, logins, registrations, etc. Tracking events in real-time helps businesses to better understand the users and their product journey.

Event spec

The RudderStack Event Spec helps you plan your event data and provides various options for tracking your events across all the RudderStack SDKs and APIs. RudderStack has a unified event semantic for different destination platforms, so you can easily translate your event data to different downstream tools by following this spec.

Event Stream

RudderStack’s Event Streams feature lets you collect your event data from all of your web and mobile apps and route it to a wide array of customer tools and data warehouses.

Identity stitching

Identity Stitching is the process of matching different identifiers across multiple devices and digital touchpoints to build a cohesive and omnichannel customer profile. With RudderStack’s warehouse-first architecture, you can send all your cross-platform data to your warehouse and perform identity stitching on it.

Live events

RudderStack’s Live Events feature lets you view the live events collected from your sources and sent to the connected destinations in real-time. With this feature, you can easily debug any errors in the failing events at a destination level and reduce your troubleshooting time and efforts.

Live Events

Models

RudderStack’s Models feature lets you create models by defining custom SQL queries. You can then run these queries on your warehouse and send the resulting data to specific destinations.

Personal access token

Personal Access Token is a unique key associated with your RudderStack account. It is required to consume all the public RudderStack APIs.

New personal access token in RudderStack dashboard

Profiles

Profiles are the collection of all the events associated with a user. RudderStack lets you create and manage these profiles in your warehouse.

Properties

Properties are additional contextual information you can add to a track call to further describe the action a user takes. RudderStack has reserved some standard properties listed in the following table and handles them in a special manner.

Reverse ETL

Reverse ETL is the process of routing the data residing in your data warehouse to various downstream tools within your customer data stack. This includes various SaaS marketing, analytics, sales, and customer support tools.

Source

A source is a platform or an application (web, mobile, server-side, or a third-party cloud app) from where RudderStack tracks and collects your event data.

Tracking plans

Tracking Plans is a RudderStack feature that lets you proactively monitor and act on non-compliant event data coming into your RudderStack sources based on predefined plans. This can help you prevent or de-risk situations where missing or improperly configured event data can break your downstream destinations.

Create blank tracking plan

Traits

Traits are attributes that describe a user. They can be added to an identify call in the traits object. Some examples of traits include age, gender, or some specific details - for example, a user’s product plan (basic, premium, and so on). After making an identify call, you don’t need to include all the user traits in the subsequent calls every time. You can include only the changed/updated traits since the last identify call.

Transformations

Transformationsis a RudderStack feature that lets you leverage custom JavaScript functions to implement a variety of use cases like event filtering, sampling, removing sensitive PII, or implementing custom logic to enrich your events.

User management

RudderStack’s user management feature lets you manage users and their permissions in your RudderStack workspace. It lets you set access controls and collaborate with other members of your organization.

User suppression API

The User Suppression API is RudderStack’s enterprise feature. It lets you programmatically suppress user data identified by a user ID. With this feature, you can block all the user data for all the sources and destinations in RudderStack.

Visual data mapper

Visual Data Mapper (VDM) is RudderStack’s Reverse ETL feature. It offers an intuitive UI to map your data warehouse columns to specific destination fields without any second-guessing.

Warehouse schema

When sending your events to a data warehouse via RudderStack, you don’t need to define a schema. RudderStack automatically does that for you by following a predefined warehouse schema that defines the different tables and columns created based on different event types. For more information, refer to the Warehouse Schema guide.

Workspace ID

RudderStack uses the workspace ID for tracking the data internally. You can find your workspace ID by navigating to Settings > Company in the RudderStack dashboard.

Workspace token

The workspace token uniquely identifies your RudderStack workspace. You can find your workspace ID by navigating to Settings > Workspace in the RudderStack dashboard. The workspace token is hidden by default - you must have administrative privileges to access the token.

Workspace Token

Write key

The write key (or source write key) is a unique identifier for your source. RudderStack uses this key to send events from a source to the specified destination.

JavaScript SDK source write key

Questions? Contact us by email or on Slack