Glossary

This guide lists the definitions of the RudderStack-related terms that you are likely to encounter throughout the documentation and while using RudderStack.

Audit logs

Audit Logs is RudderStack's enterprise feature that gives you a detailed log of the user activities happening within your RudderStack workspace. These activities include the various user operations related to sources, destinations, transformations, and more.

For more information on the audit logs feature, refer to the Audit Logs guide.

Cloud Extract

Cloud Extract is RudderStack's ELT feature that lets you collect your raw events and data from various third-party cloud platforms and send it to your data warehouse.

For more information on the various Cloud Extract sources and how to set them up in RudderStack, refer to the Cloud Extract documentation.

Connection (Pipeline)

A connection is a one-to-one directional flow of events between a RudderStack source and a destination.

For more information on sources and destinations in RudderStack, refer to the RudderStack Cloud section.

You can set up different types of connections in RudderStack to send your events, based on the type of source:

  • Event Stream: One source to many destinations
  • Cloud Extract: Multiple sources to one warehouse destination
  • Reverse ETL: One warehouse source to one downstream destination (this is mainly due to the mappings required when setting up the connection)

Connection modes (cloud mode and device mode)

You can send the event data from your sources to your desired destinations via RudderStack in two ways:

  • Cloud mode: In this mode, the RudderStack SDKs track and send the event data directly to the RudderStack server for processing. RudderStack processes this data and routes it to the desired destination. This mode is useful when you want to leverage the Transformations feature to transform your events before sending them over to the destinations.
  • Device mode: In this mode, you can send the source events to the destinations using the native client-specific libraries on your website/mobile app. These libraries allow RudderStack to use the data you collect on your device to call the destination APIs without sending it to the RudderStack server first. This mode is useful when you want to send the events to a destination directly, without any transformation.

These two modes are commonly referred to as RudderStack connection modes.

For more information, refer to the Connection Modes guide.

Control plane

The control plane manages the configuration of your sources and destinations. The interface for the control plane is the RudderStack web app.

For more information on the control plane, refer to the Architecture guide.

Control Plane Lite

RudderStack's control plane offers an intuitive UI to configure your event data sources and destinations.

If you want to self-host these configurations, you can use the open source Control Plane Lite utility to set up your control plane. You can then manage the source and destination configurations locally by exporting to or importing them from a JSON file.

For information on Control Plane Lite and how to use it, refer to the Control Plane Lite guide.

Customer Data Platform

A Customer Data Platform (CDP) is a software/collection of tools that unifies and persists all the customer records across multiple data sources in a centralized location accessible to other tools in your tech stack.

A CDP lets you build a comprehensive customer profile and use the insights for a variety of use-cases.

Data governance

RudderStack's Data Governance feature gives you the ability to access all your events and their metadata programmatically and identify any inconsistencies in them. This includes vital information related to the event schema, event payload versions, data types, and more.

Data plane

The data plane is RudderStack's core engine responsible for:

  • Receiving and buffering the event data
  • Transforming the events into the required destination format
  • Relaying the events to the destination

For more information on the data plane, refer to the Architecture guide.

The data plane is intentionally separated from the control plane to give you complete control and ownership of your data. Depending on how RudderStack is deployed, you can set up your control plane and data plane with one of the following approaches:

  • RudderStack Cloud: RudderStack hosts both control plane and data plane
  • RudderStack Open Source: RudderStack hosts the control plane and the user hosts the data plane. Alternatively, the user hosts both the control plane and the data plane.

Data plane URL

For routing and processing the events to the RudderStack backend, a data plane URL is required.

Refer to the Architecture guide for more information on the RudderStack data plane.

An open source data plane URL typically looks like http:localhost:8080 where 8080 is the port where your RudderStack data plane is hosted.

Data Regulation API

Data Regulation API is RudderStack's enterprise feature that lets you programmatically suppress user data identified by a user ID. With this feature, you can block all the user data for all the sources and destinations in RudderStack.

Refer to the Data Regulation API guide for more information.

Destination

A destination is a tool or application where you want to send the data via RudderStack.

RudderStack currently supports over 150 destinations. These include data warehouses, analytics platforms, CRM tools, marketing platforms, and more.

Refer to the Destinations guide for the complete list of the supported destinations.

ELT (Extract, Load, Transform)

The ELT (Extract, Load, Transform) process can be defined as:

  • Extract: Obtaining data from the source platform or application.
  • Load: Replicating the data from the source into the target system, typically a data warehouse or a data lake.
  • Transformation: Transforming the data in the desired format according to the business requirement/use-case.

Refer to the RudderStack blog to read more about ELT and how RudderStack facilitates it via the Cloud Extract feature.

Event

Events are the fundamental components of clickstream data. They correspond to the user actions on websites or mobile apps such as clicks, page/screen views, logins, registrations, etc.

Tracking events in real-time helps businesses to better understand the users and their product journey. This, in turn, allows businesses to deliver better recommendations, send relevant push notifications, and enhance user experience.

Refer to the RudderStack blog post to read more about clickstream analytics.

Event spec

The RudderStack event spec helps you plan your event data and provides various options for tracking your events across all the RudderStack SDKs and APIs. RudderStack has a unified event semantic for different destination platforms, so you can easily translate your event data to different downstream tools by following this spec.

For more information on the RudderStack event spec, refer to the Event Spec guide.

Event Streams

RudderStack's Event Streams feature lets you collect your event data from all of your web and mobile apps and route it to a wide array of customer tools and data warehouses via RudderStack.

For more information on the various Event Streams sources supported by RudderStack, refer to the Event Streams guide.

Live Events

RudderStack's Live Events feature is a debugger that shows the live events collected from your sources and sent to the connected destinations in real-time. With this feature, you can easily debug any errors in the failing events at a destination level and reduce your troubleshooting time and efforts.

Broadly speaking, this feature can be further classified into two major categories:

  • Source Live Events: This feature gives you real-time visibility into the source events collected by RudderStack. This way, you can confirm if your source is configured correctly and is collecting & sending data as expected.
  • Destination Live Events: When routing events to a destination, sometimes events don't show up in your destination. This feature gives you real-time visibility into the destination's responses and helps you troubleshoot the problem.

Reverse ETL

Reverse ETL is the process of routing the data residing in your data warehouse to various downstream tools within your customer data stack. This includes various SaaS marketing, analytics, sales, and customer support tools.

Refer to the RudderStack blog to read more about Reverse ETL. For more information on the supported Reverse ETL sources and setting them up in RudderStack, refer to the Reverse ETL documentation.

SDK

RudderStack offers SDK support for your web, mobile, and server-side sources and lets you track your event data seamlessly.

For the SDK-specific documentation, refer to the SDK Integration Guides.

Source

A source is a platform or an application (web, mobile, server-side, or a third-party cloud app) from where RudderStack tracks and collects your event data.

For more information on RudderStack sources, refer to the Sources guide.

Visual Data Mapper

The Visual Data Mapper (VDM) is RudderStack's Reverse ETL feature. It offers an intuitive UI to map your data warehouse columns to specific destination fields without any second-guessing.

For more information on this feature, refer to the Visual Data Mapper guide.

Teammates (User Management)

RudderStack's Teammates feature lets you add and manage other users in your RudderStack workspace. It facilitates easier collaboration between you and other team members of your organization while using RudderStack.

For more information on this feature, refer to the Teammates guide.

Transformations

RudderStack's Transformations feature lets you leverage your custom JavaScript functions that you can use to implement a variety of use-cases like:

  • Filtering or sampling events
  • Implementing a static logic to enrich your events
  • Removing any sensitive PII information from your customer events, and a lot more.

For more details on this feature and how to use it, refer to the Transformations guide.

Warehouse destination

RudderStack supports sending events to all the leading data warehouses like Redshift, Azure Synapse, BigQuery, Snowflake, PostgreSQL, ClickHouse, and SQL Server. These are called the warehouse destinations.

For more information on how to set up these warehouse destinations, refer to the Warehouse Destinations guide.

Warehouse schema

When sending your events to a data warehouse via RudderStack, you don't need to define a schema for your events before sending them from your source. RudderStack automatically does that for you by following a predefined warehouse schema. This schema defines the different tables and columns created based on different types of events.

Refer to the Warehouse Schema guide for more details.

Token

The token (also referred to as the workspace token) is a unique identifier of your RudderStack workspace. You can find it by logging in to your RudderStack dashboard.

Write key

The write key (also referred to as the source write key) is a unique identifier for your source. It is used while sending events from a source to your specified destination via RudderStack.

Workspace Token vs Source Write Key

For more information on finding the write key for your source, refer to the Sources guide.

Contact us

For any questions on any of the topics covered in this guide, you can contact us. You can also start a conversation in our Slack community.