Rudderstack blog
News from RudderStack and insights for data teams

Understanding event data: The foundation of your customer journey
Understanding event data: The foundation of your customer journey
Understanding your customers isn't just about knowing who they are—it's about understanding what they do. This is where clean, accurate event data becomes a foundational part of your business strategy.
Event streaming: What it is, how it works, and why you should use it
by Brooks Patterson
Data standardization: Why and how to standardize data
by Danika Rockett
How Zoopla transformed real estate experiences with data-driven personalization
by Danika Rockett
Subscribe
Get the latest news and updates in data engineering
All
Data Infrastructure
Data Integration
Identity Resolution
Data Enablement
Data Governance
RudderStack Updates
Data Infrastructure

Part 1: The Evolution of Data Pipeline Architecture
This blog talks about ETL, ELT, and the history, present, and future of data pipelines. You will know the good and bad things about various data pipeline approaches.

Part 2: The Evolution of Data Pipeline Architecture
In this post, we will go through a number of changes that have happened, both in the market and in terms of the technologies that are available. We will propose a new architecture that addresses the issues we mentioned in part 1.

The Complete Customer Data Stack: Data Collection (Part 1)
Check out part one of our two-part series on data collection. You'll learn how to collect event data and why categories are important when it comes to data collection.

The Complete Customer Data Stack: Data Collection (Part 2)
In this post, we cover how to collect relational data from both cloud applications and databases, and we explore two other lesser but still important sources of data.

Why Twilio Acquired Segment
In this post, we examine Twilio's strategic acquisition of Segment and explore how the Segment customer data platform fits into Twilio's vision to create an end-to-end marketing cloud.

RudderStack: How Pachyderm Pipelines Help Parse Customer Event Data
The blog shows how Pachyderm leverages real-time customer event data across different sources to gain deeper insights into user behavior in its product and optimize UX.

Kafka Vs. PostgreSQL: How We Implemented Our Queueing System Using PostgreSQL
Apache Kafka wasn’t the right solution for RudderStack’s core streaming/queueing engine. Instead, we built our own streaming engine on top of PostgreSQL. Here, we discuss the internals of our implementation using the queueing system in more detail.

Warehouse-First, the More Secure, Flexible, and Cost-Effective Application Architecture
Learn how eliminating black-box applications and building a warehouse-first data infrastructure can give you more data control, more flexibility with no duplicated data, and lower costs.

Build or Buy? Lessons From Ten Years Building Customer Data Pipelines
This article summarizes what RudderStack CEO Soumyadeb Mitra learned in both building and buying customer data pipelines over the last ten years. Dive in to know more