December 26, 2019
RudderStack is an open-source customer data pipeline tool. It collects, routes, processes data from your websites, apps, cloud tools, and data warehouse. With RudderStack, you can build efficient data pipelines that connect your entire customer data stack and leverage your warehoused data to trigger your analytics and other activation use-cases.
Some of the key features of RudderStack include:
- Complete Flexibility: Unlike most commercial systems that charge you based on the event volume, RudderStack lets you collect all of your event data without worrying about overrunning your budget.
- Warehouse-first Architecture: Most modern companies are building their CDP on top of a data warehouse. RudderStack treats your data warehouse as a first-class citizen among your destinations. It offers advanced features and configurable, real-time sync to safely collect and route your events to your data warehouse.
- Built for Developers: RudderStack is built API-first and easily integrates with the tools that you already use and love.
- High Availability: RudderStack’s sophisticated error handling and retry system ensures all of your event data will be delivered despite any network or destination downtime.
For more information on RudderStack, feel free to join our Slack channel and start a conversation. We’ll love to hear from you!
How to set up RudderStack
The easiest and fastest way to get started with RudderStack is using the Docker setup. However, if you wish to use RudderStack in production environments, we strongly recommend using our Kubernetes Helm charts.
The steps for setting up RudderStack using Docker are as follows:
- Sign up on the RudderStack app. You can easily set up and configure your event data sources and destinations through the RudderStack dashboard. RudderStack self-hosts these configurations and does not charge you for it.
Note: If you want to host your own source and destination configurations, you can use the open-source RudderStack Config Generator. However, note that this open-source dashboard lacks features such as user-defined transformations and live event debugging, which are present in the RudderStack-hosted dashboard.
- Then, copy the workspace token at the top of the dashboard page, as shown:
- Next, download the docker-compose file rudder-docker.yml.
- Open this file, and replace
<your_workspace_token>with the workspace token that you have copied above:
- Finally, navigate to the location where you want to set up RudderStack and run the command
docker-compose -f rudder-docker.ymlup
- To verify if the setup is successful, send test events to your destination by following this guide.
RudderStack's architecture consists of 2 major components:
- Control Plane: This component handles the source and destination configurations and the user-specified connections.
- Data Plane: This is the RudderStack backend - the core engine that collects, transforms, and routes the events to the specified destinations.
Here’s a broad visual representation of RudderStack’s architecture:
For more details on the architecture, check out our documentation.
RudderStack currently supports more than 80 integrations, with newer sources and destinations added to the catalog almost every week.
RudderStack Event Streams allow you to track and collect event data from your websites and applications in real-time. This feature includes client-side SDKs for website, mobile, and server-side event tracking, as well as integrations with some third-party platforms like Looker, PostHog, and Customer.io.
Read more about this feature in our docs.
With RudderStack ETL, you can seamlessly build ELT pipelines from your cloud applications to your data warehouse. RudderStack also gives you the ability to choose what data you want to ingest, and specify the sync time when the data should be loaded into the warehouse.
RudderStack’s Reverse ETL feature lets you leverage the enriched warehouse data as a source for your entire customer data stack. This way, you can send the warehoused data to your preferred customer tools.
Support for 80+ Destinations
With support for over 80 third-party tools and destinations, RudderStack reliably routes all the tracked customer events to your preferred destinations for various activation use-cases like analytics, attribution, marketing, CRM, and personalization. Check out all our supported destinations here.
For an in-depth comparison of RudderStack and Segment check out this post on Marketing Arsenal: An open source Segment alternative? Rudderstack vs Segment
Sign up for Free and Start Sending Data
Test out our event stream, ELT, and reverse-ETL pipelines. Use our HTTP source to send data in less than 5 minutes, or install one of our 12 SDKs in your website or app. Get started.
We'll send you updates from the blog and monthly release notes.