What is ordinal Data?

Ordinal data is a type of categorical data where the categories have a defined order or ranking, but the intervals between those categories are not necessarily equal or measurable.

A common example is a customer satisfaction survey that asks respondents to rate their experience as "Very Unsatisfied," "Unsatisfied," "Neutral," "Satisfied," or "Very Satisfied." The ranking is unambiguous. "Very Satisfied" is higher than "Neutral," but the gap in actual sentiment between "Neutral" and "Satisfied" may not be the same size as the gap between "Satisfied" and "Very Satisfied."

This makes ordinal data different from interval or ratio data, where distances between values are consistent and quantifiable. Because ordinal data captures direction but not magnitude, it requires specific statistical approaches, and it appears constantly in the surveys, rating systems, and feedback forms that data teams work with every day.

Key concepts

  • Ordinal data ranks categories in a defined order, but the intervals between those categories are unequal or unknown.
  • The four measurement types—nominal, ordinal, interval, and ratio—form a hierarchy of increasing statistical precision.
  • Median and mode are the correct measures of central tendency for ordinal data; the mean should not be used.
  • Non-parametric statistical tests such as Mann-Whitney U and Kruskal-Wallis are designed for ordinal data; parametric tests like t-tests and ANOVA require interval or ratio data.
  • NPS, CSAT, and Likert scale responses are the most common sources of ordinal data in business and customer analytics workflows.
  • Ordinal ≠ interval: you can say one category ranks higher than another, but you cannot say by how much.

How does ordinal data compare to other data types?

Ordinal data belongs to one of four levels of measurement used in statistics and data work. Understanding where it sits in that hierarchy clarifies both what you can do with it and what you cannot.

Nominal

Ordinal

Interval

Ratio

Definition

Definition

Categories with no order or ranking

Ordered categories; intervals between them are unequal or unknown

Ordered values with equal, measurable intervals; no true zero

Ordered values with equal intervals and a true zero point

Example

Customer segment (Enterprise, Mid-Market, SMB)

CSAT rating (Poor, Fair, Good, Excellent)

Temperature in Celsius; calendar dates

Revenue ($), session duration (seconds), event count

Ordered?

No

Yes

Yes

Yes

Equal intervals?

No

No

Yes

Yes

True zero?

No

No

No

Yes

Appropriate measures of center

Mode

Median, Mode

Mean, Median, Mode

Mean, Median, Mode

Typical analysis methods

Frequency counts, chi-square test

Mann-Whitney U, Kruskal-Wallis, Spearman correlation, ordinal logistic regression

Pearson correlation, t-test, ANOVA

All interval methods, plus geometric mean, coefficient of variation

Each level builds on the one before it. Nominal data names categories whereas ordinal data adds order to those names. Interval data adds equal spacing between values. Ratio data adds a meaningful zero point that allows for statements like "twice as much."

A note on NPS: The three-category NPS classification—Detractor, Passive, Promoter—is ordinal. The underlying 0–10 response scale is commonly treated as ordinal or interval depending on the analysis context. The final NPS score (% Promoters minus % Detractors) is a derived numeric metric and does not itself have ordinal properties. For more on how customer metrics fit into a broader analytics framework, see Customer Data Analytics.

What makes data ordinal?

At the core of ordinal data is a single principle: categories are not just labels. They’re steps along a ranked sequence. This distinguishes ordinal data from nominal data, where categories exist independently with no hierarchy. Both types are categorical, but ordinal data imposes an order that lets you say "this response is higher than that one."

However, the steps are not evenly spaced. Consider a customer satisfaction scale with the options Excellent, Good, Fair, and Poor. It is clear that "Excellent" ranks above "Good," but the difference in actual satisfaction between "Good" and "Excellent" may not be the same as the difference between "Fair" and "Poor." This ambiguity in interval size is the defining limitation of ordinal data, and it’s what separates it from interval and ratio data where each unit on the scale represents a consistent, measurable distance.

Despite this limitation, the presence of order still enables meaningful analysis. You can identify the most common response (mode) and the middle value in a ranked dataset (median). You can compare distributions across groups. And you can apply non-parametric statistical tests specifically designed for data without fixed intervals. Ordinal data is less powerful than interval or ratio data, but it is considerably more informative than nominal data—and far better suited than either for capturing human judgments, preferences, and attitudes.

Ordinal data examples

The following examples illustrate how ordinal data appears across different contexts.

Likert Scales: The most widely used format for measuring attitudes and opinions. A standard five-point Likert scale typically includes "Strongly Disagree," "Disagree," "Neutral," "Agree," "Strongly Agree.” It has a clear order, but the psychological distance between adjacent steps varies by respondent and context. Likert scales are the backbone of product feedback forms, employee engagement surveys, and user research.

Customer Satisfaction (CSAT) Ratings: CSAT surveys ask customers to rate a specific interaction, typically on a scale from "Very Unsatisfied" to "Very Satisfied." The ranking is unambiguous; the interval is not. A customer moving from "Unsatisfied" to "Neutral" has not necessarily experienced the same improvement as one moving from "Neutral" to "Satisfied."

Education Levels: Classifying respondents as "High School," "Some College," "Bachelor's Degree," or "Postgraduate" establishes a clear order by credential attainment. However, the knowledge and skills gained between levels vary considerably and are not equal steps on a uniform scale.

Movie and Product Ratings: Star ratings (1–5) provide an ordered preference signal, but a movie the viewer considers a 3 is not necessarily half as good as a 5-star film. The perceived difference between 3 and 4 stars may be quite different from the perceived difference between 4 and 5.

Military and Organizational Ranks: From Private to General, or from Associate to Vice President, rank hierarchies impose a clear order. But the responsibilities, decision-making authority, and skills required at each level differ considerably and do not increase in uniform increments.

Pain and Symptom Scales: Clinical scales like the 0–10 pain intensity scale or depression symptom severity ratings (None, Mild, Moderate, Severe) are ordinal. A score of 8 is worse than a score of 4, but not necessarily twice as painful.

How is ordinal data collected and represented?

Methods for collecting ordinal data

Surveys and Questionnaires: Surveys are the primary vehicle for ordinal data collection. Likert scales, star ratings, and ranking tasks are standard formats. The structured nature of surveys ensures categories are applied consistently across respondents, which is essential for valid comparison.

Interviews and Structured Observations: In qualitative research, interviewers may assign ordinal scores to participant responses based on pre-defined criteria—for example, rating the severity of a described problem as Low, Medium, or High. Structured observations can similarly yield ordinal data when researchers rank the frequency or intensity of observed behaviors.

Existing Data Sources: Customer feedback databases, product review platforms, support ticket systems, and HR records often contain ordinal data that has already been collected. These sources can be rich inputs for analysis without additional collection effort. See Types of Customer Data for a broader view of the data types that flow through a typical customer data stack.

How to represent ordinal data

Ordered Category Labels: The simplest approach: list categories in their natural sequence, such as "Strongly Disagree → Disagree → Neutral → Agree → Strongly Agree." This preserves the rank order and is self-documenting. It does not imply anything about the size of the intervals.

Numeric Encoding: Assigning integer codes to each category (e.g., 1 through 5) facilitates sorting, filtering, and some forms of analysis. The risk is that these integers may be passed into arithmetic operations that assume equal intervals—for example, averaging the codes. When using numeric encoding, always retain the original label as well, and document the assumed sort order explicitly in your schema or data dictionary.

Visual Representations: Ordered bar charts, stacked bar charts, and box plots are the appropriate choices for visualizing ordinal data. They display the distribution across categories without implying continuous numeric spacing. Line charts should be avoided—they imply a continuous scale with equal intervals between points, which is not a valid assumption for ordinal data.

Data collection best practices

Write clear category definitions. Ambiguous labels produce noisy data. "Good" and "Very Good" are too close in most contexts; "Fair," "Good," and "Excellent" are more meaningfully distinct. Test your categories with a small sample before deploying at scale.

Pilot test before launch. A pilot survey identifies whether respondents interpret the scale as intended. If pilot respondents cluster at the extremes or consistently skip a category, the scale likely needs adjustment.

Use consistent scales across touchpoints. If CSAT is measured at multiple points in the customer journey, use the same scale each time. Mixing a 5-point scale in one survey with a 3-point scale in another makes cross-survey comparison invalid.

Respect respondent privacy. Be transparent about how ordinal response data will be stored, analyzed, and shared. This is especially relevant when survey responses are linked to individual customer records or used for segmentation.

What are the advantages and limitations of ordinal data?

Advantages

Captures order and preference. Ordinal data records not just what something is, but how it ranks relative to other options. This makes it valuable anywhere the direction of a response matters (e.g., how satisfied, how likely, how difficult).

Easy to collect. Surveys, rating scales, and ranking tasks are familiar formats for respondents. Collection infrastructure is straightforward, and ordinal questions can be embedded in broader surveys without adding friction.

Flexible applications. Ordinal data is useful across domains—customer experience, product research, HR, healthcare, academic research—wherever ranked categories are a meaningful representation of the underlying construct.

More informative than nominal. While nominal data tells you what category something falls into, ordinal data adds directional signal. A CSAT rating tells you more than a simple yes/no satisfaction flag.

Limitations

No quantifiable interval. You cannot calculate a precise difference between ordinal categories. Knowing a customer moved from "Fair" to "Good" tells you direction but not magnitude. This rules out many standard statistical operations.

Mean is not technically valid. Calculating the mean of ordinal responses (averaging CSAT scores, for example) assumes equal intervals between categories. This is a common simplification in practice, but it is an assumption, not a mathematical fact.

Susceptible to scale design bias. The way response options are worded and ordered influences what respondents choose. Anchoring effects, label ambiguity, and the number of options all affect the distribution of responses in ways that are hard to disentangle from the underlying attitude being measured.

Limited statistical power. Non-parametric tests, which are required for ordinal data, are generally less statistically powerful than their parametric counterparts. Detecting small effects requires larger sample sizes.

How do you analyze ordinal data?

When working with ordinal variables, the analysis methods you choose must respect the rank order without assuming consistent intervals between categories. The following are the primary approaches.

Measures of central tendency

The median and mode are the appropriate measures of central tendency for ordinal data. The median identifies the middle category when responses are sorted in rank order. The point that divides the dataset into equal halves. The mode identifies the most frequently selected category.

The mean should not be used for ordinal data in its strict form. Calculating an average requires the assumption that the distance between each pair of adjacent categories is the same, an assumption ordinal data does not support. In practice, many teams compute averages for Likert scale responses. This is a pragmatic simplification and is acceptable if explicitly acknowledged as an assumption, particularly when the scale has five or more points.

Distribution analysis

Frequency tables tally the count and percentage of responses in each category. They are the most direct description of how responses are distributed across the ordinal scale and are useful for both internal reporting and communicating results to stakeholders.

Percentiles describe where a given value falls within the full distribution. The 75th percentile response, for example, means 75% of respondents selected that category or lower. Percentile analysis is particularly useful when comparing subgroups—for example, comparing the median CSAT response across different customer segments.

Non-parametric statistical tests for ordinal data

Because ordinal data does not meet the assumptions of normality and equal variance required by parametric tests, standard options like ANOVA and t-tests are not valid. Non-parametric alternatives are used instead.

  • Mann-Whitney U test: Compares the rank distributions of two independent groups. Equivalent in purpose to an independent-samples t-test, but appropriate for ordinal data.
  • Wilcoxon signed-rank test: Compares two related groups (e.g., the same respondents measured at two points in time).
  • Kruskal-Wallis H test: Compares three or more independent groups. Equivalent in purpose to a one-way ANOVA.
  • Spearman's rank correlation: Measures the strength and direction of association between two ordinal variables. An ordinal equivalent to Pearson correlation.

For a deeper look at how statistical methods relate to data analytics practice, see The Difference Between Data Analytics and Statistics.

Ordinal regression

When the goal is to predict an ordinal outcome based on one or more predictor variables, ordinal logistic regression (also called proportional odds regression) is the appropriate technique. It models the cumulative probability of a response falling at or below each category threshold. This is well-suited for modeling outcomes like customer satisfaction tier or support priority level based on behavioral or demographic predictors.

Visualizing ordinal data

The visualization choices for ordinal data should preserve category order without implying equal spacing.

  • Ordered bar charts: Categories displayed in rank sequence, with bar length representing frequency or percentage. The default for most ordinal distributions.
  • Stacked bar charts: Useful for comparing the ordinal distribution of a variable across multiple groups side by side.
  • Diverging bar charts: Particularly effective for Likert scale data, where responses are split around a neutral midpoint and displayed in both directions.
  • Box plots: Display the median and interquartile range, which are meaningful for ordinal data. Useful for comparing central tendency and spread across groups.

Note: Avoid line charts for ordinal data. They imply a continuous numeric scale with uniform intervals, which is a property ordinal data does not have.

Ordinal data in business and customer analytics

For data engineers, analysts, and product teams, ordinal data is most visible in the feedback and measurement systems that drive day-to-day decisions. These are the most common patterns you will encounter in a business data environment.

NPS (Net Promoter Score)

NPS surveys ask customers to rate their likelihood to recommend a product on a 0–10 scale, producing three ordinal categories: Detractors (0–6), Passives (7–8), and Promoters (9–10). The categories are ordered—Promoters rank above Passives, who rank above Detractors—but the emotional distance between a score of 6 and a score of 7 is not assumed to equal the distance between a score of 9 and a score of 10. When storing NPS data in a warehouse, retain the raw numeric response alongside the derived category to preserve analytical flexibility downstream.

CSAT (Customer Satisfaction) Surveys

CSAT surveys typically use a 5-point ordinal scale: “Very Unsatisfied,” “Unsatisfied,” “Neutral,” “Satisfied,” “Very Satisfied.” The standard CSAT score (calculated as the percentage of respondents who selected the top one or two tiers) is a derived metric built from underlying ordinal responses. Because the raw responses are ordinal, averaging them directly involves the equal-interval assumption noted above. Reporting the median response alongside the derived score is a more statistically rigorous practice.

Likert scales in product feedback

Likert scales are the most common format for product feedback forms, onboarding surveys, and user research. A question like "How easy was it to complete this task?" on a scale from "Very Difficult" to "Very Easy" produces ordinal data. Product teams that compute average Likert scores should recognize this as a deliberate simplification. For formal user research, median and frequency distribution are the more defensible summary statistics.

Feature prioritization and rank-order user research

When researchers ask participants to rank features in order of importance, the output is ordinal. A feature ranked first is more important than one ranked third, but not necessarily three times more important. Rank-order data from user research is best analyzed by looking at frequency of top-rank placement across respondents rather than averaging rank positions.

Customer segmentation and health scoring

Ordinal data surfaces frequently in segmentation and health scoring. Customer health scores (At Risk, Neutral, Healthy), support ticket priority levels (P1, P2, P3), and subscription plan tiers (Free, Starter, Pro, Enterprise) are all ordinal variables. When these are modeled in a data warehouse, encoding them as integers enables sorting and range filtering but risks unintended arithmetic operations downstream. Defining explicit sort keys or using ordered categorical types in your semantic layer prevents charts and aggregations from treating these values as continuous numeric data.

Storing and querying ordinal data in a pipeline

When ordinal data flows through a customer data pipeline—raw ingestion, dbt transformation, and BI layer—a few practices reduce downstream misuse:

  • Store the original text label alongside any numeric encoding so the rank order is self-documenting.
  • Define ordinal dimensions with explicit sort orders in your semantic layer so charts preserve categorical sequence rather than defaulting to alphabetical sorting.
  • Avoid applying SUM or AVG aggregations to encoded ordinal values without a comment noting the equal-interval assumption being made.
  • When joining ordinal survey responses to behavioral event data for segmentation or modeling, treat the ordinal value as a categorical feature rather than a numeric one unless a deliberate interval assumption is documented.

Ordinal data encodes one of the most practical signals in analytics: direction. It cannot tell you how much things differ, but it reliably tells you which is more. Approached with the right statistical methods and clean data infrastructure, ordinal data from surveys, feedback forms, and rating systems can surface patterns in customer sentiment, product performance, and user experience that would otherwise remain invisible in aggregate counts alone.

For more on how different customer data types flow through a modern data stack, see Customer Data Analytics.

FAQs

  • Ordinal data is a type of categorical data where the categories have a defined order or ranking, but the distances between categories are not equal or measurable. Common examples include customer satisfaction ratings, Likert scale survey responses, and subscription tier classifications (Free, Starter, Pro, Enterprise).

  • Nominal data assigns items to categories with no implied ranking. For example, customer industry (SaaS, retail, healthcare) or product category. Ordinal data adds a rank: the categories are ordered, but the gaps between them are not quantified. Both are categorical data types, but ordinal carries directional information that nominal does not.

  • Both ordinal and interval data have ordered categories, but interval data has equal, measurable distances between each value. Temperature in Celsius is interval data: the gap between 20°C and 30°C is identical to the gap between 30°C and 40°C. A satisfaction rating from "Poor" to "Excellent" is ordinal: the step from "Fair" to "Good" does not represent the same amount of change as the step from "Good" to "Excellent."

  • Technically, you should not. Computing a mean assumes equal intervals between values, which ordinal data does not have. The median and mode are the statistically appropriate measures of central tendency. In practice, many teams average Likert scale responses as a simplification. This is acceptable if the equal-interval assumption is explicitly acknowledged, but it is an assumption, not a mathematical given.

  • Ordinal data requires non-parametric statistical tests that do not assume equal intervals or a normal distribution. Common choices include the Mann-Whitney U test (two independent groups), the Wilcoxon signed-rank test (two related groups), the Kruskal-Wallis H test (three or more groups), and Spearman's rank correlation (relationship between two ordinal variables). Ordinal logistic regression is used when modeling an ordinal outcome variable.

  • Likert scale data is technically ordinal. Response options from "Strongly Disagree" to "Strongly Agree" have a clear order, but the psychological distance between adjacent steps is not guaranteed to be equal. Some researchers treat Likert data as interval for convenience, particularly when using multi-item composite scales. For rigorous analysis, non-parametric methods are the more defensible choice.

  • The three-category NPS classification (Detractor, Passive, Promoter) is ordinal. The underlying 0–10 response scale is commonly treated as ordinal or interval depending on the analysis. The final NPS score, calculated as the percentage of Promoters minus the percentage of Detractors, is a derived numeric metric and does not itself have ordinal properties.

  • Common business examples include CSAT survey ratings (Very Unsatisfied to Very Satisfied), NPS response categories (Detractor, Passive, Promoter), Likert scale product feedback questions, customer health scores (At Risk, Neutral, Healthy), support ticket priority levels (P1–P4), and subscription plan tiers (Free, Starter, Pro, Enterprise).

  • Ordered bar charts, with categories arranged in their natural ranked sequence, are the standard choice. Stacked bar charts work well for comparing ordinal distributions across groups. Diverging bar charts are effective for Likert scale data centered around a neutral midpoint. Box plots can display median and interquartile spread. Line charts should be avoided for ordinal data because they imply continuous, equally-spaced values between points.

Can't find what you're looking for? Give us a shout!