Correlation analysis


Correlation analysis is a feature within Funnels that automatically highlights significant factors affecting the conversion rate of users within the funnel.

Funnels can directly compare the conversion rates of users by a specific property via the breakdown feature. For example, you might use a breakdown to see how a user's operating system affects the conversion rate. However, this requires you to already have a hypothesis that the operating system may be a significant factor.

If you're not sure which events or properties might be significant in determining how likely someone is to convert, you can use Correlation analysis. This report automatically highlights significant events and properties. Given a funnel, PostHog calculates the conversion rate for different slices of users (by the events they performed, the properties they have, or the properties of the events they performed) and surfaces any relevant signals.
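To make the idea of slicing concrete, here is a minimal sketch of computing a conversion rate per slice for one property. The `users` records, the `os` property and the `conversion_rate_by` helper are all made up for illustration; this is not PostHog's implementation or API.

```python
from collections import defaultdict

# Hypothetical data: one record per user who entered the funnel.
users = (
    [{"os": "Android", "converted": c} for c in (True, False, False, False)]
    + [{"os": "iOS", "converted": c} for c in (True, True, False, False)]
    + [{"os": "Mac OS X", "converted": c} for c in (True, True, True, False)]
)


def conversion_rate_by(users, prop):
    """Return {property value: (entered, converted, rate)} for one property."""
    entered = defaultdict(int)
    converted = defaultdict(int)
    for user in users:
        value = user.get(prop)
        entered[value] += 1
        if user["converted"]:
            converted[value] += 1
    return {
        value: (entered[value], converted[value], converted[value] / entered[value])
        for value in entered
    }


rates = conversion_rate_by(users, "os")

# Print slices from lowest to highest conversion rate.
for value, (n, c, rate) in sorted(rates.items(), key=lambda item: item[1][2]):
    print(f"{value}: {c}/{n} converted ({rate:.0%})")
```

A breakdown does this for one property you choose; correlation analysis repeats the comparison across many events and properties so you don't have to pick one up front.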

Below is an example of what correlation analysis might look like:

Correlation analysis

Using correlation analysis

Suppose we have a funnel and we want to optimize it. The first thing we might want to do is identify whether there is anything special about users that makes them more or less likely to convert. With correlation analysis we can automatically identify which user properties are significant:

Person property correlations

Above we can see that people using Android and iOS are, respectively, 4.1x and 3.1x less likely to convert in our example funnel. We also see other properties that may be of interest, for instance "Country Name". This gives us some threads to pull on: perhaps we might look into internationalization, or ensure the app loads well in different regions.

Equally, we can see specific events that are correlated:

Event property correlations

Above we see that in our example funnel, users who performed the "opted into navigation redesign" event were 3.1x less likely to convert.

We can drill down further to see how this event's properties are correlated as well. Click the expand event button to the left and we see that people who opted into the redesign and did not have a billing plan were 3.3x less likely to convert.

Where we go from here requires further investigation, but the important point is that we now have threads to pull on, which will hopefully lead to actionable insights.

What to do with this newfound knowledge?

One route for further exploration might be to drill down to the people who were considered for this specific event correlation. We can do so by clicking on the "Completed" and "Dropped off" numbers within a row. In the above case, users in both "Completed" and "Dropped off" will have performed the "opted into navigation redesign" event. "Completed" users fully completed the funnel, whereas "Dropped off" users entered the funnel but failed to make it through to the end.
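As a rough illustration of how "Completed" and "Dropped off" counts can relate to an "Nx less likely to convert" figure, the sketch below computes an odds ratio from a 2x2 table of counts. This is an assumption about one reasonable way to compare the two groups, not a statement of how PostHog calculates its numbers. The function names, the example counts and the +1 added to each cell (to avoid division by zero) are all hypothetical.

```python
def odds_ratio(completed_with, dropped_with, completed_total, dropped_total):
    """Odds of converting for users with a trait, relative to users without it.

    completed_with / dropped_with: users who performed the event (or have the
    property) and completed / dropped off.
    completed_total / dropped_total: all users who completed / dropped off.
    Assumption: 1 is added to every cell so a zero count cannot divide by zero.
    """
    completed_without = completed_total - completed_with
    dropped_without = dropped_total - dropped_with
    odds_with = (completed_with + 1) / (dropped_with + 1)
    odds_without = (completed_without + 1) / (dropped_without + 1)
    return odds_with / odds_without


def describe(ratio):
    """Phrase an odds ratio as 'Nx more/less likely to convert'."""
    if ratio >= 1:
        return f"{ratio:.1f}x more likely to convert"
    return f"{1 / ratio:.1f}x less likely to convert"


# Hypothetical counts: 10 of 200 completers and 60 of 300 drop-offs
# performed the event.
ratio = odds_ratio(completed_with=10, dropped_with=60,
                   completed_total=200, dropped_total=300)
print(describe(ratio))
```

A ratio below 1 means users with the trait convert less often than users without it; a ratio above 1 means they convert more often.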

Another option might be to take, for instance, the "Country Name" property and add it as a funnel breakdown to further explore and understand the effect it has on the funnel:

Breakdown by country

Making correlations accurate

You have the best context on your product and on what is or isn't a relevant signal. Some events and properties will have trivial correlations, or won't provide an actionable signal. To remove these potential distractions, either use the property names drop-down to ignore them in the analysis for a particular funnel, or use the Exclude from project feature, which ensures that you and your team will not see correlations for these events and properties within this project.

It's important to remember that the accuracy, and therefore the usefulness, of any correlation depends on the sample size. If you've only got a handful of users for a specific correlation, it would be wise to try to increase this number. You can do this by, for instance, increasing the date range of the funnel.
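To see why sample size matters, the sketch below puts an approximate 95% confidence interval around an odds ratio using the standard error of the log odds ratio, a common textbook approximation. It is an illustration only, not PostHog's method. The function name and all counts are hypothetical, and the formula assumes every count is greater than zero.

```python
import math


def odds_ratio_interval(completed_with, dropped_with,
                        completed_without, dropped_without, z=1.96):
    """Return (ratio, low, high): an odds ratio with an approximate 95% interval.

    Uses the standard error of the log odds ratio. All four counts must be > 0.
    """
    ratio = (completed_with * dropped_without) / (dropped_with * completed_without)
    std_err = math.sqrt(
        1 / completed_with + 1 / dropped_with
        + 1 / completed_without + 1 / dropped_without
    )
    low = math.exp(math.log(ratio) - z * std_err)
    high = math.exp(math.log(ratio) + z * std_err)
    return ratio, low, high


# Same proportions, 100x more users in the second case (hypothetical counts).
small = odds_ratio_interval(2, 8, 10, 10)
large = odds_ratio_interval(200, 800, 1000, 1000)

for label, (ratio, low, high) in (("small", small), ("large", large)):
    print(f"{label}: ratio={ratio:.2f}, interval=({low:.2f}, {high:.2f})")
```

Both cases give the same ratio, but with only a handful of users the interval includes 1, so the data cannot tell us whether the trait helps or hurts conversion. With the larger sample the interval stays well below 1.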
