Previously we discussed how to write <a href="https://www.sisense.com/blog/use-subqueries-and-window-functions-to-compute-running-averages/">rolling averages in Postgres</a>. By popular demand, we’re showing you how to do the same in MySQL and SQL Server.



We’ll cover how to annotate noisy charts like this:



<figure class="wp-block-image fancybox"><img decoding="async" src="https://cdn.sisense.com/wp-content/uploads/New-customers-rolling-blog.png" alt="New customers chart" class="wp-image-73876"/></figure>



With a 7-day preceding average line like this:



<figure class="wp-block-image fancybox"><img decoding="async" src="https://cdn.sisense.com/wp-content/uploads/New-customers-7-day-rolling-blog.png" alt="New customers 7 day chart" class="wp-image-73881"/></figure>



<h2 class="wp-block-heading">The Big Idea</h2>



Our first graph above is noisy, which makes the underlying trend hard to read. We can smooth it out by plotting a 7-day average on top of the underlying data. This can be done with window functions, self-joins, or correlated subqueries. We’ll cover the first two.



We’ll start with a preceding average, which means that the average point on the 7th of the month is the average of the first seven days.



Visually this shifts the spikes in the graph to the right, as a big spike is averaged over the following seven days.
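
To make the shift concrete, here is a small Postgres sketch on synthetic data (the day_number column and the single 70-signup spike on day 7 are made up for illustration). The rolling average comes out to 10 on days 7 through 13 and 0 on every other day, so the spike is spread over the week that follows it.

<pre class="wp-block-code"><code>-- Synthetic data: 14 days, with 70 signups on day 7 and none otherwise
select
 day_number,
 value,
 avg(value)
 over (order by day_number asc
 rows between 6 preceding and current row) as rolling_avg
from (
 select day_number, case when day_number = 7 then 70 else 0 end as value
 from generate_series(1, 14) as day_number
) as spike
order by day_number</code></pre>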



<h2 class="wp-block-heading">First, Create an Intermediate Count Table</h2>



We want to compute an average over the total signups for each day. Assuming we have a typical new_customers table with a row per new user and a timestamp column created_at, we can create our aggregate signups table like so:



<pre class="wp-block-code"><code>select
 created_at::date as date,
 count(1) as value
from new_customers
group by 1</code></pre>



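In Postgres and SQL Server, you can use this as a CTE named signups. In MySQL, save it as a regular table or a view named signups: MySQL cannot reference a temporary table twice in the same query, which the self join later in this post requires. Note that ::date is Postgres cast syntax, so MySQL and SQL Server need their own date conversion.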
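
As a rough sketch, the same count query might look like this in the other two dialects. It assumes the new_customers table and created_at column from above; signups is simply the name the later queries expect.

<pre class="wp-block-code"><code>-- MySQL: date() truncates the timestamp to a day.
-- A regular table is used so that it can be self-joined later.
create table signups as
select
 date(created_at) as date,
 count(1) as value
from new_customers
group by 1;</code></pre>

<pre class="wp-block-code"><code>-- SQL Server: cast to date, and repeat the expression in group by,
-- since positional group by is not supported.
with signups as (
 select
  cast(created_at as date) as date,
  count(1) as value
 from new_customers
 group by cast(created_at as date)
)
select date, value
from signups
order by date;</code></pre>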



<h2 class="wp-block-heading">Postgres Rolling Average</h2>



Fortunately, Postgres has window functions, which are the simplest way to compute a running average.



<pre class="wp-block-code"><code>select
 date,
 value,
 avg(value) 
 over (order by date asc
 rows between 6 preceding and current row) as avg
from signups
order by 1 desc</code></pre>



This query assumes that the dates do not have gaps. The query is averaging over the past seven rows, not the past seven dates. If your data has gaps, fill them in with generate_series or by joining against a table with one row per date.
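
One way the gap filling could look in Postgres is sketched below. It assumes signups is available as a table, view, or earlier CTE with date and value columns, and it treats a missing day as zero signups. The result can then be used in place of signups in the window query.

<pre class="wp-block-code"><code>with days as (
 -- one row per calendar day between the first and last signup date
 select generate_series(
  (select min(date) from signups),
  (select max(date) from signups),
  interval '1 day'
 )::date as date
)
select
 days.date,
 coalesce(signups.value, 0) as value
from days
left join signups on signups.date = days.date
order by 1</code></pre>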



<h2 class="wp-block-heading">MySQL Rolling Average</h2>



MySQL versions before 8.0 lack window functions, but we can do a similar computation using self joins. For each row in our count table, we join every row that was within the past seven days and take the average.



<pre class="wp-block-code"><code>select signups.date, signups.value, avg(signups_past.value)
from signups
join signups as signups_past 
 on signups_past.date between signups.date - interval 6 day and signups.date
group by 1, 2</code></pre>



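This query copes with date gaps, as we are looking at rows within a date range rather than the preceding N rows. Keep in mind that a day with no row is left out of the average rather than counted as zero, and it still gets no output row of its own.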
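
If days with no signups should count as zero, one option is to divide the sum over the date range by a fixed 7 instead of calling avg. This is only a sketch, assuming a signups table with date and value columns as produced by the count query. It understates the average for the first six days of data, where fewer than seven days exist.

<pre class="wp-block-code"><code>select
 signups.date,
 signups.value,
 -- dividing by 7 treats days with no row as zero signups
 sum(signups_past.value) / 7 as avg_with_zeros
from signups
join signups as signups_past
 on signups_past.date between signups.date - interval 6 day and signups.date
group by 1, 2</code></pre>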



<h2 class="wp-block-heading">SQL Server Rolling Average</h2>



SQL Server has window functions, so computing the rolling average can be done in either the Postgres style or MySQL style. For simplicity, we’re using the MySQL version with a self join.



This is conceptually the same as in MySQL. The translations are the dateadd function, explicitly named group by columns, and a cast to float, because avg over an integer column in SQL Server returns a truncated integer.



<pre class="wp-block-code"><code>select signups.date, signups.value,
 avg(cast(signups_past.value as float))
from signups
join signups as signups_past 
 on signups_past.date 
 between dateadd(day, -6, signups.date) and signups.date
group by signups.date, signups.value</code></pre>
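
For comparison, the window-function style might look like this in SQL Server. This is a sketch that assumes SQL Server 2012 or later and a signups CTE or table with date and value columns; as in Postgres, it averages over rows, so it expects one row per day.

<pre class="wp-block-code"><code>select
 date,
 value,
 -- cast so the average is not truncated to an integer
 avg(cast(value as float))
 over (order by date asc
 rows between 6 preceding and current row) as rolling_avg
from signups
order by date desc</code></pre>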



<h2 class="wp-block-heading">Other Averages</h2>



We focused on the 7-day trailing average in this post. If we wanted to look at the 7-day leading average with window functions, it’s as simple as sorting the dates in the other direction. If we wanted to look at a centered average, we’d use:



<ul><li>Postgres: rows between 3 preceding and 3 following</li><li>MySQL: between signups.date - interval 3 day and signups.date + interval 3 day</li><li>SQL Server: between dateadd(day, -3, signups.date) and dateadd(day, 3, signups.date)</li></ul>
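
Put together, a Postgres query computing both variants could look like the sketch below, again assuming a gap-free signups table with date and value columns. The leading average here uses an explicit frame of the current row plus six following rows, which selects the same rows as reversing the sort order.

<pre class="wp-block-code"><code>select
 date,
 value,
 -- centered: three days before, the day itself, three days after
 avg(value)
 over (order by date asc
 rows between 3 preceding and 3 following) as centered_avg,
 -- leading: the day itself plus the six days after it
 avg(value)
 over (order by date asc
 rows between current row and 6 following) as leading_avg
from signups
order by 1 desc</code></pre>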



Rolling Averages in MySQL and SQL Server

SQL Superstar
