Create a Cumulative Sum Column in MySQL

Question

I have a table that looks like this:

id   count
1    100
2    50
3    10

I want to add a new column called cumulative_sum, so the table would look like this:

id   count  cumulative_sum
1    100    100
2    50     150
3    10     160

Is there a MySQL update statement that can do this easily? What's the best way to accomplish this?

OMG Ponies · Accepted Answer · 2010-04-28 14:41:40Z

119

Using a correlated subquery (x and t are two aliases for the same table):

  SELECT t.id,
         t.count,
         (SELECT SUM(x.count)
            FROM TABLE x
           WHERE x.id <= t.id) AS cumulative_sum
    FROM TABLE t
ORDER BY t.id

Using MySQL variables:

  SELECT t.id,
         t.count,
         @running_total := @running_total + t.count AS cumulative_sum
    FROM TABLE t
    JOIN (SELECT @running_total := 0) r
ORDER BY t.id

Note:

The JOIN (SELECT @running_total := 0) r is a cross join, and allows for variable declaration without requiring a separate SET command.
The table alias, r, is required: MySQL needs an alias for every derived table (a subquery in the FROM clause, also called an inline view).

Caveats:

The variable approach is MySQL-specific; it is not portable to other databases.
The ORDER BY is important: it ensures the rows are accumulated in the order the OP asked for, and it matters even more for more complicated variable usage (e.g. pseudo ROW_NUMBER/RANK functionality, which MySQL lacked before 8.0).
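
As a rough check of the correlated-subquery form, the sketch below runs the same query shape from Python against an in-memory SQLite database and compares it with a client-side running total. SQLite stands in for MySQL here only so the example is self-contained; the table name tab and column name cnt are illustrative assumptions. The user-variable form is not reproduced because @variables are MySQL-specific.

```python
import sqlite3
from itertools import accumulate

# In-memory SQLite database as a stand-in for MySQL (assumption for this sketch).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE tab (id INTEGER PRIMARY KEY, cnt INTEGER)")
conn.executemany("INSERT INTO tab (id, cnt) VALUES (?, ?)", [(1, 100), (2, 50), (3, 10)])

# For each outer row t, the inner query re-reads the table (alias x) for rows up to t.id.
correlated_rows = conn.execute(
    """
    SELECT t.id,
           t.cnt,
           (SELECT SUM(x.cnt) FROM tab AS x WHERE x.id <= t.id) AS cumulative_sum
      FROM tab AS t
     ORDER BY t.id
    """
).fetchall()

# Client-side reference: a single-pass running total over the same values.
expected_totals = list(accumulate(cnt for _, cnt, _ in correlated_rows))

for row in correlated_rows:
    print(row)
```

Because the inner query is evaluated once per outer row, the work grows roughly quadratically with the table size unless the engine optimizes it.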

edited Apr 28, 2010 at 14:41

answered Apr 1, 2010 at 21:54

OMG Ponies

9 Comments

Wacek Over a year ago

I would add "ORDER BY t.id ASC" to the main query, just to make sure it'll always work

Dercsár Over a year ago

My first thought also was to add ORDER BY. But it does not matter. Until addition turns into non-associative, at least :)

Daniel Vassallo Over a year ago

@OMG Ponies: I think you need to use a SELECT in the JOIN (SELECT @running_total := 0) part of the variables example.

allan.simon Over a year ago

For "using a correlated query", where does your table x come from?

Marc L. Over a year ago

Unless there is optimization happening internally, the correlated subquery is the equivalent of a triangular join performing in O(N^2) time--which will not scale.

Andomar · Answer · 2010-04-01 22:08:26Z

99

If performance is an issue, you could use a MySQL variable:

set @csum := 0;
update YourTable
set cumulative_sum = (@csum := @csum + count)
order by id;

Alternatively, you could remove the cumulative_sum column and calculate it on each query:

set @csum := 0;
select id, count, (@csum := @csum + count) as cumulative_sum
from YourTable
order by id;

This computes the running sum in a single pass over the rows :)
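
The variable-based queries above walk the rows once in id order and carry a total forward. Here is a minimal Python analogue of that single pass, with a re-summing version next to it for contrast. The function names and sample rows are hypothetical, and this is a client-side illustration of the idea, not a description of what MySQL does internally.

```python
from typing import List, Tuple

Row = Tuple[int, int]  # (id, count)

def running_totals_single_pass(rows: List[Row]) -> List[Tuple[int, int, int]]:
    """Carry one accumulator forward, like @csum := @csum + count."""
    total = 0
    result = []
    for row_id, count in sorted(rows):  # sorted() plays the role of ORDER BY id
        total += count
        result.append((row_id, count, total))
    return result

def running_totals_rescan(rows: List[Row]) -> List[Tuple[int, int, int]]:
    """Re-sum all rows up to the current id for every row (quadratic work)."""
    ordered = sorted(rows)
    return [
        (row_id, count, sum(c for i, c in ordered if i <= row_id))
        for row_id, count in ordered
    ]

sample = [(3, 10), (1, 100), (2, 50)]
print(running_totals_single_pass(sample))
```

Both functions return the same rows; the single-pass version touches each row once, which is why the variable approach scales better on large tables.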

answered Apr 1, 2010 at 22:08

Andomar

7 Comments

OMG Ponies Over a year ago

Use a cross join to define the variable without needing to use SET.

Kirk Ouimet Over a year ago

My table has 36 million records, so this was really helpful to speed things up!

matt Over a year ago

Note that ordering by cumulative_sum might force a full table scan.

zaitsman Over a year ago

This does work and seems quite fast; any suggestions how this can be extended to do a cumulative sum in a group? e.g. group by Name or similar, and then do a cumulative sum only for records with the same name

Yuki Inoue Over a year ago

In MySQL 8.0+, prefer the window (OLAP) function answer: stackoverflow.com/a/52278657/3090068

Lukasz Szozda · Answer · 2018-09-11 14:49:00Z

53

MySQL 8.0+ and MariaDB 10.2+ support the windowed SUM(col) OVER():

SELECT *, SUM(cnt) OVER(ORDER BY id) AS cumulative_sum
FROM tab;

Output:

┌─────┬──────┬────────────────┐
│ id  │ cnt  │ cumulative_sum │
├─────┼──────┼────────────────┤
│  1  │ 100  │            100 │
│  2  │  50  │            150 │
│  3  │  10  │            160 │
└─────┴──────┴────────────────┘

db<>fiddle
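
The same window function can restart the running total per group by adding PARTITION BY. The sketch below runs through Python's sqlite3 module (SQLite 3.25+ also supports window functions) so that it is self-contained. The name column and the sample rows are assumptions for illustration; the OVER clause itself uses standard syntax that MySQL 8.0+ also accepts.

```python
import sqlite3

# In-memory SQLite database as a stand-in for MySQL 8.0+ (assumption for this sketch).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE tab (id INTEGER PRIMARY KEY, name TEXT, cnt INTEGER)")
conn.executemany(
    "INSERT INTO tab (id, name, cnt) VALUES (?, ?, ?)",
    [(1, "a", 100), (2, "b", 7), (3, "a", 50), (4, "b", 3), (5, "a", 10)],
)

# PARTITION BY name gives each name its own running total, accumulated in id order.
rows = conn.execute(
    """
    SELECT id, name, cnt,
           SUM(cnt) OVER (PARTITION BY name ORDER BY id) AS cumulative_sum
      FROM tab
     ORDER BY id
    """
).fetchall()

for row in rows:
    print(row)
```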

answered Sep 11, 2018 at 14:49

Lukasz Szozda

3 Comments

DatabaseCoder Over a year ago

I was looking for a cumulative sum using a window function. Thank you.

kejo Over a year ago

@lukasz szozda, how would you insert this data into a database table column so it can be used in other tables? Thanks

Lukasz Szozda Over a year ago

@kejo INSERT INTO table_name(id, cnt, cumulative_sum) SELECT ... FROM ... or CREATE TABLE table_name AS SELECT ... FROM ...

Dercsár · Answer · 2010-04-01 21:59:06Z

3

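UPDATE t
JOIN (
 -- Derived table: MySQL rejects a subquery that selects directly from the update target (error 1093).
 SELECT a.id, SUM(b.count) AS running
 FROM t a
 JOIN t b ON b.id <= a.id
 GROUP BY a.id
) s ON s.id = t.id
SET t.cumulative_sum = s.running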
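
A small sketch of the stored-column approach and its maintenance cost, run against in-memory SQLite from Python. SQLite accepts a subquery on the table being updated, whereas MySQL may reject that form with error 1093, so treat this as an illustration of the idea rather than MySQL-ready SQL. The names tab and cnt are illustrative assumptions.

```python
import sqlite3

# In-memory SQLite database as a stand-in (assumption for this sketch).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE tab (id INTEGER PRIMARY KEY, cnt INTEGER, cumulative_sum INTEGER)"
)
conn.executemany("INSERT INTO tab (id, cnt) VALUES (?, ?)", [(1, 100), (2, 50), (3, 10)])

# Recompute the stored running total for every row.
RECOMPUTE = """
    UPDATE tab
       SET cumulative_sum = (SELECT SUM(x.cnt) FROM tab AS x WHERE x.id <= tab.id)
"""

conn.execute(RECOMPUTE)
stored = conn.execute("SELECT id, cnt, cumulative_sum FROM tab ORDER BY id").fetchall()

# A later insert leaves the stored column unset for the new row until it is recomputed.
conn.execute("INSERT INTO tab (id, cnt) VALUES (4, 5)")
stale = conn.execute("SELECT cumulative_sum FROM tab WHERE id = 4").fetchone()[0]

conn.execute(RECOMPUTE)
refreshed = conn.execute("SELECT cumulative_sum FROM tab WHERE id = 4").fetchone()[0]

print(stored, stale, refreshed)
```

The stored value has to be refreshed after every change to the underlying counts, which is the maintenance burden of keeping a derived column in the table.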

answered Apr 1, 2010 at 21:59

Dercsár

1 Comment

Matthew Flaschen Over a year ago

Although the OP did ask for an update, this is denormalized and will probably be inconvenient to maintain correctly.

raisercostin · Answer · 2017-02-13 14:41:13Z

3

select Id, Count, @total := @total + Count as cumulative_sum
from YourTable, (Select @total := 0) as total ;

edited Feb 13, 2017 at 14:41

raisercostin

answered Jul 10, 2014 at 10:06

Ashutosh SIngh

2 Comments

Rohit Gupta Over a year ago

Please explain your answer

raisercostin Over a year ago

The answer works and is a one-liner. It also initializes/resets the variable to zero at the beginning of the select.

Bjarki Heiðar · Answer · 2011-07-04 12:04:05Z

2

Sample query

SET @runtot:=0;
SELECT
   q1.d,
   q1.c,
   (@runtot := @runtot + q1.c) AS rt
FROM
   (SELECT
       DAYOFYEAR(date) AS d,
       COUNT(*) AS c
    FROM  orders
    WHERE  hasPaid > 0
    GROUP  BY d
    ORDER  BY d) AS q1

edited Jul 4, 2011 at 12:04

Bjarki Heiðar

answered Jul 4, 2011 at 11:03

Jazz

Pavan Bashetty · Answer · 2019-02-24 04:24:49Z

2

select id, count, sum(count) over (order by id) as cumulative_sum from tableName;

I apply the SUM aggregate to the count column as a window function by adding an OVER clause, so it produces a running total per row instead of collapsing the rows. The first row is 100, the second is 100+50, the third is 100+50+10, and so on; the last row holds the sum of all rows. Put another way, each row's value is the sum of count over the rows whose id is less than or equal to its own, which is why the window is ordered by id.
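
The column named in the OVER clause's ORDER BY decides which rows count as "previous". The sketch below compares ordering by id with ordering by the count descending, on data where the two differ. It uses in-memory SQLite (3.25+) from Python as a stand-in, and the names tab and cnt are illustrative assumptions.

```python
import sqlite3

# In-memory SQLite database as a stand-in for MySQL 8.0+ (assumption for this sketch).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE tab (id INTEGER PRIMARY KEY, cnt INTEGER)")
# Counts are deliberately not in descending order, unlike the question's sample.
conn.executemany("INSERT INTO tab (id, cnt) VALUES (?, ?)", [(1, 10), (2, 100), (3, 50)])

# Running total in id order.
by_id = conn.execute(
    "SELECT id, cnt, SUM(cnt) OVER (ORDER BY id) FROM tab ORDER BY id"
).fetchall()

# Running total in descending-count order, displayed in id order for comparison.
by_cnt_desc = conn.execute(
    "SELECT id, cnt, SUM(cnt) OVER (ORDER BY cnt DESC) FROM tab ORDER BY id"
).fetchall()

print(by_id)
print(by_cnt_desc)
```

On the question's sample data the two orderings happen to coincide, because the counts already descend as id increases.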

edited Feb 24, 2019 at 4:24

answered Feb 22, 2019 at 1:32

Pavan Bashetty

3 Comments

Tyl Over a year ago

While this might solve the problem, it's better to explain it a bit so it will benefit others :)

Raymond Nijland Over a year ago

This isn't a correlated subquery, or a subquery at all. A correlated subquery has the form SELECT ..., (SELECT ... FROM table2 WHERE table2.id = table1.id) FROM table1. What you have is a window query.

Zhanxiong Over a year ago

A more detailed explanation of this windowing technique: dev.mysql.com/doc/refman/8.0/en/window-functions-frames.html

Greg · Answer · 2016-07-08 07:59:46Z

1

You could also create a trigger that calculates the sum before each insert:

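delimiter |

CREATE TRIGGER calcCumulativeSum BEFORE INSERT ON someTable
  FOR EACH ROW BEGIN
    -- The new row is not in the table yet, so add NEW.count to the sum of the earlier rows.
    -- Assumes NEW.id is supplied explicitly and rows are inserted in increasing id order.
    SET NEW.cumulative_sum = NEW.count + (
      SELECT COALESCE(SUM(x.count), 0)
        FROM someTable x
       WHERE x.id < NEW.id
    );
  END;
|

delimiter ;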

I have not tested this.

edited Jul 8, 2016 at 7:59

answered Apr 1, 2010 at 22:05

Greg

Flavio_cava · Answer · 2020-05-20 14:12:51Z

0

  select t1.id, t1.count, SUM(t2.count) cumulative_sum
    from table t1 
        join table t2 on t1.id >= t2.id
    group by t1.id, t1.count

Step by step:

1 - Given the following table:

select *
from table t1 
order by t1.id;

id  | count
 1  |  11
 2  |  12   
 3  |  13

2 - Join each row to itself and to every earlier row:

select *
from table t1 
    join table t2 on t1.id >= t2.id
order by t1.id, t2.id;

id  | count | id | count
 1  | 11    | 1  |  11

 2  | 12    | 1  |  11
 2  | 12    | 2  |  12

 3  | 13    | 1  |  11
 3  | 13    | 2  |  12
 3  | 13    | 3  |  13

3 - Sum count within each t1.id group:

select t1.id, t1.count, SUM(t2.count) cumulative_sum
from table t1 
    join table t2 on t1.id >= t2.id
group by t1.id, t1.count;


id  | count | cumulative_sum
 1  |  11   |    11
 2  |  12   |    23
 3  |  13   |    36
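
The steps above can be reproduced in a small sketch that also counts the intermediate rows the self-join produces. It uses in-memory SQLite from Python as a stand-in for MySQL; the names tab and cnt are illustrative assumptions (table and count in the answer are placeholders).

```python
import sqlite3

# In-memory SQLite database as a stand-in for MySQL (assumption for this sketch).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE tab (id INTEGER PRIMARY KEY, cnt INTEGER)")
conn.executemany("INSERT INTO tab (id, cnt) VALUES (?, ?)", [(1, 11), (2, 12), (3, 13)])

# Step 2: every row paired with itself and all earlier rows.
pairs = conn.execute(
    """
    SELECT t1.id, t1.cnt, t2.id, t2.cnt
      FROM tab AS t1
      JOIN tab AS t2 ON t1.id >= t2.id
     ORDER BY t1.id, t2.id
    """
).fetchall()

# Step 3: collapse each t1.id group into one row.
totals = conn.execute(
    """
    SELECT t1.id, t1.cnt, SUM(t2.cnt) AS cumulative_sum
      FROM tab AS t1
      JOIN tab AS t2 ON t1.id >= t2.id
     GROUP BY t1.id, t1.cnt
     ORDER BY t1.id
    """
).fetchall()

n = len(totals)
print(len(pairs), n * (n + 1) // 2)  # intermediate rows vs. N(N+1)/2
```

The join materializes N(N+1)/2 row pairs for N input rows, so this form is easy to follow but grows quadratically with the table size.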

edited May 20, 2020 at 14:12

answered May 20, 2020 at 2:25

Flavio_cava

1 Comment

Flavio_cava Over a year ago

Added a step-by-step walkthrough to explain the final query.
