
# Counting rows in runs


I have a table with fields MarketID and XY, where MarketID is a unique character field and, for this example, XY can only contain "X" or "Y". My example data is this:

```
MarketID  XY
--------  --
1         X
2         X
3         X
4         Y
5         Y
6         X
7         X
```

What I need to do is count the consecutive occurrences of each XY value, in MarketID order, to end up with a result something like this:

```
XY    Count
--    -----
X       3
Y       2
X       2
```

These are called "runs" or "sequences" and the SQL can sure be tricky.

In this table, the MarketID column provides the inherent ordering of the data. Imagine the IDs as points on a line:

`------o------o------o------o------`

Now we will self-join this table:

```
  from Market_XY as t1
inner
  join Market_XY as t3
    on t3.MarketID > t1.MarketID
   and t3.XY = t1.XY
```

We will pair up each row (t1) with all other rows (t3) that have a higher ID and the same XY. Sharing the same XY makes the two rows the potential start and end of a run:

```
------o------o------o------o------
      |      |      |      |
      t1     t3---->t3---->t3---->
```
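To see what this first self-join produces, here is a minimal sketch that loads the sample data into an in-memory SQLite database through Python's built-in `sqlite3` module. The choice of SQLite and the integer type for MarketID are assumptions made for illustration (the question describes MarketID as a character field), and the aliases `loID` and `hiID` are borrowed from the final query further down.

```python
import sqlite3

# Sample data from the question, loaded into an in-memory SQLite database.
# Assumption: MarketID is stored as an integer so that ">" compares numerically.
conn = sqlite3.connect(":memory:")
conn.execute("create table Market_XY (MarketID integer primary key, XY text)")
conn.executemany(
    "insert into Market_XY values (?, ?)",
    [(1, "X"), (2, "X"), (3, "X"), (4, "Y"), (5, "Y"), (6, "X"), (7, "X")],
)

# Pair every row (t1) with every later row (t3) that has the same XY.
pairs = conn.execute("""
    select t1.MarketID as loID
         , t3.MarketID as hiID
      from Market_XY as t1
    inner
      join Market_XY as t3
        on t3.MarketID > t1.MarketID
       and t3.XY = t1.XY
    order
        by loID
         , hiID
""").fetchall()

print(len(pairs))  # 11
print(pairs)
```

Note that the list still contains pairs such as (1, 6) and (1, 7), which span the Ys at 4 and 5. At this stage the join only finds candidates; nothing has yet checked what lies between the two ends.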

Next, we self-join once more, but with a LEFT OUTER JOIN, using an IS NULL condition in the WHERE clause to make sure no matches are found:

```
  from Market_XY as t1
inner
  join Market_XY as t3
    on t3.MarketID > t1.MarketID
   and t3.XY = t1.XY
left outer
  join Market_XY as t2
    on t2.MarketID between t1.MarketID and t3.MarketID
   and t2.XY <> t3.XY
 where t2.MarketID is null
```

Essentially, we want all t1-t3 pairs where there is no t2 between them with a different XY. The LEFT OUTER JOIN looks for such t2 rows, and the IS NULL condition keeps only the t1-t3 pairs for which none exists.
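As a quick check of this step, the following sketch runs the two joins against the sample data, again using an in-memory SQLite database via Python's `sqlite3` module. SQLite, the integer MarketID, and the `loID`/`hiID` aliases are assumptions chosen for illustration.

```python
import sqlite3

# Same sample data as in the question; MarketID as an integer is an assumption.
conn = sqlite3.connect(":memory:")
conn.execute("create table Market_XY (MarketID integer primary key, XY text)")
conn.executemany(
    "insert into Market_XY values (?, ?)",
    [(1, "X"), (2, "X"), (3, "X"), (4, "Y"), (5, "Y"), (6, "X"), (7, "X")],
)

# Keep only the t1-t3 pairs that have no row with a different XY between them.
runs = conn.execute("""
    select t1.MarketID as loID
         , t3.MarketID as hiID
      from Market_XY as t1
    inner
      join Market_XY as t3
        on t3.MarketID > t1.MarketID
       and t3.XY = t1.XY
    left outer
      join Market_XY as t2
        on t2.MarketID between t1.MarketID and t3.MarketID
       and t2.XY <> t3.XY
     where t2.MarketID is null
    order
        by loID
         , hiID
""").fetchall()

print(runs)  # [(1, 2), (1, 3), (2, 3), (4, 5), (6, 7)]
```

Pairs such as (1, 6) are gone because the Ys at 4 and 5 sit between them. What is left is every unbroken stretch of two or more rows, including the shorter stretches nested inside (1, 3).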

But obviously the runs will overlap. In the sample data, notice that the run (1,2,3) of Xs has the runs (1,2) and (2,3) inside it. What remains now is simply to take the longest runs. This involves careful grouping and the use of both MIN() and MAX().

```
select XY
     , hiID - min(loID) + 1 as Count
  from (
       select t1.MarketID as loID
            , t1.XY
            , max(t3.MarketID) as hiID
         from Market_XY as t1
       inner
         join Market_XY as t3
           on t3.MarketID > t1.MarketID
          and t3.XY = t1.XY
       left outer
         join Market_XY as t2
           on t2.MarketID
              between t1.MarketID
                  and t3.MarketID
          and t2.XY <> t3.XY
        where t2.MarketID is null
       group
           by t1.MarketID
            , t1.XY
       ) as hiIDs
group
    by XY
     , hiID
order
    by hiID
```

It looks tricky but it really isn't. To understand it, run the subquery alone, without the MAX() or GROUP BY, but showing the t3 columns. Then see what happens with the MAX() and GROUP BY. Then apply the outer query with its GROUP BY and MIN(). Note that MAX - MIN + 1 gives the count for the run, provided the MarketID values are consecutive numbers with no gaps, as they are in the sample data.
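The sketch below runs the complete query against the sample data, and then again with one extra row appended, to show how it behaves at the edges. It uses an in-memory SQLite database through Python's `sqlite3` module. The helper `run_lengths`, the alias `RunLength` (in place of `Count`), the integer MarketID, and the extra row (8, "Y") are all assumptions introduced for illustration and are not part of the original question.

```python
import sqlite3

# The full self-join query, with the result column renamed to RunLength.
QUERY = """
    select XY
         , hiID - min(loID) + 1 as RunLength
      from (
           select t1.MarketID as loID
                , t1.XY as XY
                , max(t3.MarketID) as hiID
             from Market_XY as t1
           inner
             join Market_XY as t3
               on t3.MarketID > t1.MarketID
              and t3.XY = t1.XY
           left outer
             join Market_XY as t2
               on t2.MarketID between t1.MarketID and t3.MarketID
              and t2.XY <> t3.XY
            where t2.MarketID is null
           group
               by t1.MarketID
                , t1.XY
           ) as hiIDs
    group
        by XY
         , hiID
    order
        by hiID
"""

def run_lengths(rows):
    """Load rows into a fresh in-memory table and return the query result."""
    conn = sqlite3.connect(":memory:")
    conn.execute("create table Market_XY (MarketID integer primary key, XY text)")
    conn.executemany("insert into Market_XY values (?, ?)", rows)
    result = conn.execute(QUERY).fetchall()
    conn.close()
    return result

sample = [(1, "X"), (2, "X"), (3, "X"), (4, "Y"), (5, "Y"), (6, "X"), (7, "X")]

print(run_lengths(sample))               # [('X', 3), ('Y', 2), ('X', 2)]
print(run_lengths(sample + [(8, "Y")]))  # [('X', 3), ('Y', 2), ('X', 2)]
```

The second call returns the same three rows even though a lone Y was appended. Because the inner join requires a t3 with a higher ID than t1, a run consisting of a single row never produces a pair and so does not appear in the output. Whether that matters depends on the data.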

P.S. If anyone has an analytic SQL solution for this, please do send it in. I would love to see it.
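One possible analytic formulation, offered as a sketch rather than as part of the original answer, subtracts two ROW_NUMBER() values: one numbered over the whole table and one numbered within each XY value. Inside an unbroken run both numbers advance together, so their difference stays constant and can serve as a grouping key. The example runs on an in-memory SQLite database via Python's `sqlite3` module and assumes a SQLite build with window function support (version 3.25 or later). The helper `runs_analytic` and the names `grp`, `numbered`, and `RunLength` are hypothetical, and MarketID is again assumed to be an integer.

```python
import sqlite3

# Difference of two row numbers is constant within each run of equal XY values.
ANALYTIC_QUERY = """
    select XY
         , count(*) as RunLength
      from (
           select MarketID
                , XY
                , row_number() over (order by MarketID)
                - row_number() over (partition by XY order by MarketID) as grp
             from Market_XY
           ) as numbered
    group
        by XY
         , grp
    order
        by min(MarketID)
"""

def runs_analytic(rows):
    """Load rows into a fresh in-memory table and return (XY, RunLength) per run."""
    conn = sqlite3.connect(":memory:")
    conn.execute("create table Market_XY (MarketID integer primary key, XY text)")
    conn.executemany("insert into Market_XY values (?, ?)", rows)
    result = conn.execute(ANALYTIC_QUERY).fetchall()
    conn.close()
    return result

sample = [(1, "X"), (2, "X"), (3, "X"), (4, "Y"), (5, "Y"), (6, "X"), (7, "X")]

print(runs_analytic(sample))               # [('X', 3), ('Y', 2), ('X', 2)]
print(runs_analytic(sample + [(8, "Y")]))  # [('X', 3), ('Y', 2), ('X', 2), ('Y', 1)]
```

Because this version counts rows instead of subtracting IDs, it does not depend on the MarketID values being consecutive, and it reports single-row runs. If MarketID really is a character column, the ORDER BY would sort it as text, so "10" would come before "2" unless the values are cast or zero-padded.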

This was last published in March 2007
