Q
Problem solve Get help with specific problems with your technologies, process and projects.

Groups that have over 50% females

From a table, I want to select the site_ids that have a majority (over 50%) females.

I have a table:

```site_id  gender  client_id
1        M       1a
1        F       1b
1        F       1c
2        F       2a
2        M       2b
3        M       3a```

1. I want to select the site_ids that have a majority (over 50%) of female gender.

2. Alternatively, select a list of client_ids corresponding to all the site_ids selected above.

The first query is pretty straightforward. We simply do a GROUP BY on the site_id, and count how many females as compared to the overall count:

```select site_id
from daTable
group
by site_id
having 100.0
* count(case when gender='F'
then 937 end)
/ count(*) > 50.0```

You might be wondering why the HAVING clause uses 100.0 in its calculation. This is because counts are integers, and whenever an arithmetic calculation involves only integers, the result will be an integer. So without the 100.0, which is a decimal, the result of the division would be either 0 or 1. With the 100.0, the calculation changes to a decimal calculation, and can be compared to a percentage.

Next, you may wonder what the 937 is doing. Fair question, and the use of that value is, admittedly, a wee red herring. The point is, 937 is not NULL. Notice that in the CASE expression, there is no ELSE, which means that, by default, the ELSE value is NULL. So the CASE expression evaluates to a non-NULL value for females, and NULL for males. Now, recall that aggregate functions like COUNT ignore NULLs, and the solution becomes clear.

Sometimes you will also see the partial count implemented like this:

```having 100.0
* sum(case when gender='F'
then 1 else 0 end)
/ count(*) > 50.0```

Here SUM is used instead of COUNT, and the CASE returns 1s and 0s instead. It would also be okay to omit the "ELSE 0" here.

As for your second question, you probably meant to say "additionally" instead of "alternatively." The answer is:

```select distinct client_id
from daTable
where site_id in
( first query goes here )```
This was last published in January 2007

Content

Find more PRO+ content and other member only offers, here.

Have a question for an expert?

Get answers from a TechTarget expert on whatever's puzzling you.

You will be able to add details on the next page.

Start the conversation

Send me notifications when other members comment.

SearchDataManagement

• Streaming tool from StreamSets eyes data in motion for GDPR

StreamSets software for inspecting big data brings governance to data in motion. Such capabilities may find more use as the ...

• Data expert: GDPR deadline is an opportunity, not a burden

There is stress as the EU's General Data Protection Regulation compliance deadline nears, but the GDPR privacy movement is a good...

• Google TPUs open up on cloud; LinkedIn intros Hadoop Dynamometer

In big data news, we find Google TPUs, or Tensor Processing Units, offered as a cloud service, while LinkedIn is open sourcing a ...

• Rethinking analytics processes spurs enterprise innovation

By taking a fresh look at the makeup of their analytics organizations, enterprises can innovate their business models and take ...

• Diversified data sets for analytics deliver top results

Analytics teams should focus on data diversity to ensure that their projects deliver the most meaningful insights -- but they ...

• How to boost the value of BI in today's analytics landscape

Traditional BI reporting still gives businesses valuable information. But its value can be increased by incorporating it into a ...

SearchSAP

• SAP Ariba Live focuses on procurement for purpose

SAP Ariba Live 2018 focused on the idea that businesses can use procurement technology to do good in the world; for example, by ...

• SAP debuts consumption-based pricing model for SAP Cloud

SAP Cloud Platform is now available as a consumption-based model, an alternative to the subscription model. SAP also updated the ...

• SAP buys lead-to-money leader CallidusCloud to take on Salesforce

SAP paid \$2.4 billion to acquire lead-to-money vendor CallidusCloud, and analysts agree that the significant price may be worth ...

SearchSQLServer

• Microsoft SQL Operations Studio eases SQL Server admin tasks

SQL Operations Studio simplifies routine administration of SQL Server and Azure SQL databases, making database development and ...

• Meltdown and Spectre fixes eyed for SQL Server performance issues

Microsoft has responded to the Spectre and Meltdown chip vulnerabilities with patches and other fixes. But IT teams need to sort ...

• Five SQL Server maintenance steps you should take -- ASAP

Putting off SQL Server administration tasks can lead to database problems. Enact these often-neglected maintenance items to help ...

TheServerSide.com

• Why the Waterfall or Agile debate will be around forever

Which is the right methodology to use for your project: Waterfall or Agile? The industry may be at peak Agile, as the ...

• Chef's InSpec 2.0 brings compliance automation to the cloud

Enterprises have been quick to adopt automation tools for development and deployment but only recently have organizations started...

• Application security vulnerabilities are often known exploits

How hard is it to secure an enterprise application? It's not hard, especially given the fact that most application security ...

SearchDataCenter

• IBM cloud services to secure mainframes out to the edge

Big Blue will introduce IBM cloud services that use blockchain, containers and its z14 mainframes to deliver improved security ...

• Four disadvantages of hyper-converged infrastructure systems

Problems with scalability and unexpected licensing costs can create problems for organizations that deploy hyper-converged ...

• IBM Power9 servers seek market inroads to AI, cloud

IBM follows up its first Power9 server with a raft of systems designed to appeal to a wider array of markets -- most notably, AI ...

SearchContentManagement

• Scrivito unveils serverless CMS product

By building the CMS with ReactJS, Scrivito gained attraction with development community, according to an analyst.

• Content personalization tools sharpen focus on customers

Content personalization isn't new; Amazon weaponized it, and Jeff Bezos is the world's richest man. New tools are putting it ...

• Leading brands see the need for personalized content

Content personalization continues to expand within companies as maturing technologies make it a viable marketing option for ...

SearchHRSoftware

• How people analytics can improve HR effectiveness

Getting insight into your workforce can reveal everything from training issues to the reasons for turnover or missed corporate ...

• At Ceridian, role of CIO requires constant learning, adjusting

You might say Warren Perlman, CIO at Ceridian, a global HCM software company, has been preparing for the role of CIO all his life...

• Human resources help desk multiplies HR staff efficiency

Here's how a college used a human resources help desk to better serve its 5,000-plus faculty and staff and enabled HR to focus on...

Close