
# Weighted percentiles, part 1

From this table:

```
SELECT    item_id
,         units
,         price
FROM      items
ORDER BY  price
;

   ITEM_ID      UNITS      PRICE
---------- ---------- ----------
         1          4       1.99
         3          6       2.29
         4         11       2.49
         2          1       2.99
...
```

...and so on, I can find the average price, min, max, etc. What I need to know is the fifth percentile and the 95th percentile of the price data, weighted by units. I have noticed that there are functions like percentile_disc. But my problem is that I have to weight the price by the units sold.

Do you have any suggestions in Oracle SQL?

Let's assume your data consists of just the four rows shown above. To restate the problem, you want to find the price such that 5% of the units sold had a lower (or equal) price, and the price such that 95% of the units sold had a lower (or equal) price. Since the total number of units is 22 (4 + 6 + 11 + 1), you want the prices of the "1.1st" unit (.05 * 22 = 1.1) and the "20.9th" unit (.95 * 22 = 20.9), counting units in order of price.
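To follow along, the four sample rows can be loaded into a scratch table. This is only a sketch: the column names come from the question, but the data types and constraints are assumptions, since the original table definition is not shown. The last statement computes the two target positions described above.

```sql
-- Hypothetical setup: data types and constraints are assumed,
-- not taken from the original table definition.
CREATE TABLE items
(    item_id  NUMBER         PRIMARY KEY
,    units    NUMBER         NOT NULL
,    price    NUMBER (8, 2)  NOT NULL
);

INSERT INTO items (item_id, units, price) VALUES (1,  4, 1.99);
INSERT INTO items (item_id, units, price) VALUES (2,  1, 2.99);
INSERT INTO items (item_id, units, price) VALUES (3,  6, 2.29);
INSERT INTO items (item_id, units, price) VALUES (4, 11, 2.49);
COMMIT;

-- Positions of the units whose prices are wanted
SELECT  SUM (units)        AS grand_total   -- 22
,       .05 * SUM (units)  AS unit_05       -- 1.1
,       .95 * SUM (units)  AS unit_95       -- 20.9
FROM    items
;
```

Counting units in price order, units 1 through 4 sold at 1.99, units 5 through 10 at 2.29, units 11 through 21 at 2.49 and unit 22 at 2.99, so position 1.1 lands on the 1.99 row and position 20.9 on the 2.49 row.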

The function percentile_disc was introduced in Oracle 9i to answer questions just like yours on unweighted data. Unfortunately, like you, I don't see how it can be used on weighted data. There is, however, an analytic function, sum, that can help by telling you, for each row of your table, how many units were sold at a lower (or equal) price. With that running total and the grand total of the units sold, you can compute the percentiles.

```
SELECT    item_id            -- Query 1
,         units
,         price
,         SUM (units)
              OVER  (ORDER BY price
                     RANGE UNBOUNDED PRECEDING)
                         AS running_total
,         (
              SELECT  SUM (units)
              FROM    items
          )              AS grand_total
FROM      items
ORDER BY  price
;

   ITEM_ID      UNITS      PRICE RUNNING_TOTAL GRAND_TOTAL
---------- ---------- ---------- ------------- -----------
         1          4       1.99             4          22
         3          6       2.29            10          22
         4         11       2.49            21          22
         2          1       2.99            22          22
```

You can query the result set above to get the information you need, one percentile at a time, as shown below. Notice that the in-line view (the subquery) is the same as Query 1 shown above, except that it has no ORDER BY clause.

```
SELECT  price   AS price_05          -- Query 2
FROM    (   --  begin Sub-query 2a, same as Query 1
            SELECT    item_id
            ,         units
            ,         price
            ,         SUM (units)
                          OVER  (ORDER BY price
                                 RANGE UNBOUNDED PRECEDING)
                                     AS running_total
            ,         (
                          SELECT  SUM (units)
                          FROM    items
                      )              AS grand_total
            FROM      items
        )   --  end sub-query 2a
WHERE   (.05 * grand_total)  BETWEEN  (running_total - units)
                             AND      running_total
;

  PRICE_05
----------
      1.99
```
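The 95th percentile follows the same pattern; only the multiplier changes. The sketch below assumes the four sample rows and reuses the running-total subquery from Query 1 without its ORDER BY clause. The alias price_95 and the unweighted comparison at the end are illustrative additions, not part of the queries above.

```sql
-- Weighted 95th percentile: the row whose span of units,
-- (running_total - units) through running_total, contains unit 20.9
SELECT  price   AS price_95
FROM    (
            SELECT    units
            ,         price
            ,         SUM (units)
                          OVER  (ORDER BY price
                                 RANGE UNBOUNDED PRECEDING)
                                     AS running_total
            ,         (
                          SELECT  SUM (units)
                          FROM    items
                      )              AS grand_total
            FROM      items
        )
WHERE   (.95 * grand_total)  BETWEEN  (running_total - units)
                             AND      running_total
;

-- For comparison only: the unweighted 95th percentile of the same rows
SELECT  PERCENTILE_DISC (.95)
            WITHIN GROUP (ORDER BY price)   AS unweighted_95
FROM    items
;
```

With the sample data, .95 * 22 = 20.9 falls between 10 and 21, so the first query returns 2.49 (item 4). The unweighted query returns 2.99, the most expensive item, even though it accounts for only 1 of the 22 units sold. Note that BETWEEN includes both endpoints, so if the target position happened to equal a running total exactly, two adjacent rows would qualify and you would need a tie-breaking rule.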
