PostgreSQL HAVING Clause



PostgreSQL HAVING Clause

The PostgreSQL HAVING clause is used to filter groups of rows created by the GROUP BY clause based on a specified condition. This clause is essential for performing aggregate filtering on grouped data.


Syntax

SELECT column1, aggregate_function(column2)
FROM table_name
GROUP BY column1
HAVING condition;

The HAVING clause has the following components:

  • column1, column2, ...: The columns to retrieve from the table.
  • aggregate_function(column2): The aggregate function to apply to each group.
  • table_name: The name of the table from which to retrieve data.
  • condition: The condition to filter groups.

Example PostgreSQL HAVING Clause Queries

Let's look at some examples of PostgreSQL HAVING clause queries:

1. Basic HAVING Clause Example

SELECT department_id, COUNT(*) AS employee_count
FROM employees
GROUP BY department_id
HAVING COUNT(*) > 10;

This query retrieves the department_id and the count of employees in each department from the employees table. It groups the results by department_id and filters to include only departments with more than 10 employees.

2. HAVING with Multiple Conditions

SELECT department_id, AVG(salary) AS average_salary
FROM employees
GROUP BY department_id
HAVING AVG(salary) > 50000 AND COUNT(*) > 5;

This query retrieves the department_id and the average salary of employees in each department from the employees table. It groups the results by department_id and filters to include only departments with an average salary greater than 50000 and more than 5 employees.

3. HAVING with Aggregate Function

SELECT department_id, SUM(salary) AS total_salary
FROM employees
GROUP BY department_id
HAVING SUM(salary) > 200000;

This query retrieves the department_id and the total salary of employees in each department from the employees table. It groups the results by department_id and filters to include only departments with a total salary greater than 200000.

4. HAVING with ORDER BY Clause

SELECT department_id, COUNT(*) AS employee_count
FROM employees
GROUP BY department_id
HAVING COUNT(*) > 10
ORDER BY employee_count DESC;

This query retrieves the department_id and the count of employees in each department from the employees table. It groups the results by department_id, filters to include only departments with more than 10 employees, and sorts the results by employee_count in descending order.


Full Example

Let's go through a complete example that includes creating a table, inserting data, and using the HAVING clause to filter grouped data.

Step 1: Creating a Table

This step involves creating a new table named employees to store employee data.

CREATE TABLE employees (
    id SERIAL PRIMARY KEY,
    first_name VARCHAR(50),
    last_name VARCHAR(50),
    email VARCHAR(100),
    salary NUMERIC(10, 2),
    department_id INT
);

In this example, we create a table named employees with columns for id, first_name, last_name, email, salary, and department_id.

PostgreSQL HAVING

Step 2: Inserting Data into the Table

This step involves inserting some sample data into the employees table.

INSERT INTO employees (first_name, last_name, email, salary, department_id)
VALUES ('John', 'Doe', 'john.doe@example.com', 50000, 1);

INSERT INTO employees (first_name, last_name, email, salary, department_id)
VALUES ('Jane', 'Smith', 'jane.smith@example.com', 60000, 2);

INSERT INTO employees (first_name, last_name, email, salary, department_id)
VALUES ('Jim', 'Brown', 'jim.brown@example.com', 55000, 1);

INSERT INTO employees (first_name, last_name, email, salary, department_id)
VALUES ('Emily', 'Jones', 'emily.jones@example.com', 70000, 2);

Here, we insert data into the employees table.

PostgreSQL HAVING

Step 3: Filtering Grouped Data with HAVING

This step involves using the HAVING clause to filter grouped data in the employees table.

-- Basic HAVING Clause


SELECT department_id, COUNT(*) AS employee_count
FROM employees
GROUP BY department_id
HAVING COUNT(*) > 1;
PostgreSQL HAVING

-- HAVING with Multiple Conditions


SELECT department_id, AVG(salary) AS average_salary
FROM employees
GROUP BY department_id
HAVING AVG(salary) > 50000 AND COUNT(*) > 1;
PostgreSQL HAVING

-- HAVING with Aggregate Function


SELECT department_id, SUM(salary) AS total_salary
FROM employees
GROUP BY department_id
HAVING SUM(salary) > 100000;
PostgreSQL HAVING

-- HAVING with ORDER BY Clause


SELECT department_id, COUNT(*) AS employee_count
FROM employees
GROUP BY department_id
HAVING COUNT(*) > 1
ORDER BY employee_count DESC;
PostgreSQL HAVING

These queries demonstrate how to use the HAVING clause to filter grouped data in the employees table, including using basic conditions, multiple conditions, aggregate functions, and sorting grouped results with the ORDER BY clause.

Conclusion

The PostgreSQL HAVING clause is a fundamental tool for filtering groups of rows created by the GROUP BY clause based on specified conditions. Understanding how to use the HAVING clause and its syntax is essential for effective data analysis and reporting in PostgreSQL databases.