- SQL Basics
- SQL Introduction
- SQL Syntax
- SQL Sample Database
- SQL SELECT
- SQL WHERE
- SQL ORDER BY
- SQL DISTINCT
- SQL LIMIT
- SQL FETCH
- SQL AND
- SQL OR
- SQL BETWEEN
- SQL IN
- SQL LIKE
- SQL IS NULL
- SQL Comparison Operators
- SQL Logical Operators
- SQL Alias
- SQL CASE
- Joins and Subqueries
- SQL INNER JOIN
- SQL LEFT JOIN
- SQL RIGHT JOIN
- SQL FULL OUTER JOIN
- SQL SELF JOIN
- SQL CROSS JOIN
- SQL Subquery
- SQL Correlated Subquery
- SQL UNION
- SQL INTERSECT
- SQL EXCEPT
- Aggregate Functions
- SQL AVG
- SQL COUNT
- SQL MAX
- SQL MIN
- SQL SUM
- SQL GROUP BY
- SQL HAVING
- SQL ROLLUP
- SQL CUBE
- SQL GROUPING SETS
- Database Management
- SQL CREATE DATABASE
- SQL ALTER DATABASE
- SQL DROP DATABASE
- SQL BACKUP DATABASE
- SQL SHOW DATABASES
- SQL SELECT DATABASE
- Table Management
- SQL CREATE TABLE
- SQL ALTER TABLE
- SQL ADD COLUMN
- SQL DROP COLUMN
- SQL DROP TABLE
- SQL TRUNCATE TABLE
- SQL SHOW TABLES
- SQL RENAME TABLE
- SQL Constraints
- SQL Primary Key
- SQL Foreign Key
- SQL UNIQUE Constraint
- SQL CHECK Constraint
- SQL NOT NULL Constraint
SQL GROUP BY
The GROUP BY statement in SQL is used to arrange identical data into groups. It is often used with aggregate functions such as COUNT()
, SUM()
, AVG()
, MAX()
, and MIN()
to perform calculations on each group of data.
Syntax of SQL GROUP BY
SELECT column_name, aggregate_function(column_name)
FROM table_name
WHERE condition
GROUP BY column_name;
- column_name: The column(s) by which the data will be grouped.
- aggregate_function(): An aggregate function applied to each group.
- table_name: The table containing the data.
- condition: Optional filter to apply before grouping.
Key Points
- The GROUP BY clause follows the
WHERE
clause but precedes theORDER BY
clause. - Each column in the
SELECT
statement must be included in either the GROUP BY clause or an aggregate function. - It groups rows that have the same values in specified columns.
Example of SQL GROUP BY
Consider the following table:
Table: Orders
To calculate the total quantity ordered by each customer:
SELECT customer_name, SUM(quantity) AS total_quantity
FROM orders
GROUP BY customer_name;
Output:
customer_name | total_quantity |
---|---|
Alice | 8 |
Bob | 6 |
Charlie | 7 |
Explanation:
- The GROUP BY customer_name groups rows based on the
customer_name
column, and the SUM(quantity) calculates the total quantity for each customer.
SQL GROUP BY with Multiple Columns
You can group data by multiple columns to create more specific groups.
Example: Total quantity ordered by each customer for each product:
SELECT customer_name, product_name, SUM(quantity) AS total_quantity
FROM orders
GROUP BY customer_name, product_name;
Output:
customer_name | product_name | total_quantity |
---|---|---|
Alice | Apple | 3 |
Alice | Banana | 5 |
Bob | Apple | 2 |
Bob | Cherry | 4 |
Charlie | Banana | 7 |
Explanation:
- The GROUP BY customer_name, product_name creates groups for each combination of
customer_name
andproduct_name
.
SQL GROUP BY with HAVING Clause
The HAVING clause is used to filter grouped data, similar to how the WHERE
clause filters rows.
Example: Show customers who ordered more than 5 items in total:
SELECT customer_name, SUM(quantity) AS total_quantity
FROM orders
GROUP BY customer_name
HAVING SUM(quantity) > 5;
Output:
customer_name | total_quantity |
---|---|
Alice | 8 |
Charlie | 7 |
Explanation:
- The HAVING SUM(quantity) > 5 filters out groups where the total quantity is 5 or less.
SQL GROUP BY with Aggregate Functions
GROUP BY is commonly used with aggregate functions to summarize data.
Example: Total quantity and average quantity ordered for each product:
SELECT product_name, SUM(quantity) AS total_quantity, AVG(quantity) AS avg_quantity
FROM orders
GROUP BY product_name;
Output:
product_name | total_quantity | avg_quantity |
---|---|---|
Apple | 5 | 2.5 |
Banana | 12 | 6.0 |
Cherry | 4 | 4.0 |
Summary
The GROUP BY clause is a powerful feature in SQL for organizing data into meaningful groups. It allows you to perform aggregate calculations for each group, and combined with the HAVING clause, it enables filtering based on group conditions. Use it to derive insights and summaries from large datasets.