Numerous analytical functions in SQL enable us to carry out intricate computations and data analysis jobs. SQL window functions called LEAD and LAG let us access data from other rows that are part of the same resultant row. They are frequently utilized in conjunction with the OVER clause, which specifies how the result set is divided and arranged. The value for the LAG function comes from a row that comes before the current one, whereas the value for the LEAD function comes from a row that comes after the current one.
The offset parameter, which indicates how many rows to advance or retract from the current row, is accepted by both functions. Let's examine the syntax, usage cases, and examples of LEAD and LAG to better understand how they work.
LEAD and LAG Functions Syntax
--LAEAD Function Syntax:
LEAD(column_name, offset, default_value) OVER (
PARTITION BY partition_expression
ORDER BY order_expression
)
--LAG Function Syntax:
LAG(column_name, offset, default_value) OVER (
PARTITION BY partition_expression
ORDER BY order_expression
)
- column_name: Name of the column from which you want to retrieve the value.
- offset: Number of rows to move forward (LEAD Function) or backward (LAG Function) from the current row. The default value is 1 and should be positive.
- default_value (optional): Default value to return if the lead or lag value is NULL or if there is no subsequent row within the partition.
- PARTITION BY (optional): Divides the result set into partitions based on the specified expression.
- ORDER BY: Specifies the ordering of rows within each partition.
Lets take an example for LEAD and LAG functions.create a table named sales with the following structure:
-- Create salas table
CREATE TABLE sales (
product VARCHAR(50),
year INT,
sales_amount DECIMAL(10,2)
);
-- Add some dummy data into the table
INSERT INTO sales (product, year, sales_amount) VALUES
('Apples', 2021, 1500.50),
('Bananas', 2021, 2500.75),
('Carrots', 2021, 3200.00),
('Apples', 2022, 1700.30),
('Bananas', 2022, 2900.20),
('Carrots', 2022, 3400.60),
('Apples', 2023, 1800.00),
('Bananas', 2023, 3100.45),
('Carrots', 2023, 3600.80),
('Oranges', 2021, 1100.25),
('Oranges', 2022, 1300.50),
('Oranges', 2023, 1400.75),
('Tomatoes', 2021, 1200.00),
('Tomatoes', 2022, 1500.35),
('Tomatoes', 2023, 1600.90);
Retrieving the next product's sales amount using LEAD function.
In above Query , we use the LEAD function to retrieve the sales amount of the next product in the same year. If there is no subsequent product, the default_value of 0 is returned.
Retrieving the prev product's sales amount using LAG function
In above Query , we use the LEAD function to retrieve the sales amount of the next product in the same year. If there is no subsequent product, the default_value of 0 is returned.
Use Case
Suppose we want to analyze the sales data for each product, not only by comparing the current year's sales with the previous year but also by looking forward to the next year's sales. We can achieve this by combining the LEAD and LAG functions in a single query.
SELECT product, year, sales_amount,
sales_amount - LAG(sales_amount, 1) OVER (
PARTITION BY product
ORDER BY year
) AS year_over_year_diff,
LEAD(sales_amount, 1, 0) OVER (PARTITION BY product
ORDER BY year) - sales_amount AS next_year_diff
FROM sales ORDER BY product, year;
-- Sample O/P
+----------+------+---------------+------------------+----------------+
| product | year | sales_amount | year_over_year_diff | next_year_diff |
+----------+------+---------------+------------------+----------------+
| Apples | 2021 | 1500.5 | NULL | 199.8 |
| Apples | 2022 | 1700.3 | 199.8 | 99.7 |
| Apples | 2023 | 1800.0| 99.7 | -1800.0 |
| Bananas | 2021 | 2500.75| NULL | 399.45|
| Bananas | 2022 | 2900.2 | 399.45| 200.25|
| Bananas | 2023 | 3100.45| 200.25| -3100.45|
| Carrots | 2021 | 3200.0| NULL | 200.6 |
| ..... ... .... ..... ....... ...
+----------+------+---------------+------------------+----------------+
We use the LAG Functions to calculate the sales difference between the current and prior years by deducting the former year's sales figure from the latter. We are computing the difference between the sales amount for the current year and the sales amount for the upcoming year in the LEAD Functions. We may evaluate the sales data for each product by comparing the year-over-year difference and the anticipated difference for the following year by putting both expressions in the same query.
HostForLIFEASP.NET SQL Server 2022 Hosting
