Pandas DataFrame.cummax: Cumulative Maximum in a DataFrame

Pandas DataFrame.cummax

The DataFrame.cummax method in pandas is used to compute the cumulative maximum of DataFrame values along a specified axis. This method is useful for analyzing trends where the maximum value needs to be tracked incrementally.

Syntax

The syntax for DataFrame.cummax is:

DataFrame.cummax(axis=None, skipna=True, *args, **kwargs)

Here, DataFrame refers to the pandas DataFrame on which the cumulative maximum operation is performed.

Parameters

Parameter	Description
`axis`	The axis along which the cumulative maximum is computed. Use `0` or `'index'` for columns, and `1` or `'columns'` for rows. Defaults to `None`.
`skipna`	Excludes `NaN` values when computing the cumulative maximum. If `False`, `NaN` values propagate. Defaults to `True`.
`args, *kwargs`	Additional arguments and keyword arguments to pass to the method.

Returns

A DataFrame with the cumulative maximum values computed along the specified axis.

Examples

Computing Cumulative Maximum Along DataFrame Columns

This example demonstrates how to compute the cumulative maximum along columns in a DataFrame.

Python Program

import pandas as pd

# Create a DataFrame
df = pd.DataFrame({
    'A': [1, 3, 2, 5],
    'B': [7, 2, 5, 1]
})

# Compute cumulative maximum along columns
result = df.cummax(axis=0)
print(result)

Output

Computing Cumulative Maximum Along DataFrame Rows

This example shows how to compute the cumulative maximum along rows in a DataFrame.

Python Program

import pandas as pd

# Create a DataFrame
df = pd.DataFrame({
    'A': [1, 3, 2],
    'B': [7, 2, 5],
    'C': [4, 6, 8]
})

# Compute cumulative maximum along rows
result = df.cummax(axis=1)
print(result)

Output

Handling Missing Values in a DataFrame

This example demonstrates how DataFrame.cummax handles missing values (NaN) when computing the cumulative maximum.

Python Program

import pandas as pd
import numpy as np

# Create a DataFrame with missing values
df = pd.DataFrame({
    'A': [1, np.nan, 3, 5],
    'B': [7, 2, np.nan, 1]
})

# Compute cumulative maximum while skipping NaN values
result = df.cummax()
print(result)

Output

     A    B
0  1.0  7.0
1  1.0  7.0
2  3.0  7.0
3  5.0  7.0

Summary

In this tutorial, we explored the DataFrame.cummax method in pandas. Key takeaways include:

Using cummax to compute cumulative maximum values along columns or rows in a DataFrame.
Handling missing values during cumulative maximum computation.
Applying the axis parameter to control the direction of computation.