Pandas DataFrame.items
Pandas DataFrame.items
The DataFrame.items method in pandas is used to iterate over a DataFrame's columns as (column_name, column_data) pairs. This method is particularly useful for iterating through and processing each column individually.
Syntax
The syntax for DataFrame.items is:
DataFrame.items()Here, DataFrame refers to the pandas DataFrame whose columns are being iterated over.
Returns
Generates pairs of:
column_name: The name of the column (string).column_data: The column data as a pandas Series.
Examples
Iterating Over Columns
Use items to iterate over all columns in a DataFrame and access their names and data.
Python Program
import pandas as pd
# Create a DataFrame
data = {
'Name': ['Arjun', 'Ram', 'Priya'],
'Age': [25, 30, 35],
'Salary': [70000.5, 80000.0, 90000.0]
}
df = pd.DataFrame(data)
# Iterate over columns using items()
print("Iterating over columns:")
for column_name, column_data in df.items():
print(f"Column Name: {column_name}")
print(f"Column Data:\n{column_data}\n")Output
Iterating over columns:
Column Name: Name
Column Data:
0 Arjun
1 Ram
2 Priya
Name: Name, dtype: object
Column Name: Age
Column Data:
0 25
1 30
2 35
Name: Age, dtype: int64
Column Name: Salary
Column Data:
0 70000.5
1 80000.0
2 90000.0
Name: Salary, dtype: float64Processing Each Column
Perform specific operations on each column while iterating using items.
Python Program
import pandas as pd
# Create a DataFrame
data = {
'Name': ['Arjun', 'Ram', 'Priya'],
'Age': [25, 30, 35],
'Salary': [70000.5, 80000.0, 90000.0]
}
df = pd.DataFrame(data)
# Calculate the mean for numeric columns
print("Calculating mean for numeric columns:")
for column_name, column_data in df.items():
if column_data.dtype in ['int64', 'float64']:
print(f"Mean of {column_name}: {column_data.mean()}")Output
Calculating mean for numeric columns:
Mean of Age: 30.0
Mean of Salary: 80000.16666666667Appending Data to a Dictionary
Convert each column to a dictionary using items.
Python Program
import pandas as pd
# Create a DataFrame
data = {
'Name': ['Arjun', 'Ram', 'Priya'],
'Age': [25, 30, 35],
'Salary': [70000.5, 80000.0, 90000.0]
}
df = pd.DataFrame(data)
# Convert columns to a dictionary
columns_dict = {}
for column_name, column_data in df.items():
columns_dict[column_name] = column_data.tolist()
print("Dictionary of columns:")
print(columns_dict)Output
Dictionary of columns:
{'Name': ['Arjun', 'Ram', 'Priya'], 'Age': [25, 30, 35], 'Salary': [70000.5, 80000.0, 90000.0]}Counting Null Values in Each Column
Use items to count the number of null values in each column.
Python Program
import pandas as pd
# Create a DataFrame with missing values
data = {
'Name': ['Arjun', 'Ram', None],
'Age': [25, 30, None],
'Salary': [70000.5, None, 90000.0]
}
df = pd.DataFrame(data)
# Count null values in each column
print("Counting null values in each column:")
for column_name, column_data in df.items():
print(f"{column_name}: {column_data.isnull().sum()} null values")Output
Counting null values in each column:
Name: 1 null values
Age: 1 null values
Salary: 1 null valuesSummary
In this tutorial, we explored the DataFrame.items method in pandas. Key takeaways include:
- Using
itemsto iterate over columns as (column_name, column_data) pairs. - Performing operations like calculations, data conversion, and null value counting for each column.
The DataFrame.items method is a powerful tool for iterating through and processing columns in a pandas DataFrame.