edaflow.impute_numerical_median

edaflow.impute_numerical_median(df, columns=None, inplace=False)[source]

Impute missing values in numerical columns using median values with rich formatting.

This function identifies numerical columns and fills missing values (NaN) with the median value of each column. It provides detailed reporting of the imputation process and handles edge cases safely.

Parameters:
  • df (pandas.DataFrame) – The DataFrame containing data to impute

  • columns (list, optional) – Specific columns to impute. If None, all numerical columns will be processed

  • inplace (bool, default False) – If True, modify the original DataFrame. If False, return a new DataFrame

Returns:

If inplace=False, returns the DataFrame with imputed values If inplace=True, returns None and modifies the original DataFrame

Return type:

pandas.DataFrame or None

Examples

>>> import pandas as pd
>>> import edaflow
>>>
>>> # Create sample data with missing values
>>> df = pd.DataFrame({
...     'age': [25, None, 35, None, 45],
...     'salary': [50000, 60000, None, 70000, None],
...     'name': ['Alice', 'Bob', 'Charlie', 'Diana', 'Eve']
... })
>>>
>>> # Impute all numerical columns
>>> df_imputed = edaflow.impute_numerical_median(df)
>>>
>>> # Impute specific columns only
>>> df_imputed = edaflow.impute_numerical_median(df, columns=['age'])
>>>
>>> # Impute in place
>>> edaflow.impute_numerical_median(df, inplace=True)