edaflow.ml.configure_model_pipeline
- edaflow.ml.configure_model_pipeline(data_config: Dict[str, Any], numerical_strategy: str = 'standard', categorical_strategy: str = 'onehot', handle_missing: str = 'drop', verbose: bool = True) Pipeline[source]
Configure a preprocessing pipeline for the ML experiment.
Parameters:
- data_configDict[str, Any]
Configuration dictionary from setup_ml_experiment
- numerical_strategystr, default=’standard’
Scaling strategy for numerical features (‘standard’, ‘minmax’, ‘robust’, ‘none’)
- categorical_strategystr, default=’onehot’
Encoding strategy for categorical features (‘onehot’, ‘target’, ‘none’)
- handle_missingstr, default=’drop’
Missing value strategy (‘drop’, ‘impute’, ‘flag’)
- verbosebool, default=True
Whether to print pipeline configuration details
Returns:
- Pipeline
Configured sklearn Pipeline for preprocessing