Machine Learning
537 learners
Data Preprocessing for Predictive Modeling
Unveil how preprocessing refines data to make predictive models more effective. Learn to handle missing values, outliers and categorical variables, ensuring data consistency and integrity.
MatPlotLib
Pandas
Python
Scikit-learn
See path
5 lessons
24 practices
3 hours
Badge for Data Cleaning and Preprocessing,
Data Cleaning and Preprocessing
Lessons and practices
Describing California Housing Data
Navigating the Data Cosmos: Correlation Matrix Calculation Challenge
Exploring Room Count Correlation
Plotting the Data Distribution
Navigating the Stars: Creating a Correlation Matrix
Counting Missing Values in the Housing Market Dataset
Cleaning Real Estate Data by Listwise Deletion
Enhancing Data Integrity with Mean Imputation
Utilizing k-NN Imputation to Handle Missing Data
Crafting Indicator Columns for Missing Data Awareness
Outlier Treatment in Housing Data
Expanding the Frontier: Elevating z-score Outlier Detection
Adjusting Outlier Detection Sensitivity in Housing Data
Implementing z-score for Outlier Detection
Detecting Outliers with IQR in Housing Data
Mitigating Outlier Impact with Log Transformation
Exploring the Stars of the Housing Market
Expanding the Feature Selection Horizon
Navigating the Stars of Feature Selection
Unveiling the Most Influential Features
Scaling the Space-Time: Normalizing House Ages
Scaling the Stars: Normalization in the Housing Galaxy
Implementing Min-Max Scaling
Scaling Heights with Min-Max Normalization
Meet Cosmo:
The smartest AI guide in the universe
Our built-in AI guide and tutor, Cosmo, prompts you with challenges that are built just for you and unblocks you when you get stuck.
Sign up
Join the 1M+ learners on CodeSignal
Be a part of our community of 1M+ users who develop and demonstrate their skills on CodeSignal