Friday, June 3, 2022

AI/ML: Dataframe correlation analysis using heatmap

import pandas as pd

import numpy as np

import seaborn as sns

import matplotlib.pyplot as plt

df = pd.read_csv('train.csv')

# print(df.columns)

# df.corr()

df = df[['OverallQual', 'TotalBsmtSF', 'GarageArea', 'GarageCars','SalePrice']]

# print(df['GarageCars'].value_counts())

print(df.dtypes)

sns.heatmap(df.corr());


this is a very small example. the dataset is from Kaggle https://www.kaggle.com/c/house-prices-advanced-regression-techniques/data




No comments:

Post a Comment