# Importing necessary libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

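# Load the dataset; the CSV is expected in the working directory
df = pd.read_csv("StudentsPerformance.csv")

# Preview the first five rows
df.head()

# Summary statistics for the numeric score columns
df.describe()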

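# Bar chart of value frequencies for every column
for i in df.columns:
    df[i].value_counts().plot(kind='bar')
    plt.title(i)
    plt.show()

# Histograms of the numeric columns, one subplot per column
df.plot(subplots=True, kind='hist')

plt.tight_layout()
plt.show()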

   gender  race/ethnicity  parental level of education  lunch         test preparation course  math score  reading score  writing score
0  female  group B         bachelor's degree            standard      none                     72          72             74
1  female  group C         some college                 standard      completed                69          90             88
2  female  group B         master's degree              standard      none                     90          95             93
3  male    group A         associate's degree           free/reduced  none                     47          57             44
4  male    group C         some college                 standard      none                     76          78             75

Welcome to the analysis of student performance

Data exploration phase
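
One way to start the exploration is to check each column's type, missing values, and number of distinct values before plotting. The following is a minimal sketch, not part of the original notebook: `summarize_columns` is a hypothetical helper, and the inline `sample` frame simply reuses the five rows shown in the `df.head()` output so that the snippet runs without the CSV file.

```python
import pandas as pd

# Five rows copied from the df.head() output; a stand-in for StudentsPerformance.csv.
sample = pd.DataFrame({
    "gender": ["female", "female", "female", "male", "male"],
    "race/ethnicity": ["group B", "group C", "group B", "group A", "group C"],
    "parental level of education": [
        "bachelor's degree", "some college", "master's degree",
        "associate's degree", "some college",
    ],
    "lunch": ["standard", "standard", "standard", "free/reduced", "standard"],
    "test preparation course": ["none", "completed", "none", "none", "none"],
    "math score": [72, 69, 90, 47, 76],
    "reading score": [72, 90, 95, 57, 78],
    "writing score": [74, 88, 93, 44, 75],
})


def summarize_columns(df):
    """Hypothetical helper: dtype, missing count, and distinct count per column."""
    return pd.DataFrame({
        "dtype": df.dtypes.astype(str),
        "missing": df.isna().sum(),
        "unique": df.nunique(),
    })


summary = summarize_columns(sample)
print(summary)

# Split columns by type so bar charts and histograms can each target suitable columns.
numeric_cols = sample.select_dtypes(include="number").columns.tolist()
categorical_cols = sample.select_dtypes(exclude="number").columns.tolist()
```

On the full dataset, the same calls could be applied to `df` instead of `sample`; plotting `value_counts()` only for `categorical_cols` would keep the bar charts from being cluttered by the many distinct score values.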