ASSIGNMENT - 5, G0_STP_8113


  1. Load the IMDb Dataset and read

import numpy as np
import pandas as pd
df = pd.read_csv('IMDB-Movie-Data.csv')
print(df)

2. View the dataset

print(df.head(12))

3. Understand some basic information about the dataset and Inspect the dataframe Inspect the dataframe's columns, shapes, variable types etc.


print(df.columns)
print(df.shape)
print(df.dtypes)

4. Data Selection – Indexing and Slicing data

print(df.iloc[0])
print(df[1:3])

5. Data Selection – Based on Conditional filtering

df['Votes']>8

6. Groupby operations

print(df.groupby(['Rating''Votes']).groups)
print(df.groupby('Votes').groups)
grouped = df.groupby('Year')
print(grouped.get_group(2014))

7. Sorting operation

x = df.sort_values(by='Revenue (Millions)')
print(x)

8. Dealing with missing values

print(df.isnull())


9. Dropping columns and null values

df = df.dropna(axis=1)
print(df)

10. Apply( ) functions

y=df.apply(lambda x: [12], axis=1)
print(y)














Comments

Popular posts from this blog

Data Science Matplotlib Library Data Visualization, ASSIGNMENT - 6, GO_STP_8113