ASSIGNMENT - 5, G0_STP_8113
- Load the IMDb Dataset and read
import numpy as np
import pandas as pd
df = pd.read_csv('IMDB-Movie-Data.csv')
print(df)
2. View the dataset
print(df.head(12))
3. Understand some basic information about the dataset and Inspect the dataframe Inspect the dataframe's columns, shapes, variable types etc.
print(df.columns)
print(df.shape)
print(df.dtypes)
4. Data Selection – Indexing and Slicing data
print(df.iloc[0])
print(df[1:3])
5. Data Selection – Based on Conditional filtering
df['Votes']>8
6. Groupby operations
print(df.groupby(['Rating', 'Votes']).groups)
print(df.groupby('Votes').groups)
grouped = df.groupby('Year')
print(grouped.get_group(2014))
7. Sorting operation
x = df.sort_values(by='Revenue (Millions)')
print(x)
8. Dealing with missing values
print(df.isnull())
9. Dropping columns and null values
df = df.dropna(axis=1)
print(df)
10. Apply( ) functions
y=df.apply(lambda x: [1, 2], axis=1)
print(y)
Comments
Post a Comment