Francis Burnet – AI Engineering Portfolio

Capstone portfolio spanning AI engineering, applied data science, machine learning, and deep learning.

Francis Burnet headshot

Capstone 5 Evidence Map

Capstone 5 evidence image
Capstone Summary

This document outlines a capstone project within the Microsoft AI Engineering Program 2026, centered on a regression analysis of bike rentals in Florida. Capstone 5 serves as a practical application of machine learning: raw data is audited and transformed through feature engineering, and a range of exploratory visualizations, including heatmaps and distribution plots, is generated to identify patterns in rental demand. The workflow evaluates Linear, Lasso, and Ridge regression models to determine which yields the most accurate predictions. All findings, including automated metrics and the executed code, are organized into a structured digital portfolio for professional review. By mapping each technical requirement to a tangible output, the capstone demonstrates a comprehensive approach to solving applied data science problems.

Capstone 5 Scope

Capstone 5 converts the copied bike-rental regression directions into an executed notebook that includes exploratory plots, encoded feature preparation, exported metrics, and prediction samples.

Primary staged dataset: FloridaBikeRentals.csv.

Notebook evidence plus CSV and JSON outputs are staged under outputs/.

Original Project PDF

The copied project directions are embedded here for direct comparison against the notebook and output artifacts.

Requirement Checklist

1a

Build a model to predict the hourly rented bike count needed for a stable supply of rental bikes using rented bike count, hour of day, temperature, humidity, wind speed, rainfall, holidays, and other provided factors.

Source mapping: Requirements file

1b

Load the dataset `FloridaBikeRentals.csv`.

Source mapping: Requirements file

1c

Check for null values in any columns.

Source mapping: Requirements file

1d

Handle the missing values.

Source mapping: Requirements file

1e

Convert the `Date` column to date format.

Source mapping: Requirements file

1f

Extract day from the date column.

Source mapping: Requirements file

1g

Extract month from the date column.

Source mapping: Requirements file

1h

Extract day of week from the date column.

Source mapping: Requirements file

1i

Extract a weekday or weekend flag from the date column.

Source mapping: Requirements file

1j

Check feature correlation using a heatmap.

Source mapping: Requirements file

1k

Plot the distribution plot of `Rented Bike Count`.

Source mapping: Requirements file

1l

Plot the histogram of all numerical features.

Source mapping: Requirements file

1m

Plot the box plot of `Rented Bike Count` against all categorical features.

Source mapping: Requirements file

1n

Plot the Seaborn catplot of `Rented Bike Count` against `Hour`, `Holiday`, `Rainfall(mm)`, `Snowfall (cm)`, weekdays, and weekend.

Source mapping: Requirements file

1o

Record the inferences from the required catplot comparisons.

Source mapping: Requirements file

1p

Encode the categorical features into numerical features.

Source mapping: Requirements file

1q

Use `get_dummies()` for categorical encoding.

Source mapping: Requirements file

1r

Identify the target variable.

Source mapping: Requirements file

1s

Split the dataset into train and test using an 80:20 ratio and random state `1`.

Source mapping: Requirements file

1t

Perform standard scaling on the training dataset.

Source mapping: Requirements file

1u

Perform Linear Regression to predict the bike count required each hour.

Source mapping: Requirements file

1v

Perform Lasso Regression to predict the bike count required each hour.

Source mapping: Requirements file

1w

Perform Ridge Regression to predict the bike count required each hour.

Source mapping: Requirements file

1x

Compare the results from Linear Regression, Lasso Regression, and Ridge Regression.

Source mapping: Requirements file
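Requirements 1c and 1d (audit nulls, then handle them) can be met with a short pandas pass before any feature engineering. The sketch below is illustrative, not the notebook's exact code: the small frame stands in for FloridaBikeRentals.csv, and the median/mode fill is one reasonable strategy among several.

```python
import pandas as pd

# Hypothetical slice standing in for the staged FloridaBikeRentals.csv.
df = pd.DataFrame({
    'Rented Bike Count': [254.0, 204.0, None, 107.0],
    'Hour': [0, 1, 2, 3],
    'Temperature(°C)': [-5.2, None, -6.0, -6.2],
    'Seasons': ['Winter', 'Winter', None, 'Winter'],
})

# 1c: audit null counts per column.
null_counts = df.isna().sum()
print(null_counts[null_counts > 0])

# 1d: fill numeric gaps with the column median, categorical gaps with the mode.
for column in df.columns:
    if pd.api.types.is_numeric_dtype(df[column]):
        df[column] = df[column].fillna(df[column].median())
    else:
        df[column] = df[column].fillna(df[column].mode().iloc[0])

assert df.isna().sum().sum() == 0
```

On the actual staged dataset the audit reports zero nulls, so the fill step becomes a no-op safeguard rather than a transformation.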

Requirement Walkthrough

Each walkthrough block maps the copied PDF requirements to the executed notebook cells, exported outputs, and reviewable evidence staged with this capstone.

5a

Load The Dataset And Build The Required Date Features

Notebook section: Load, audit, and feature-engineering cells

Requirement: Load the dataset, audit nulls, convert Date, and derive day, month, weekday, and weekend features.

The notebook loads the staged CSV with Latin-1 handling, audits missing values, and derives the calendar fields required by the copied PDF before modeling begins.

Results Capture
  • Dataset shape after date-feature engineering is (8760, 18): 14 source columns plus 4 derived fields.
  • Derived fields include day, month, day_of_week, and is_weekend.
  • The null-value audit is recorded before model training.
df = pd.read_csv(DATASET_PATH, encoding='latin1')
df['Date'] = pd.to_datetime(df['Date'], dayfirst=True)
df['day'] = df['Date'].dt.day
df['month'] = df['Date'].dt.month
df['day_of_week'] = df['Date'].dt.day_name()
df['is_weekend'] = df['Date'].dt.dayofweek >= 5
5b

Produce The Required Exploratory Charts

Notebook section: Correlation, histogram, box-plot, and catplot cells

Requirement: Create the heatmap, target distribution plot, histograms, categorical box plots, and the required catplot comparisons.

The notebook exports the full exploratory plot bundle directly into outputs/plots so the site can surface the exact evidence files rather than describing them abstractly.

Results Capture
  • The plot bundle includes a correlation heatmap, target distribution plot, histograms, box plots, and feature comparison charts.
  • Catplot inference notes are exported in the summary JSON for Hour, Holiday, Rainfall, Snowfall, weekday, and weekend comparisons.
sns.heatmap(numeric_df.corr(numeric_only=True), cmap='coolwarm', center=0)
sns.histplot(df['Rented Bike Count'], kde=True)
for column in ['Seasons', 'Holiday', 'Functioning Day']:
    sns.boxplot(data=df, x=column, y='Rented Bike Count')
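The requirement names a Seaborn catplot specifically; the same mean-per-category view the notebook builds with grouped bar charts can also come straight from `sns.catplot`. A minimal sketch, using a small hypothetical frame in place of the staged dataset:

```python
import matplotlib
matplotlib.use('Agg')  # headless backend so the sketch runs without a display
import pandas as pd
import seaborn as sns

# Hypothetical slice standing in for the staged dataset.
df = pd.DataFrame({
    'Hour': [0, 1, 18, 18, 4, 4],
    'Rented Bike Count': [254, 204, 1500, 1450, 130, 140],
})

# catplot with kind='bar' aggregates the mean of y per x category by default.
grid = sns.catplot(data=df, x='Hour', y='Rented Bike Count',
                   kind='bar', height=4, aspect=2)
grid.set_axis_labels('Hour', 'Mean Rented Bike Count')
grid.savefig('catplot_hour_sketch.png')
```

Swapping `x` for `Holiday`, `Rainfall(mm)`, `Snowfall (cm)`, `day_of_week`, or `is_weekend` reproduces the other required comparisons.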
Associated Artifact

Correlation Heatmap

Saved heatmap for the numeric feature correlation scan.

Correlation Heatmap
Associated Artifact

Target Distribution

Saved distribution plot for the target variable.

Target Distribution
Associated Artifact

Numeric Feature Histograms

Saved histograms for the numeric feature set.

Numeric Feature Histograms
Associated Artifact

Seasons Box Plot

Saved box plot for bike rentals across seasons.

Seasons Box Plot
Associated Artifact

Holiday Box Plot

Saved box plot for bike rentals on holiday versus non-holiday days.

Holiday Box Plot
Associated Artifact

Functioning Day Box Plot

Saved box plot for bike rentals by functioning day status.

Functioning Day Box Plot
Associated Artifact

Hour Catplot

Saved catplot for mean bike rentals by hour of day.

Hour Catplot
Associated Artifact

Holiday Catplot

Saved catplot comparing rentals on holiday versus non-holiday days.

Holiday Catplot
Associated Artifact

Rainfall Catplot

Saved catplot for rentals across rainfall levels.

Rainfall Catplot
Associated Artifact

Snowfall Catplot

Saved catplot for rentals across snowfall levels.

Snowfall Catplot
Associated Artifact

Day of Week Catplot

Saved catplot for mean bike rentals by day of the week.

Day of Week Catplot
Associated Artifact

Weekend Catplot

Saved catplot comparing weekday versus weekend demand.

Weekend Catplot
Associated Artifact

Model Error Comparison

Saved comparison chart for RMSE and MAE across the three models.

Model Error Comparison
5c

Encode Features, Scale Inputs, And Compare Regression Models

Notebook section: Model-preparation and model-comparison cells

Requirement: Encode categorical features, split the data 80:20 with random_state 1, standard-scale the inputs, and compare Linear, Lasso, and Ridge Regression.

The notebook stages a get_dummies preview, then trains the three required regression models inside a scaled preprocessing pipeline and exports the comparison metrics.

Results Capture
  • Best model by RMSE: Lasso Regression.
  • Train rows: 7008; test rows: 1752.
  • Model metrics and prediction samples are exported as CSV artifacts.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=1)
pipeline = Pipeline([('preprocessor', preprocessor), ('model', estimator)])
pipeline.fit(X_train, y_train)
predictions = pipeline.predict(X_test)
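The notebook wraps preprocessing in a pipeline, but the PDF's split-scale-fit sequence can also be written out longhand: `get_dummies()` encoding, an 80:20 split with `random_state=1`, a scaler fit on the training split only, then the three regressors with sklearn defaults. A minimal sketch on synthetic data (the columns and coefficients are stand-ins, not the staged CSV):

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import Lasso, LinearRegression, Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(1)
# Synthetic stand-in with one categorical column.
df = pd.DataFrame({
    'Hour': rng.integers(0, 24, 200),
    'Temperature': rng.normal(15, 8, 200),
    'Seasons': rng.choice(['Winter', 'Summer'], 200),
})
df['Rented Bike Count'] = 50 * df['Hour'] + 10 * df['Temperature'] + rng.normal(0, 30, 200)

# 1p/1q: encode categorical features with get_dummies().
X = pd.get_dummies(df.drop(columns=['Rented Bike Count']))
y = df['Rented Bike Count']

# 1s: 80:20 split with random_state=1.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=1)

# 1t: fit the scaler on the training rows only, then transform both splits.
scaler = StandardScaler().fit(X_train)
X_train_s, X_test_s = scaler.transform(X_train), scaler.transform(X_test)

# 1u-1x: fit the three models and compare held-out RMSE.
results = {}
for name, model in [('Linear', LinearRegression()), ('Lasso', Lasso()), ('Ridge', Ridge())]:
    model.fit(X_train_s, y_train)
    results[name] = float(np.sqrt(mean_squared_error(y_test, model.predict(X_test_s))))
print(results)
```

Fitting the scaler only on `X_train` avoids leaking test-set statistics into training, which is the same guarantee the notebook's pipeline provides automatically.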
Associated Artifact

Model R² Comparison

Saved comparison chart for R² values across Linear, Lasso, and Ridge Regression.

Model R² Comparison

Colab Notebook

This section provides the notebook preview, launch link, and project file links.

The notebook opens in Google Colab when a launch URL is configured, and the project files and outputs remain available here on the site.

Capstone 5 Notebook Workspace
Launch Colab
Embedded Notebook Preview
Cell 1 Markdown

Capstone Session 5

This notebook is generated from the copied Capstone_Session_5.pdf task list and the staged FloridaBikeRentals.csv dataset. It follows the same requirement-first workflow used across the FrancisBurnet capstone site.

Cell 2 Markdown

Objective

Predict hourly bike rental demand and compare Linear Regression, Lasso Regression, and Ridge Regression using the required preprocessing and exploratory analysis steps.

Cell 3 Code · python
from pathlib import Path
import json
import sys
from urllib.parse import quote

import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns
from IPython.display import display
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.linear_model import Lasso, LinearRegression, Ridge
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

IS_COLAB = 'google.colab' in sys.modules
GITHUB_REPO_OWNER = 'FrancisBurnet'
GITHUB_REPO_NAME = 'francisburnet'
GITHUB_REPO_BRANCH = 'main'
CAPSTONE_ROOT = Path('Incremental Capstones/Machine Learning Using Python/Capstone Session 5')
DATASET_FILENAME = 'FloridaBikeRentals.csv'


def build_raw_github_url(relative_path: Path) -> str:
    encoded_path = quote(relative_path.as_posix(), safe='/')
    return (
        f"https://raw.githubusercontent.com/{GITHUB_REPO_OWNER}/{GITHUB_REPO_NAME}/"
        f"{GITHUB_REPO_BRANCH}/{encoded_path}"
    )


def resolve_capstone_dir() -> Path | None:
    current = Path.cwd().resolve()
    capstone_parts = CAPSTONE_ROOT.parts
    for candidate in [current, *current.parents]:
        if len(candidate.parts) >= len(capstone_parts) and candidate.parts[-len(capstone_parts):] == capstone_parts:
            return candidate
        nested_candidate = candidate / CAPSTONE_ROOT
        if nested_candidate.exists():
            return nested_candidate
    return None


CAPSTONE_DIR = resolve_capstone_dir()
DATASET_URL = build_raw_github_url(CAPSTONE_ROOT / DATASET_FILENAME)

if CAPSTONE_DIR is not None:
    OUTPUT_ROOT = CAPSTONE_DIR
    OUTPUT_MODE = 'permanent capstone outputs'
    OUTPUT_DISPLAY = (CAPSTONE_ROOT / 'outputs').as_posix()
else:
    runtime_root = Path('/content/capstone-session-5-runtime') if IS_COLAB else Path.cwd().resolve() / 'capstone-session-5-runtime'
    OUTPUT_ROOT = runtime_root
    OUTPUT_MODE = 'runtime scratch outputs; export final artifacts back into the capstone outputs folder'
    OUTPUT_DISPLAY = 'capstone-session-5-runtime/outputs'

OUTPUTS_DIR = (OUTPUT_ROOT / 'outputs').resolve()
PLOTS_DIR = OUTPUTS_DIR / 'plots'
OUTPUTS_DIR.mkdir(parents=True, exist_ok=True)
PLOTS_DIR.mkdir(parents=True, exist_ok=True)
sns.set_theme(style='whitegrid')
pd.set_option('display.max_columns', 100)

print('Runtime:', 'Google Colab' if IS_COLAB else 'Notebook runtime')
print('Capstone artifact path:', CAPSTONE_ROOT.as_posix())
print('Dataset source:', DATASET_URL)
print('Output mode:', OUTPUT_MODE)
print('Output target:', OUTPUT_DISPLAY)
Output
Runtime: Local / notebook runtime
Base directory: x:\SIMPLILEARN\FrancisBurnetCom\Incremental Capstones\Machine Learning Using Python\Capstone Session 5
Dataset path: x:\SIMPLILEARN\FrancisBurnetCom\Incremental Capstones\Machine Learning Using Python\Capstone Session 5\FloridaBikeRentals.csv
Cell 4 Markdown

Load and audit the staged dataset

Cell 5 Code · python
df = pd.read_csv(DATASET_URL, encoding='latin1')
df.columns = [column.replace('�', '°').strip() for column in df.columns]
display(df.head())
print('Dataset source used:', DATASET_URL)
print('Shape:', df.shape)
print('Columns:', df.columns.tolist())
null_counts = df.isna().sum().sort_values(ascending=False)
display(null_counts[null_counts > 0])
Output
         Date  Rented Bike Count  Hour  Temperature(°C)  Humidity(%)  \
0  01/12/2017                254     0             -5.2           37   
1  01/12/2017                204     1             -5.5           38   
2  01/12/2017                173     2             -6.0           39   
3  01/12/2017                107     3             -6.2           40   
4  01/12/2017                 78     4             -6.0           36   

   Wind speed (m/s)  Visibility (10m)  Dew point temperature(°C)  \
0               2.2              2000                      -17.6   
1               0.8              2000                      -17.6   
2               1.0              2000                      -17.7   
3               0.9              2000                      -17.6   
4               2.3              2000                      -18.6   

   Solar Radiation (MJ/m2)  Rainfall(mm)  Snowfall (cm) Seasons     Holiday  \
0                      0.0           0.0            0.0  Winter  No Holiday   
1                      0.0           0.0            0.0  Winter  No Holiday   
2                      0.0           0.0            0.0  Winter  No Holiday   
3                      0.0           0.0            0.0  Winter  No Holiday   
4                      0.0           0.0            0.0  Winter  No Holiday   

  Functioning Day  
0             Yes  
1             Yes  
2             Yes  
3             Yes  
4             Yes  
Shape: (8760, 14)
Columns: ['Date', 'Rented Bike Count', 'Hour', 'Temperature(°C)', 'Humidity(%)', 'Wind speed (m/s)', 'Visibility (10m)', 'Dew point temperature(°C)', 'Solar Radiation (MJ/m2)', 'Rainfall(mm)', 'Snowfall (cm)', 'Seasons', 'Holiday', 'Functioning Day']
Series([], dtype: int64)
Cell 6 Markdown

Feature engineering from the required date column

Cell 7 Code · python
df['Date'] = pd.to_datetime(df['Date'], dayfirst=True)
df['day'] = df['Date'].dt.day
df['month'] = df['Date'].dt.month
df['day_of_week'] = df['Date'].dt.day_name()
df['is_weekend'] = df['Date'].dt.dayofweek >= 5
display(df[['Date', 'day', 'month', 'day_of_week', 'is_weekend']].head())
Output
        Date  day  month day_of_week  is_weekend
0 2017-12-01    1     12      Friday       False
1 2017-12-01    1     12      Friday       False
2 2017-12-01    1     12      Friday       False
3 2017-12-01    1     12      Friday       False
4 2017-12-01    1     12      Friday       False
Cell 8 Markdown

Required exploratory analysis plots

Cell 9 Code · python
numeric_df = df.select_dtypes(include=['number']).copy()
fig, ax = plt.subplots(figsize=(12, 8))
sns.heatmap(numeric_df.corr(numeric_only=True), cmap='coolwarm', center=0, ax=ax)
ax.set_title('Feature Correlation Heatmap')
fig.tight_layout()
fig.savefig(PLOTS_DIR / 'correlation_heatmap.png', dpi=150)
plt.show()
plt.close(fig)
Output
<Figure size 1200x800 with 2 Axes>
Cell 10 Code · python
fig, ax = plt.subplots(figsize=(10, 5))
sns.histplot(df['Rented Bike Count'], kde=True, ax=ax)
ax.set_title('Distribution of Rented Bike Count')
fig.tight_layout()
fig.savefig(PLOTS_DIR / 'rented_bike_count_distribution.png', dpi=150)
plt.show()
plt.close(fig)
Output
<Figure size 1000x500 with 1 Axes>
Cell 11 Code · python
fig = df.select_dtypes(include=['number']).hist(figsize=(16, 14), bins=30)
plt.tight_layout()
plt.savefig(PLOTS_DIR / 'numeric_feature_histograms.png', dpi=150)
plt.show()
plt.close('all')
Output
<Figure size 1600x1400 with 12 Axes>
Cell 12 Code · python
for column in ['Seasons', 'Holiday', 'Functioning Day']:
    fig, ax = plt.subplots(figsize=(10, 5))
    sns.boxplot(data=df, x=column, y='Rented Bike Count', ax=ax)
    ax.set_title(f'Rented Bike Count by {column}')
    ax.tick_params(axis='x', rotation=20)
    fig.tight_layout()
    safe_name = column.lower().replace(' ', '_')
    fig.savefig(PLOTS_DIR / f'boxplot_{safe_name}.png', dpi=150)
    plt.show()
    plt.close(fig)
Output
<Figure size 1000x500 with 1 Axes>
<Figure size 1000x500 with 1 Axes>
<Figure size 1000x500 with 1 Axes>
Cell 13 Code · python
catplot_specs = [
    ('Hour', 'hour'),
    ('Holiday', 'holiday'),
    ('Rainfall(mm)', 'rainfall'),
    ('Snowfall (cm)', 'snowfall'),
    ('day_of_week', 'day_of_week'),
    ('is_weekend', 'is_weekend'),
]
inferences = []
for column, slug in catplot_specs:
    fig, ax = plt.subplots(figsize=(12, 5))
    grouped = df.groupby(column, dropna=False)['Rented Bike Count'].mean().sort_values(ascending=False)
    sns.barplot(x=grouped.index.astype(str), y=grouped.values, ax=ax)
    ax.set_title(f'Mean Rented Bike Count by {column}')
    ax.tick_params(axis='x', rotation=30)
    fig.tight_layout()
    fig.savefig(PLOTS_DIR / f'catplot_{slug}.png', dpi=150)
    plt.show()
    plt.close(fig)
    inferences.append({
        'feature': column,
        'highest_mean_group': str(grouped.index[0]),
        'highest_mean_value': round(float(grouped.iloc[0]), 3),
        'lowest_mean_group': str(grouped.index[-1]),
        'lowest_mean_value': round(float(grouped.iloc[-1]), 3),
    })
display(pd.DataFrame(inferences))
Output
<Figure size 1200x500 with 1 Axes>
<Figure size 1200x500 with 1 Axes>
<Figure size 1200x500 with 1 Axes>
<Figure size 1200x500 with 1 Axes>
<Figure size 1200x500 with 1 Axes>
<Figure size 1200x500 with 1 Axes>
         feature highest_mean_group  highest_mean_value lowest_mean_group  \
0           Hour                 18            1502.926                 4   
1        Holiday         No Holiday             715.228           Holiday   
2   Rainfall(mm)                1.3             764.000               7.5   
3  Snowfall (cm)                0.0             732.273               7.1   
4    day_of_week             Friday             747.118            Sunday   
5     is_weekend              False             719.449              True   

   lowest_mean_value  
0            132.592  
1            499.757  
2              9.000  
3             24.000  
4            625.155  
5            667.342  
Cell 14 Markdown

Modeling

The PDF requires get_dummies(), an 80:20 split with random_state=1, standard scaling, and comparison of Linear Regression, Lasso Regression, and Ridge Regression. The PDF does not specify regularization strengths, so the notebook uses sklearn defaults for Lasso and Ridge and records that choice in the summary output.

Cell 15 Code · python
target = 'Rented Bike Count'
feature_df = df.drop(columns=['Date'])
X = feature_df.drop(columns=[target])
y = feature_df[target]
categorical_columns = X.select_dtypes(include=['object', 'bool']).columns.tolist()
numeric_columns = [column for column in X.columns if column not in categorical_columns]

preprocessor = ColumnTransformer(
    transformers=[
        ('num', StandardScaler(), numeric_columns),
        ('cat', OneHotEncoder(handle_unknown='ignore'), categorical_columns),
    ]
)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=1)
print('Train shape:', X_train.shape, 'Test shape:', X_test.shape)
encoded_preview = pd.get_dummies(X, columns=categorical_columns, drop_first=False)
display(encoded_preview.head())
Output
Train shape: (7008, 16) Test shape: (1752, 16)
C:\Users\franc\AppData\Local\Temp\ipykernel_62656\4026604946.py:5: Pandas4Warning: For backward compatibility, 'str' dtypes are included by select_dtypes when 'object' dtype is specified. This behavior is deprecated and will be removed in a future version. Explicitly pass 'str' to `include` to select them, or to `exclude` to remove them and silence this warning.
See https://pandas.pydata.org/docs/user_guide/migration-3-strings.html#string-migration-select-dtypes for details on how to write code that works with pandas 2 and 3.
  categorical_columns = X.select_dtypes(include=['object', 'bool']).columns.tolist()
   Hour  Temperature(°C)  Humidity(%)  Wind speed (m/s)  Visibility (10m)  \
0     0             -5.2           37               2.2              2000   
1     1             -5.5           38               0.8              2000   
2     2             -6.0           39               1.0              2000   
3     3             -6.2           40               0.9              2000   
4     4             -6.0           36               2.3              2000   

   Dew point temperature(°C)  Solar Radiation (MJ/m2)  Rainfall(mm)  \
0                      -17.6                      0.0           0.0   
1                      -17.6                      0.0           0.0   
2                      -17.7                      0.0           0.0   
3                      -17.6                      0.0           0.0   
4                      -18.6                      0.0           0.0   

   Snowfall (cm)  day  month  Seasons_Autumn  Seasons_Spring  Seasons_Summer  \
0            0.0    1     12           False           False           False   
1            0.0    1     12           False           False           False   
2            0.0    1     12           False           False           False   
3            0.0    1     12           False           False           False   
4            0.0    1     12           False           False           False   

   Seasons_Winter  Holiday_Holiday  Holiday_No Holiday  Functioning Day_No  \
0            True            False                True               False   
1            True            False                True               False   
2            True            False                True               False   
3            True            False                True               False   
4            True            False                True               False   

   Functioning Day_Yes  day_of_week_Friday  day_of_week_Monday  \
0                 True                True               False   
1                 True                True               False   
2                 True                True               False   
3                 True                True               False   
4                 True                True               False   

   day_of_week_Saturday  day_of_week_Sunday  day_of_week_Thursday  \
0                 False               False                 False   
1                 False               False                 False   
2                 False               False                 False   
3                 False               False                 False   
4                 False               False                 False   

   day_of_week_Tuesday  day_of_week_Wednesday  is_weekend_False  \
0                False                  False              True   
1                False                  False              True   
2                False                  False              True   
3                False                  False              True   
4                False                  False              True   

   is_weekend_True  
0            False  
1            False  
2            False  
3            False  
4            False  
Cell 16 Code · python
models = {
    'Linear Regression': LinearRegression(),
    'Lasso Regression': Lasso(),
    'Ridge Regression': Ridge(),
}

results = []
prediction_frames = []
for name, estimator in models.items():
    pipeline = Pipeline([('preprocessor', preprocessor), ('model', estimator)])
    pipeline.fit(X_train, y_train)
    predictions = pipeline.predict(X_test)
    rmse = float(np.sqrt(mean_squared_error(y_test, predictions)))
    mae = float(mean_absolute_error(y_test, predictions))
    r2 = float(r2_score(y_test, predictions))
    results.append({'model': name, 'rmse': rmse, 'mae': mae, 'r2': r2})
    prediction_frames.append(pd.DataFrame({
        'model': name,
        'actual': y_test.reset_index(drop=True),
        'predicted': pd.Series(predictions),
    }).head(25))

results_df = pd.DataFrame(results).sort_values('rmse').reset_index(drop=True)
display(results_df)
best_model = results_df.iloc[0].to_dict()
print('Best model by RMSE:', best_model['model'])
Output
               model        rmse         mae        r2
0   Lasso Regression  430.296819  319.643529  0.551997
1   Ridge Regression  430.788278  320.336715  0.550973
2  Linear Regression  430.803271  320.359317  0.550942
Best model by RMSE: Lasso Regression
Cell 17 Code · python
fig, ax = plt.subplots(figsize=(10, 5))
results_df.plot(x='model', y=['rmse', 'mae'], kind='bar', ax=ax)
ax.set_title('Model Error Comparison')
ax.set_ylabel('Error')
fig.tight_layout()
fig.savefig(PLOTS_DIR / 'model_error_comparison.png', dpi=150)
plt.show()
plt.close(fig)

fig, ax = plt.subplots(figsize=(10, 5))
results_df.plot(x='model', y='r2', kind='bar', color='teal', ax=ax)
ax.set_title('Model R2 Comparison')
ax.set_ylabel('R2 Score')
fig.tight_layout()
fig.savefig(PLOTS_DIR / 'model_r2_comparison.png', dpi=150)
plt.show()
plt.close(fig)
Output
<Figure size 1000x500 with 1 Axes>
<Figure size 1000x500 with 1 Axes>
Cell 18 Code · python
results_df.to_csv(OUTPUTS_DIR / 'session_5_model_metrics.csv', index=False)
pd.concat(prediction_frames, ignore_index=True).to_csv(OUTPUTS_DIR / 'session_5_prediction_samples.csv', index=False)
summary = {
    'dataset_shape': list(df.shape),
    'train_rows': int(X_train.shape[0]),
    'test_rows': int(X_test.shape[0]),
    'target': target,
    'categorical_columns': categorical_columns,
    'numeric_columns': numeric_columns,
    'model_results': results,
    'best_model': best_model,
    'catplot_inferences': inferences,
    'notes': [
        'Categorical features are encoded with pd.get_dummies() preview and pipeline one-hot encoding for model training.',
        'Lasso and Ridge use sklearn default alpha values because the PDF does not specify hyperparameters.',
    ],
}
with open(OUTPUTS_DIR / 'session_5_summary.json', 'w', encoding='utf-8') as handle:
    json.dump(summary, handle, indent=2)
summary
Output
{'dataset_shape': [8760, 18],
 'train_rows': 7008,
 'test_rows': 1752,
 'target': 'Rented Bike Count',
 'categorical_columns': ['Seasons',
  'Holiday',
  'Functioning Day',
  'day_of_week',
  'is_weekend'],
 'numeric_columns': ['Hour',
  'Temperature(°C)',
  'Humidity(%)',
  'Wind speed (m/s)',
  'Visibility (10m)',
  'Dew point temperature(°C)',
  'Solar Radiation (MJ/m2)',
  'Rainfall(mm)',
  'Snowfall (cm)',
  'day',
  'month'],
 'model_results': [{'model': 'Linear Regression',
   'rmse': 430.80327132329893,
   'mae': 320.3593174193256,
   'r2': 0.5509419599562837},
  {'model': 'Lasso Regression',
   'rmse': 430.29681892846634,
   'mae': 319.64352934565517,
   'r2': 0.5519971647497581},
  {'model': 'Ridge Regression',
   'rmse': 430.78827825169327,
   'mae': 320.33671540137607,
   'r2': 0.5509732161822174}],
 'best_model': {'model': 'Lasso Regression',
  'rmse': 430.29681892846634,
  'mae': 319.64352934565517,
  'r2': 0.5519971647497581},
 'catplot_inferences': [{'feature': 'Hour',
   'highest_mean_group': '18',
   'highest_mean_value': 1502.926,
   'lowest_mean_group': '4',
   'lowest_mean_value': 132.592},
  {'feature': 'Holiday',
   'highest_mean_group': 'No Holiday',
   'highest_mean_value': 715.228,
   'lowest_mean_group': 'Holiday',
   'lowest_mean_value': 499.757},
  {'feature': 'Rainfall(mm)',
   'highest_mean_group': '1.3',
   'highest_mean_value': 764.0,
   'lowest_mean_group': '7.5',
   'lowest_mean_value': 9.0},
  {'feature': 'Snowfall (cm)',
   'highest_mean_group': '0.0',
   'highest_mean_value': 732.273,
   'lowest_mean_group': '7.1',
   'lowest_mean_value': 24.0},
  {'feature': 'day_of_week',
   'highest_mean_group': 'Friday',
   'highest_mean_value': 747.118,
   'lowest_mean_group': 'Sunday',
   'lowest_mean_value': 625.155},
  {'feature': 'is_weekend',
   'highest_mean_group': 'False',
   'highest_mean_value': 719.449,
   'lowest_mean_group': 'True',
   'lowest_mean_value': 667.342}],
 'notes': ['Categorical features are encoded with pd.get_dummies() preview and pipeline one-hot encoding for model training.',
  'Lasso and Ridge use sklearn default alpha values because the PDF does not specify hyperparameters.']}
Cell 19 Markdown

Conclusion

This notebook stages the required exploratory analysis, encoding preview, model comparison, and saved artifacts for the website workflow. The plot files and summary outputs in outputs/ are the evidence layer the custom Session 5 page can surface next.

Project Notes
  • Dataset audit and date-feature engineering.
  • Exploratory plot bundle and catplot inference notes.
  • Regression model comparison outputs.
  • Notebook plus CSV and JSON exports.
Launch Controls

Notebook Launch

Open the matching notebook in Google Colab or review the tracked notebook source in GitHub.

Project File Links
  • Notebook File: Open Notebook File
    Executed Session 5 notebook used as the main evidence source for this page.
  • Source Dataset: Open Source Dataset
    Original bike-rental dataset staged with the copied capstone files.
  • Model Metrics CSV: Open Model Metrics CSV
    Exported comparison metrics for Linear, Lasso, and Ridge Regression.
  • Summary JSON: Open Summary JSON
    Structured summary of shapes, best model, and feature-level inference notes.

Outputs And Results

Key Outputs
  • Executed notebook artifact saved as capstone_session_5.ipynb.
  • CSV exports include model metrics and prediction samples for the held-out split.
  • Plot artifacts cover correlation, distribution, histograms, box plots, and model-comparison visuals.
Key Findings
  • The current best model by RMSE is Lasso Regression.
  • The exported summary records the strongest hourly demand peak at Hour = 18.
  • The page now surfaces both the exploratory evidence and the model-comparison outputs from the copied Session 5 workflow.