Menu

Understanding Features and Labels

Understanding Features and Labels

In supervised learning, data is divided into two components:

Features (Independent Variables)

Features are the input variables used by the model to make predictions.

Examples from the California Housing Dataset include:

  • MedInc
  • HouseAge
  • AveRooms
  • AveBedrms
  • Population
  • AveOccup

These variables describe different aspects of each property and its environment.

Labels (Target Variable)

Labels are the values that the model is trying to predict.

For this project:

PRICE

represents the median value of owner-occupied homes.

The machine learning model learns the relationship between the features and the target variable during training.