Skip to main content

What is Machine Learning?

At its simplest, Machine Learning (ML) is the field of study that gives computers the ability to learn without being explicitly programmed. Instead of a human writing a thousand "if-then" statements, we provide an algorithm with data, and the algorithm "finds" the patterns itself.

1. The Paradigm Shift​

To understand ML, we must compare it to Traditional Programming.

Traditional Programming​

In traditional software engineering, a human provides the Rules (code) and the Data. The computer follows the rules to produce an Output.

Machine Learning​

In ML, we provide the Data and the Output (labels). The computer analyzes these to produce the Rules (the Model).

2. The Three Main Types of Learning​

Machine Learning is generally divided into three main categories based on how the agent "learns."

A. Supervised Learning​

The model is trained on labeled data. You give it inputs and the correct answers. It’s like a student learning with a teacher who corrects their homework.

  • Regression: Predicting a continuous number (e.g., Home prices).
  • Classification: Predicting a category (e.g., Is this email Spam or Not Spam?).

B. Unsupervised Learning​

The model is given unlabeled data and must find hidden structures or patterns on its own. There is no "teacher."

  • Clustering: Grouping customers by similar buying habits.
  • Association: Finding that people who buy bread also tend to buy butter.

C. Reinforcement Learning (RL)​

The model (agent) learns by interacting with an environment. It receives rewards for good actions and penalties for bad ones. It’s how AI learns to play chess or drive autonomous cars.

3. The Core Ingredients of ML​

Every Machine Learning problem requires three components:

  1. The Dataset: High-quality, representative data.
  2. The Features: The specific attributes or variables the model looks at (e.g., mileage, year, and brand for a car).
  3. The Algorithm: The mathematical process used to find patterns (e.g., Linear Regression, Neural Networks).

4. The Lifecycle of an ML Project​

Building a model isn't just writing code; it's a circular process:

  1. Define the Goal: What are we trying to predict?
  2. Data Collection: Gathering raw information.
  3. Data Preprocessing: Cleaning and scaling (what you learned in the Data Engineering module).
  4. Model Training: Feeding data to the algorithm.
  5. Evaluation: Testing the model on data it hasn't seen before.
  6. Deployment: Putting the model into a real-world app.

5. When NOT to use Machine Learning​

ML is powerful, but it isn't always the right tool. Avoid ML if:

  • You have very little data.
  • The problem can be solved with simple, static logic.
  • You need 100% mathematical certainty (ML is probabilistic, not deterministic).

References for More Details​


Now that you understand the "Big Picture," let's look at the most fundamental math behind almost every predictive model.