Learning from Data¶

Learning from data is a practice of extracting compressed knowledge about the world from data. There are many frameworks of learning. For example, the induction, deduction, and transduction schema shows different possible paths to produce predictions.

graph LR
    P[Prediction]
    D[Data]
    M[Model]
    D --"Induction"--> M
    M --"Deduction"--> P
    D --"Transduction"--> P

There are two different approaches to making predictions based on some given data.

Perform induction to find a good model from the data, this is called induction. Once we have a model, we can use it to make predictions, this is called deduction.
Directly make predictions from the data, this is called transduction.

The Nature of Statistical Learning Theory

Vapnik's seminal book The Nature of Statistical Learning Theory is a very good read for the fundamentals of learning theories¹.

Vapnik also discussed some of the key ideas in a book chapter Estimation of Dependences Based on Empirical Data².

In the context of machine learning, Abu-Mostafa, Magdon-Ismail, and Lin summarized the machine learning problem using the following chart ³. Ultimately, we need to find an approximation \(g\) of the true map \(f\) from features \(\mathcal X\) to targets \(\mathcal Y\) on a specific probability distribution of features \(P\). This process is done by using an algorithm to select a hypothesis that works.

flowchart LR
    X[Data Samples]
    A[Algorithm]
    H[Hypotheses Set]
    SH[Selected Hypothesis]

    X --> A
    H --> A
    A --> SH

Based on this framework, a machine learning process usually consists of three core components⁴.

Representation: Encoded data and the problem representation.
Evaluation: An objective function to be evaluated that guides the model.
Optimization: An algorithm to optimize the model so it learns what we want it to do.

We will reuse this framework again and again in the following sections of this chapter.

Vapnik V. The nature of statistical learning theory. Springer: New York, NY, 2010 doi:10.1007/978-1-4757-3264-1. ↩
Vapnik V. Estimation of dependences based on empirical data. 1st ed. Springer: New York, NY, 2006 doi:10.1007/0-387-34239-7. ↩
Abu-Mostafa YS, Magdon-Ismail M, Lin H-T. Learning from data: A short course. AMLBook, 2012https://www.semanticscholar.org/paper/Learning-From-Data-Abu-Mostafa-Magdon-Ismail/1c0ed9ed3201ef381cc392fc3ca91cae6ecfc698. ↩
Domingos P. A few useful things to know about machine learning. Communications of the ACM 2012; 55: 78–87. ↩

Contributors: LM