Data Science and Machine Learning Glossary

  • Machine Learning Algorithm Regression

    Dummy Variable Trap

  • Probability Distributions

    Benford's Law

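    Benford's Law, also called the first-digit law, states that the leading digits of numbers in datasets spanning several orders of magnitude are not uniformly distributed: small leading digits occur more often than large ones, with 1 appearing as the first digit about 30% of the time.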
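
    The non-uniform distribution can be made concrete with the standard formula P(d) = log10(1 + 1/d) for a leading digit d. The formula is not stated in the entry above, and the powers-of-two dataset and all function names below are illustrative assumptions, so treat this as a minimal sketch rather than a reference implementation.

```python
import math
from collections import Counter


def benford_expected():
    """Expected probability of each leading digit 1-9 under Benford's Law."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def leading_digit(n):
    """First decimal digit of a positive integer."""
    return int(str(n)[0])


def leading_digit_frequencies(values):
    """Observed share of each leading digit 1-9 in a list of positive integers."""
    counts = Counter(leading_digit(v) for v in values)
    total = len(values)
    return {d: counts.get(d, 0) / total for d in range(1, 10)}


# Illustrative data only: powers of 2 span many orders of magnitude,
# which is the kind of dataset the law is usually applied to.
powers_of_two = [2 ** k for k in range(1, 1001)]

expected = benford_expected()
observed = leading_digit_frequencies(powers_of_two)

for d in range(1, 10):
    print(f"digit {d}: expected {expected[d]:.3f}, observed {observed[d]:.3f}")
```

    Comparing the two columns is a quick sanity check; a formal comparison would use a goodness-of-fit test, which is outside the scope of this sketch.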

  • Programming

    Binary Variable

    A binary variable is a categorical variable that can take only one of two values, usually represented as a Boolean (True or False) or as an integer (0 or 1).
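
    As a small illustration of the two representations, the sketch below encodes a two-valued categorical column first as Booleans and then as integers. The `responses` data and the choice of "yes" as the positive value are made up for the example.

```python
# Hypothetical survey answers: a categorical variable with exactly two values.
responses = ["yes", "no", "yes", "yes"]

# Boolean representation: True where the answer is "yes".
as_bool = [r == "yes" for r in responses]

# Integer representation: True becomes 1 and False becomes 0.
as_int = [int(b) for b in as_bool]

print(as_bool)  # [True, False, True, True]
print(as_int)   # [1, 0, 1, 1]

# With 0/1 encoding, the sum counts the positives and the mean gives their share.
positive_count = sum(as_int)
positive_share = positive_count / len(as_int)
```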

  • Machine Learning Algorithm Decision Trees

    Gini Impurity

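    Gini impurity is a measure used when building decision trees to decide how the features of a dataset should split nodes to form the tree. It is the probability that a randomly chosen sample in a node would be mislabeled if it were labeled at random according to the class distribution in that node.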
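
    A minimal sketch of how this measure can drive a split, assuming the usual definition G = 1 - sum(p_i^2) over the class proportions p_i in a node. The function names and the toy labels are hypothetical, and real decision-tree libraries add many details not shown here.

```python
from collections import Counter


def gini_impurity(labels):
    """Gini impurity of a node: 1 minus the sum of squared class proportions."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def split_impurity(left, right):
    """Size-weighted average impurity of the two child nodes of a split."""
    n = len(left) + len(right)
    return (len(left) / n) * gini_impurity(left) + (len(right) / n) * gini_impurity(right)


# Toy labels for one parent node and two candidate splits.
parent = ["a", "a", "b", "b"]
clean_split = (["a", "a"], ["b", "b"])   # separates the classes perfectly
mixed_split = (["a", "b"], ["a", "b"])   # leaves both children as mixed as the parent

print(gini_impurity(parent))
print(split_impurity(*clean_split))
print(split_impurity(*mixed_split))
```

    A tree builder would prefer the candidate split with the lowest weighted impurity, which here is the one that separates the two classes.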

  • Linear Algebra

    Eigenspace

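    An eigenspace is the set of all eigenvectors of a matrix (or linear transformation) that share a given eigenvalue, together with the zero vector. It is a basic concept in linear algebra and appears throughout data science, engineering, and science in general.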
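
    To make the idea concrete: the eigenspace of a matrix A for an eigenvalue λ is the set of vectors v satisfying A v = λ v. The sketch below checks membership for a small symmetric matrix in plain Python. The matrix, the tolerance, and the helper names are illustrative assumptions.

```python
def mat_vec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of numbers)."""
    return [sum(a * x for a, x in zip(row, vector)) for row in matrix]


def in_eigenspace(matrix, vector, eigenvalue, tol=1e-9):
    """True if matrix @ vector equals eigenvalue * vector within a tolerance."""
    product = mat_vec(matrix, vector)
    return all(abs(p - eigenvalue * x) <= tol for p, x in zip(product, vector))


# Example matrix with eigenvalues 3 and 1.
A = [[2, 1],
     [1, 2]]

# The eigenspace for eigenvalue 3 is the line spanned by (1, 1):
print(in_eigenspace(A, [1, 1], 3))    # True
print(in_eigenspace(A, [5, 5], 3))    # True, any scalar multiple also belongs
print(in_eigenspace(A, [0, 0], 3))    # True, the zero vector is always included
print(in_eigenspace(A, [1, 0], 3))    # False

# The eigenspace for eigenvalue 1 is the line spanned by (1, -1):
print(in_eigenspace(A, [1, -1], 1))   # True
```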
