e-learning

Machine learning: classification and regression

Abstract

Machine learning is a subset of artificial intelligence (AI) that provides machines with the ability to automatically learn from data without being explicitly programmed. It is a combined field of computer science, mathematics and statistics to create a predictive model by learning patterns in a dataset. The dataset may have an output field which makes the learning process supervised. The supervised learning methods in machine learning have outputs (also called as targets or classes or categories) defined in the datasets in a column. These targets can either be integers or real (continuous) numbers. When the targets are integers, the learning task is known as classification. Each row in the dataset is a sample and the classification is assigning a class label/target to each sample. The algorithm which is used for this learning task is called a classifier. When the targets are real numbers, the learning task is called regression and the algorithm which is used for this task is called a regressor. We will go through classification first and look at regression later in this tutorial.

About This Material

This is a Hands-on Tutorial from the GTN which is usable either for individual self-study, or as a teaching material in a classroom.

Questions this will address

  • what are classification and regression techniques?
  • How they can be used for prediction?
  • How visualizations can be used to analyze predictions?

Learning Objectives

  • Explain the types of supervised machine learning - classification and regression.
  • Learn how to make predictions using the training and test dataset.
  • Visualize the predictions.

Licence: Creative Commons Attribution 4.0 International

Keywords: Statistics and machine learning

Target audience: Students

Resource type: e-learning

Version: 12

Status: Active

Prerequisites:

Introduction to Galaxy Analyses

Learning objectives:

  • Explain the types of supervised machine learning - classification and regression.
  • Learn how to make predictions using the training and test dataset.
  • Visualize the predictions.

Date modified: 2024-03-05

Date published: 2019-03-07

Authors: Anup Kumar, Bérénice Batut

Scientific topics: Statistics and probability


Activity log