e-learning

Regression in Machine Learning

Abstract

In this tutorial you will learn how to use Galaxy tools to solve regression problems. First, we will introduce the concept of regression briefly, and then examine linear regression, which models the relationship between a target variable and some explanatory variables (also known as independent variables). Next, we will discuss gradient boosting regression, an more advanced regressor model which can model nonlinear relationships between variables. Then, we will show how to visualize the results in each step. Finally, we will discuss how to train our models by finding the values of their parameters that minimize a cost function. We will work through a real problem to learn how the models and learning algorithms work.

About This Material

This is a Hands-on Tutorial from the GTN which is usable either for individual self-study, or as a teaching material in a classroom.

Questions this will address

  • How to use regression techniques to create predictive models from biological datasets?

Learning Objectives

  • Learn regression background
  • Apply regression based machine learning algorithms
  • Learn ageing biomarkers and predict age from DNA methylation datasets
  • Learn how visualizations can be used to analyze predictions

Licence: Creative Commons Attribution 4.0 International

Keywords: Statistics and machine learning

Target audience: Students

Resource type: e-learning

Version: 17

Status: Active

Prerequisites:

Introduction to Galaxy Analyses

Learning objectives:

  • Learn regression background
  • Apply regression based machine learning algorithms
  • Learn ageing biomarkers and predict age from DNA methylation datasets
  • Learn how visualizations can be used to analyze predictions

Date modified: 2024-10-11

Date published: 2020-01-25

Authors: Alireza Khanteymoori, Anup Kumar, Simon Bray

Contributors: Alireza Khanteymoori, Björn Grüning, Helena Rasche, Saskia Hiltemann

Scientific topics: Statistics and probability


Activity log