Date: 12 - 16 October 2020

This course provides an introduction to the use of bioinformatics in biological research, giving participants guidance for using bioinformatics in their work whilst also providing hands-on training in tools and resources appropriate to their research.

Participants will initially be introduced to bioinformatics theory and practice, including best practices for undertaking bioinformatics analysis, data management and reproducibility. To enable specific exploration of resources in their particular field of interest, participants will be divided into focused groups to work on a small project set by EMBL-EBI resource and research staff, ending in a presentation from each group on the final day of the course to bring together learnings from all participants.

The course includes training and mentoring by experts from EMBL-EBI and external institutes.

Group projects

A major element of this course is a group project, where participants will be placed in small groups to work together on a challenge set by trainers from EMBL-EBI. This allows people to explore the bioinformatics tools and resources available in their area of interest and apply them to a set problem, providing participants with hands-on experience relevant to their own research. The group work will culminate in a presentation session involving all participants on the final day of the course, giving an opportunity for wider discussion on the benefits and challenges of working with biological data.

Groups are mentored and supported by the trainers who set the initial challenge, but the groups will be responsible for driving their projects forward, with all members expected to take an active role. Groups are pre-organised before the course, and all group members will be sent some short “homework” in preparation for their project work prior to the start of the course.

Basic outlines of the projects on offer this year are given below. In your application you must indicate your first and second choice of project, based on which you think would benefit your research most. Not all projects may be offered, and final decisions on which projects will be run during the course will be made based on the number of applicants per project.

This year’s projects are as follows:

Networks and pathways

This project will make use of gene expression data (RNA-seq) to build protein-protein interaction networks, which can be used to explore functional relationships between the (potentially) expressed protein products. You will use Cytoscape to visualise protein networks, identify key regulators of biological pathways and explore biological function through network analysis, integration and co-visualisation of additional data, and ontology/functional enrichment analysis - helping to build a better view of the wider biological context.

Modelling cell signalling pathways

Curating models of biological processes is an effective training in computational systems biology, where the curators gain an integrative knowledge of biological systems, modelling and bioinformatics. You will learn to encode computational models of signalling pathways from a recent publication using COPASI and how to reproduce the simulation results. Furthermore, you will learn how to annotate models and re-use pre-existing models from open repositories such as BioModels.

Genome variation across human populations

Natural variation between individuals or between different human populations is a result of genome mutations throughout evolutionary history. Some mutations may become fixed because of their beneficial effect while most drift among individuals. During this project, you will investigate genomic variation between two separate human populations of European and Asian descent. Using sequence data from a number of individuals from each population, you will use a range of bioinformatics tools to discover variants that exist between them. In the second section of the project, you will attempt to analyse the functional consequences of the variants you have identified, linking them to phenotypes.

Metabolic network engineering using a systems model-based approach

Metabolic pathway analysis helps to identify the structure and dynamics of a metabolic network and thereby also allow us to have an insight into cell physiology which is the foundation of metabolic engineering. You will work with a curated model related to a metabolism network chosen from BioModels, and learn how to carry out computational analyses to find common patterns in the networks. These might include computing feasible pathways through the network and minimal reactions to knock out specific metabolic functions, along with visualisation and exploration to gain a further insight into the results.

Finding and interpreting publicly available structural data 

This project will introduce you to the wealth of publicly available structural data and give you the opportunity to investigate how this data can be used to analyse macromolecular structures. You will firstly explore the search and entry pages at PDBe to identify the type of data available for analysis. Then using this knowledge, you will discover how to access this data programmatically and analyse a subset of your results to interpret biological relevance. 

Functional annotation of proteins

Functional annotation refers to discovering the functions of proteins. Participants will  annotate a protein of interest which has not been annotated previously, either manually or by any UniProt automatic system. Participants will use different ways to discover the function of their protein of interest from alignment to machine learning. You will learn the different annotation types in UniProtKB and how to use the UniProt resources for their investigation. You will also learn how to build machine learning models, from dataset construction to prediction.

Keywords: Cross domain (cross-domain), Introduction to bioinformatics, Data analysis

Organizer: European Bioinformatics Institute (EBI)

Capacity: 30

Event types:

  • Workshops and courses

Scientific topics: Systems biology, Biological pathway or network format, Database management, Bioinformatics


Activity log