Programmatic access to UniProt using Python
Date: 29 September 2022
UniProt is a comprehensive, expert-led, publicly available database of protein sequence, function and variation information.
This webinar will give an overview of programmatic access to the UniProt database using Python and cover key aspects of protein entry searches, data filtering, batch downloads and give examples of further processing of downloaded target data.
Following a brief introduction to UniProt services, where to find relevant documentation and help features, the webinar will focus on worked examples. These will include how to programmatically search and retrieve protein entries and sequences, within the results we will then show how to align orthologous sequences and filter for features of interest.
The webinar will also cover programmatic examples of the UniProt Retrieve/ID mapping service, batch downloads, processing, filtering data by annotation type, and retrieval of recently published proteomics-derived post-translational modification data.
Keywords: UniProt: The Universal Protein Resource, Proteins (proteins), MetaboLights: Metabolomics repository and reference database, Chemical Entities of Biological Interest, Cross domain (cross-domain), ChEBI, Metabolites, Molecular building blocks of life, Human Cell Atlas Data Coordination Platform, Single-cell transcriptomics, HCA data portal, Programmatic access, API, Python, Complex Portal, macromolecular assembly, InterPro, Boolean modelling, Europe PubMed Central, Literature (literature), Open access, Protein Data Bank in Europe - Knowledge Base, 3D structure, AlphaFold Database, DeepMind, Artificial intelligence, AI, Structure prediction, cancer, Boolean, Ensembl Genomes, DNA & RNA (dna-rna), European Nucleotide Archive, Data archive, Raw sequencing data, RNAcentral, Non-coding RNA, ncRNA, GPU, Data protection, Job dispatcher, Bioimage analysis resource, Accessibility, Missense variation, Biostatistics, Rfam, non-coding RNA, Infernal software, Sequence annotation, Root microbiome, Abiotic stress, land management, Plant genotype, Plant webinar series, HPC, database development, cross-linked databases, Plant database, data infrastructure, Plant breeding, Data standards, data managemnet, data sharing, Hyb-Seq method, Flowering plants, Crop improvement, Pangenomics, Pangenomes, Virtual humans, Drug-target identification, plant-microbe interactions, Spatial transcriptomics, Plant research, Open Targets Platform, Drug targets, Machine learning, Mathematical modelling, plant science, Data integration, plant-environment interaction, Phenotyping, field phenotyping, Deep phenotyping, EOSC-Life, NHGRI-EBI GWAS Catalog, clinical data, genome-wide association, Proteomics, Proteomes, Peptide search, plants, European Variation Archive, EVA, Variant clusters, Variant data annotation, Constraint-based metabolic modelling, UniProt knowledgebase, protein variant impact, disease-associated protein variants, Bioethics, FAIR principles, ELSI, cohort data, translational research, BioModels database, Mathematical modeling, Reproducibity, Systems biology models, workflows, federated analysis, polygenic risk scores, IntAct Molecular Interaction Database, PSICQUIC, IMEx, Complex portal, Agent-based modelling, Macrophages, Tumorigenesis, Ensembl, Training (Training), On-demand, teaching, introduction, Building blocks, Data analysis, COSMIC, Cancer mutation, Somatic mutation, UniRule, UniFIRE, ARBA, Protein annotation, ChEMBL: Bioactive data for drug discovery, Chemical compounds, drug-like molecules, Chemogenomics, Biocurator, Programming, Data management, Green Algorithms, Open data, Environmental impact, Carbon footprint, HPC workflows, Orchestrator, Gene expression (gene-expression), Chemosensitivity assay, Experimental protocols, Drug screening, MICHA, European Genome-phenome Archive, EGA, restricted access, UniProt, Introduction, UniProtKB, Proteome, Protein Data Bank in Europe, genes, Introductory, GDPR, Data security, Expression Atlas, UniParc, UniRef
Organizer: European Bioinformatics Institute (EBI)
Target audience: Plant research, Plant research
Capacity: 1000
Event types:
- Workshops and courses
Scientific topics: Proteins
Activity log