Jupyter notebook

ObjTables Python tutorials

ObjTables is a toolkit for creating re-usable datasets that are both human and machine-readable, combining the ease of spreadsheets (e.g., Excel workbooks) with the rigor of schemas (classes, their attributes, the type of each attribute, and the possible relationships between instances of classes). ObjTables consists of a format for describing schemas for spreadsheets, numerous data types for science, a markup format for indicating the class and attribute represented by each table and column in a workbook, and software for using schemas to rigorously validate, merge, split, compare, and revision datasets. ObjTables is ideal for supplementary materials of journal article, as well as for emerging domains which need to quickly build new formats for new types of data and associated software with minimal effort.

The tutorials provide a brief introduction to the ObjTables format for schemas for spreadsheets, the ObjTables markup syntax for spreadsheets, and the ObjTables Python package for parsing, validating, querying, editing, comparing, merging, splitting, revisioning, migrating, and analyzing spreadsheets.

Licence: MIT License

Keywords: spreadsheet, schema, table, workbook, worksheet, XLSX, Excel, standard, reuse, compose, integrate, quality control

Target audience: Researchers, Scientists, Data scientists, computational scientists

Resource type: Jupyter notebook

Authors: Jonathan Karr

Scientific topics: Data management, Data integration and warehousing, Data quality management, Data curation and archival

Operations: Validation

External resources:

Activity log