October 04, 2022
Conference proceedings talk, ELRIG Drug Discovery 2022, ExCeL London
Data integration workflows for multiparametric phenotypic data take many forms across academia and industry. Efforts with limited resources and smaller teamsoften encountered in academia can easily fall short of data integration best practices for processing and combining high content imaging, proteomics,metabolomics and other omics data. The additional difficulty in applying literature techniques with example code in disparate repositories, languages and compute environments means new techniques require significant investment before use. We present Phenonaut, a Python software package designed to address the data workflow needs of integration, migration, automation, and auditability in the application of literature and proprietary techniques for data source and structure agnostic workflow creation. The application of Phenonaut for omics analysis by non-computational specialists is exemplified by several use cases to demonstrate how the framework can be used for knowledge extraction from diverse biological datasets.