Multiomics
Biological analysis approach From Wikipedia, the free encyclopedia
Multiomics, multi-omics, integrative omics, "panomics" or "pan-omics" is a biological analysis approach in which the data consists of multiple "omes", such as the genome, epigenome, transcriptome, proteome, metabolome, exposome, and microbiome (i.e., a meta-genome and/or meta-transcriptome, depending upon how it is sequenced);[1][2][3] in other words, the use of multiple omics technologies to study life in a concerted way. By combining these "omes", scientists can analyze complex biological big data to find novel associations between biological entities, pinpoint relevant biomarkers and build elaborate markers of disease and physiology. In doing so, multiomics integrates diverse omics data to find a coherently matching geno-pheno-envirotype relationship or association.[4] The OmicTools service lists more than 99 pieces of software related to multiomic data analysis, as well as more than 99 databases on the topic.
![]() | You can help expand this article with text translated from the corresponding article in French. (August 2021) Click [show] for important translation instructions.
|

Systems biology approaches are often based upon the use of multiomic analysis data.[5][6] The American Society of Clinical Oncology (ASCO) defines panomics as referring to "the interaction of all biological functions within a cell and with other body functions, combining data collected by targeted tests ... and global assays (such as genome sequencing) with other patient-specific information."[7]
Combined multiomic data collection
Summarize
Perspective
Combined multiomic data collection approaches have evolved to address the limitations of traditional multiomics research, which typically requires separate sample processing for different molecular classes then subsequent computational integration, introducing variability and increasing costs. Early advances in this field include sequential extraction,[8] TRIzol-based sequential isolation methods, which demonstrated that a reagent traditionally used for RNA isolation could simultaneously extract DNA, RNA, proteins, metabolites, and lipids from a single sample. Similar approaches like the Metabolite, Protein, and Lipid extraction (MPLEx)[9] and the "Three-in-One"[10] method adapted biphasic fractionation to extract proteins, metabolites, and lipids for LC-MS/MS analysis. More recent technological developments include the Multi-Omic Single-Shot Technology (MOST),[11] which integrates proteome and lipidome analysis in a single LC-MS run using one reverse-phase column and a binary mobile phase system, and the Bead-enabled Accelerated Monophasic Multi-omics (BAMM)[12] method that combines n-butanol-based monophasic extraction with magnetic beads and accelerated protein digestion for the separate analysis of metabolites, lipids, and proteins. One of the most comprehensive technologies in this space is Dalton Bioanalytics Inc.'s Omni-MS®, a multiomic assay that uses its proprietary method to simultaneously profile proteins, lipids, electrolytes, metabolites, and other small molecules in a single preparation and single LC-MS analysis. This platform has been applied to biomarker discovery, identifying potential biomarkers across multiple molecular classes and across various conditions and diseases[13][14] including COVID severity during pregnancy,[15] 22q11.2 deletion syndrome,[16] and hereditary angioedema.[17] These integrated approaches significantly reduce sample requirements, processing time, and technical variation while improving correlation analysis across different molecular classes, making them increasingly valuable for precision medicine and systems biology research.
Single-cell multiomics
Summarize
Perspective
A branch of the field of multiomics is the analysis of multilevel single-cell data, called single-cell multiomics.[18][19] This approach gives us an unprecedent resolution to look at multilevel transitions in health and disease at the single cell level. An advantage in relation to bulk analysis is to mitigate confounding factors derived from cell to cell variation, allowing the uncovering of heterogeneous tissue architectures.[18]
Methods for parallel single-cell genomic and transcriptomic analysis can be based on simultaneous amplification[20] or physical separation of RNA and genomic DNA.[21] They allow insights that cannot be gathered solely from transcriptomic analysis, as RNA data do not contain non-coding genomic regions and information regarding copy-number variation, for example. An extension of this methodology is the integration of single-cell transcriptomes to single-cell methylomes, combining single-cell bisulfite sequencing[22][23] to single cell RNA-Seq.[24] Other techniques to query the epigenome, as single-cell ATAC-Seq[25] and single-cell Hi-C[26] also exist.
A different, but related, challenge is the integration of proteomic and transcriptomic data.[27][28] One approach to perform such measurement is to physically separate single-cell lysates in two, processing half for RNA, and half for proteins.[27] The protein content of lysates can be measured by proximity extension assays (PEA), for example, which use DNA-barcoded antibodies.[29] A different approach uses a combination of heavy-metal RNA probes and protein antibodies to adapt mass cytometry for multiomic analysis.[28]
Related to Single-cell multiomics is the field of Spatial Omics which assays tissues through omics readouts that preserve the relative spatial orientation of the cells in the tissue. The number of Spatial Omics methods published still lags behind the number of methods published for Single-Cell multiomics, but the numbers are catching up (Single-cell and Spatial methods).
Multiomics and machine learning
In parallel to the advances in high-throughput biology, machine learning applications to biomedical data analysis are flourishing. The integration of multi-omics data analysis and machine learning has led to the discovery of new biomarkers.[30][31][32] For example, one of the methods of the mixOmics project implements a method based on sparse Partial Least Squares regression for selection of features (putative biomarkers).[33] A unified and flexible statistical framewok for heterogeneous data integration called "Regularized Generalized Canonical Correlation Analysis" (RGCCA [34][35][36][37]) enables identifying such putative biomarkers. This framework is implemented and made freely available within the RGCCA R package .
Multiomics in health and disease
Summarize
Perspective

Multiomics currently holds a promise to fill gaps in the understanding of human health and disease, and many researchers are working on ways to generate and analyze disease-related data.[38] The applications range from understanding host-pathogen interactions and infectious diseases,[39][40] cancer,[41] to understanding better chronic and complex non-communicable diseases[42] and improving personalized medicine.[43]
Integrated Human Microbiome Project
The second phase of the $170 million Human Microbiome Project was focused on integrating patient data to different omic datasets, considering host genetics, clinical information and microbiome composition.[44][45] The phase one focused on characterization of communities in different body sites. Phase 2 focused in the integration of multiomic data from host & microbiome to human diseases. Specifically, the project used multiomics to improve the understanding of the interplay of gut and nasal microbiomes with type 2 diabetes,[46] gut microbiomes and inflammatory bowel disease[47] and vaginal microbiomes and pre-term birth.[48]
Systems Immunology
The complexity of interactions in the human immune system has prompted the generation of a wealth of immunology-related multi-scale omic data.[49] Multi-omic data analysis has been employed to gather novel insights about the immune response to infectious diseases, such as pediatric chikungunya,[50] as well as noncommunicable autoimmune diseases.[51] Integrative omics has also been employed strongly to understand effectiveness and side effects of vaccines, a field called systems vaccinology.[52] For example, multiomics was essential to uncover the association of changes in plasma metabolites and immune system transcriptome on response to vaccination against herpes zoster.[53]
List of software used for multi-omic analysis
The Bioconductor project curates a variety of R packages aimed at integrating omic data:
- omicade4, for multiple co-inertia analysis of multi omic datasets[54]
- MultiAssayExperiment, offering a bioconductor interface for overlapping samples[55]
- IMAS, a package focused on using multi omic data for evaluating alternative splicing[56]
- bioCancer, a package for visualization of multiomic cancer data[57]
- mixOmics, a suite of multivariate methods for data integration[33]
- MultiDataSet, a package for encapsulating multiple data sets[58]
The RGCCA package implements a versatile framework for data integration. This package is freely available on the Comprehensive R Archive Network (CRAN).
The OmicTools[59] database further highlights R packages and othertools for multi omic data analysis:
- PaintOmics, a web resource for visualization of multi-omics datasets[60][61]
- SIGMA, a Java program focused on integrated analysis of cancer datasets[62]
- iOmicsPASS, a tool in C++ for multiomic-based phenotype prediction[63]
- Grimon, an R graphical interface for visualization of multiomic data[64]
- Omics Pipe, a framework in Python for reproducibly automating multiomic data analysis[65]
Multiomic Databases
Summarize
Perspective
A major limitation of classical omic studies is the isolation of only one level of biological complexity. For example, transcriptomic studies may provide information at the transcript level, but many different entities contribute to the biological state of the sample (genomic variants, post-translational modifications, metabolic products, interacting organisms, among others). With the advent of high-throughput biology, it is becoming increasingly affordable to make multiple measurements, allowing transdomain (e.g. RNA and protein levels) correlations and inferences. These correlations aid the construction or more complete biological networks, filling gaps in our knowledge.
Integration of data, however, is not an easy task. To facilitate the process, groups have curated database and pipelines to systematically explore multiomic data:
- Multi-Omics Profiling Expression Database (MOPED),[66] integrating diverse animal models,
- The Pancreatic Expression Database, integrating data related to pancreatic tissue,
- LinkedOmics,[67][68] connecting data from TCGA cancer datasets,
- OASIS,[69] a web-based resource for general cancer studies,
- BCIP,[70] a platform for breast cancer studies,
- C/VDdb,[71] connecting data from several cardiovascular disease studies,
- ZikaVR,[72] a multiomic resource for Zika virus data
- Ecomics,[73] a normalized multi-omic database for Escherichia coli data,
- GourdBase,[74] integrating data from studies with gourd,
- MODEM,[75] a database for multilevel maize data,
- SoyKB,[76] a database for multilevel soybean data,
- ProteomicsDB,[77] a multi-omics and multi-organism resource for life science research
See also
References
Wikiwand - on
Seamless Wikipedia browsing. On steroids.