Supported Data Types
The following Illumina and thrid-party data types are supported in Connected Multiomics:
Bulk RNA-Seq
Illumina
Illumina Stranded Total RNA Prep Ligation with Ribo-Zero Plus
Illumina Stranded mRNA Prep Ligation
TruSeq Stranded Total RNA Library Prep Gold
TruSeq Stranded Total RNA Library Prep Globin
TruSeq Stranded mRNA Library Prep
.sf
.sf.gz
Multiple samples per file
Included in demo data
Bulk RNA-Seq
Various
Various
quant.genes.sf
Multiple samples per file
Gene Expression Omnibus - Search GSM7103647
Bulk Methylation
Illumina
Illumina 5-base DNA Prep
Illumina 5-base DNA Prep with Enrichment
.CX_report.txt.gz
.methyl_metrics.csv
.mapping_metrics.csv
.wgs_coverage_metrics.csv
.M-bias.txt
One sample per set of files
Included in demo data
Microarray Methylation
Illumina
N/A
.idat
One sample per set of files
Included in demo data
Requires 2 .idat
files per sample. Red.idat
and Grn.idat
Bulk Proteomics
Illumina
Illumina Protein Prep 9.5K Plasma
Illumina Protein Prep 9.5K Serum
.adat
Multiple samples per file
Included in demo data
Single-cell scATAC-Seq
10X Genomics
.h5
.csv
fragments.tsv.gz
fragments.tsv.gz.tbi
peaks.bed
One sample per set of files
Each file must begin with the sample name. Multiple files can be ingested at the same time
Single-cell scATAC-Seq
Various
Various
.qs
or .rds
One sample per file
Not available
R object for data processed by Seurat (ATAC)
Single-cell
sc-RNA
Illumina
.barcodes.tsv.gz
.features.tsv.gz
.matrix.mtx.gz
One sample per set of files
Included in demo data
Any additional descriptors prior to main extensions are supported (eg .scRNA.filtered.matrix.mtx). Each file name must begin with the sample name. Multiple samples can be ingested at the same time.
Single-cell
sc-RNA
10X Genomics
Parse Biosciences
.tsv.gz
or .csv.gz
.mtx.gz
One sample per set of files
Sparse matrix output. Each sample has 3 files: two .csv with one .mtx or two .tsv with one .mtx. Each file name must begin with the sample name. Multiple samples can be ingested at the same time.
Single-cell
sc-RNA
10X Genomics
Chromium assays including: Universal 3' Gene Expression
.h5
One sample per file
This compressed binary format is preferred for 10x Genomics Cell Ranger output. One filtered .h5 file per sample. Multiple samples can be ingested at the same time.
Single-cell
sc-RNA
Various
Various
.h5ad
One sample per file
Not available
AnnData object in the h5ad file format
Single-cell
sc-RNA
Various
Various
.qs
or .rds
One sample per file
Gene Expression Omnibus - Search GSE186892
R object for data processed by Seurat (RNA)
Spatial Transcriptomics
Illumina
Ilumina Spatial Transcriptome Prep
DRAGEN Spatial Transcriptome
.h5ad
.ome.tiff
One sample per set of files
Included in demo data
At least one .ome.tiff and multiple .h5ad files are required.
Spatial Transcriptomics
10X Genomics
.h5
or
barcodes.tsv.gz
features.tsv.gz
matrix.mtx.gz
or
barcodes.csv.gz
features.csv.gz
matrix.mtx.gz
with _spatial.tar.gz
Optional: .tif
One sample per set of files
Either count matrix data as 1 filtered .h5
file per sample or sparse matrix files for each sample as 3 files (two .csv
with one .mtx
or two .tsv
with one .mtx
for each sample). The spatial output files should be in compressed format (.zip
). The high resolution image (.tif
)can be uploaded and is optional. The spatial result file name must begin with the sample name. Only 1 sample can be ingested at a time.
Spatial Transcriptomics
10X Genomics
Xenium assays including:
In Situ Gene Expression
cell_feature_matrix.h5
cells.csv.gz
cell_boundaries.csv.gz
nucleus_boundaries.csv.gz transcripts.csv.gz
/ transcripts.parquet.csv.gz
morphology_focus.ome.tif
One sample per set of files
Includes the unzipped Xenium Output Bundle with the preferred input image file (.tiff
) for each sample. The .h5
file name must begin with the sample name. Only 1 sample can be ingested at a time.
Metadata
Various
Various
Various
.tsv
.csv
Multiple samples per file
N/A
See Sample Metadata for more detail on format.
Last updated
Was this helpful?