Bulk RNA-seq on Polly

Polly delivers the highest quality Bulk RNA-seq datasets to fit diverse analysis methods & pipelines. All datasets are Polly Verified, i.e. harmonized with a configurable, granular & transparent curation process.

How Does Bulk RNA-seq Data on Polly Become ML-ready?

Why Access Bulk RNA-seq Datasets on Polly?

Configure Curation to Fit Your Analysis Needs

Customize the processing pipeline used, metadata fields annotated, or ontologies applied to fit unique analysis fields.

Request additional curation of cohort-based comparisons within bulk RNA-seq datasets. Every comparison comprises pre-computed differential expression results to streamline your search for biologically relevant signatures.

Use Data You Can Trust

Our experts implement ~50 QA checks to perform batch effect correction, metadata validation, and remove technical artifacts & variations in every dataset.

The data normalization methods or QC metrics used on Polly are not a black box. Learn how each Bulk RNA-seq dataset was processed by downloading a detailed QA/QC report from Polly.

Work With Data in Flexible Ways

Perform gene, pathway, or metadata-based queries to find and explore the data you need.

Utilize interactive volcano plots, heatmaps, and more to visualize enriched genes and pathways.

Stream Polly harmonized Bulk RNA-seq datasets to your preferred tools for advanced analyses.

Polly Verified – Our Quality Guarantee

We use at least 50+ QA checks to ensure every dataset is:


Data validation checks ensure that all dataset and sample-level metadata annotations contain non-NULL and non-blank values.


Rigorous QC checks to ensure metadata attributes are human-readable and accurately assigned
at all levels (dataset, cell).


Normalization and batch Effect correction are applied wherever necessary to eliminate technical variations and enable meaningful comparisons between samples.


Doublets, which can arise during sample preparation and confound
analysis, are identified
and removed.


Poor-quality samples, genes, and cohorts are filtered out. Genes that drive biological variation are retained and used for downstream analyses.

Request Demo