Demo dataset¶
We have put together a small demo dataset that is used in the Tour and in Tutorials. It consists of feature annotations, ribosome profiling data, and RNA-seq from the merlin (laboratory) strain of human cytomegalovirus (hCMV).
Downloads:
demo dataset part one, for all of the tutorials
demo dataset part two, specifically used in Gene expression analysis.
Part 1 includes the following files:
Filename
Contents
Source
merlin_NC006273-2.fa
Sequence of hCMV merlin strain
merlin_orfs.bed
,merlin_orfs.gtf
Coding region models for hCMV strain, plus estimated UTRs
[SGWM+12] (CDS). 5’ UTRs estimated as 50 nt upstream of CDS. 3’ UTRs estimated as 100 nt downstream of CDS.
SRR609197_riboprofile_5hr_rep1.bam
Ribosome profiling data, 5 hours post hCMV infection, aligned to hCMV merlin strain genome sequence
[SGWM+12], raw data available at SRA, accession no. SRR609197
SRR592963_rnaseq_5hr_rep1.bam
RNA-seq data, 5 hours post CMV infection, aligned to hCMV merlin strain genome sequence
[SGWM+12], raw data available at SRA, accession no. SRR592963
Part 2 includes further replicates, as well as timepoint data from 24 hours post-infection.