Command-line scriptsΒΆ

plastid includes the following command-line scripts. Click on a name below for more information:

Analysis of sequencing or quantitative data
counts_in_region Count the number of read alignments covering arbitrary regions of interest in the genome, and calculate read densities (in reads per nucleotide and in RPKM) over these regions
cs Count the number of read alignments and calculate read densities (in RPKM) specifically for genes and sub-regions (5’ UTR, CDS, 3’ UTR)
get_count_vectors Fetch vectors of counts or other quantitative data at each nucleotide position in one or more regions of interest, saving each vector as its own line-delimited text file
make_wiggle Create wiggle or bedGraph files from alignment files after applying a read mapping rule (e.g. to map ribosome-protected footprints at their P-sites), for visualization in a genome browser
metagene Compute a metagene profile of read alignments, counts, or quantitative data over one or more regions of interest, optionally applying a mapping rule
phase_by_size Estimate sub-codon phasing in ribosome profiling data
psite Estimate position of ribosomal P-site within ribosome profiling read alignments as a function of read length
Generating or modifying genome annotations
crossmap Generate a mask file annotating regions of the genome that fail to produce uniquely mapping reads under various alignment criteria, so that they may be excluded from future analyses
gff_parent_types Examine parent-child relationships of features in a GFF3 file
reformat_transcripts Convert transcripts between BED, BigBed, GTF2, GFF3, and PSL formats
findjuncs Find all unique splice junctions in one or more transcript annotations, and optionally export these in Tophat‘s .juncs format
slidejuncs Compare splice junctions discovered in data to those in an annotation of known splice junctions, and, if possible with equal sequence support, slide discovered junctions to compatible known junctions.