makeTxDbFromGFF {GenomicFeatures}R Documentation

Make a TxDb object from annotations available as a GFF3 or GTF file

Description

The makeTxDbFromGFF function allows the user to make a TxDb object from transcript annotations available as a GFF3 or GTF file.

Usage

makeTxDbFromGFF(file,
                format=c("gff3","gtf"),
                exonRankAttributeName=NA,
                gffGeneIdAttributeName=NA,
                chrominfo=NA,
                dataSource=NA,
                organism=NA,
                circ_seqs=DEFAULT_CIRC_SEQS,
                miRBaseBuild=NA,
                useGenesAsTranscripts=FALSE,
                gffTxName="mRNA",
                species=NA)

Arguments

file

path/file to be processed

format

"gff3" or "gtf" depending on which file format you have to process

exonRankAttributeName

ignored and deprecated

gffGeneIdAttributeName

ignored and deprecated

chrominfo

data frame containing information about the chromosomes. Will be passed to the internal call to makeTxDb. See ?makeTxDb for the details.

dataSource

Where did this data file originate? Please be as specific as possible.

organism

What is the Genus and species of this organism. Please use proper scientific nomenclature for example: "Homo sapiens" or "Canis familiaris" and not "human" or "my fuzzy buddy". If properly written, this information may be used by the software to help you out later.

circ_seqs

a character vector to list out which chromosomes should be marked as circular.

miRBaseBuild

specify the string for the appropriate build Information from mirbase.db to use for microRNAs. This can be learned by calling supportedMiRBaseBuildValues. By default, this value will be set to NA, which will inactivate the microRNAs accessor.

useGenesAsTranscripts

ignored and deprecated

gffTxName

ignored and deprecated

species

deprecated in favor of organism

Details

makeTxDbFromGFF is a convenience function that feeds data from the parsed file to the makeTxDbFromGRanges function.

Value

A TxDb object.

Author(s)

M. Carlson and H. Pages

See Also

Examples

## TESTING GFF3
gffFile <- system.file("extdata","GFF3_files","a.gff3",package="GenomicFeatures")
txdb <- makeTxDbFromGFF(file=gffFile,
            format="gff3",
            dataSource="partial gtf file for Tomatoes for testing",
            organism="Solanum lycopersicum")

## TESTING GTF, this time specifying the chrominfo
gtfFile <- system.file("extdata","GTF_files","Aedes_aegypti.partial.gtf",
                       package="GenomicFeatures")
chrominfo <- data.frame(chrom = c('supercont1.1','supercont1.2'),
                        length=c(5220442, 5300000),
                        is_circular=c(FALSE, FALSE))
txdb2 <- makeTxDbFromGFF(file=gtfFile,
             format="gtf",
             chrominfo=chrominfo,
             dataSource=paste("ftp://ftp.ensemblgenomes.org/pub/metazoa/",
                              "release-13/gtf/aedes_aegypti/",sep=""),
             organism="Aedes aegypti")

[Package GenomicFeatures version 1.20.1 Index]