A dataset containing the gene information in GFF3 format. The variables are as follows:

data(gff_gene)

Format

A data frame with 1236881 rows and 9 variables

Details

  • seqid. name of the chromosome or scaffold; here, of Panicum virgatum.

  • source. name of the program that generated this feature.

  • type. type of feature. Term or accession from the SOFA sequence ontology.

  • start. Start position of the feature, with sequence numbering starting at 1.

  • end. End position of the feature, with sequence numbering starting at 1.

  • score. A floating point value.

  • strand. defined as + (forward) or - (reverse).

  • phase. One of '0', '1' or '2'. '0' indicates that the first base of the feature is the first base of a codon, '1' that the second base is the first base of a codon, and so on.

  • attributes. A semicolon-separated list of tag-value pairs, providing additional information about each feature. Some of these tags are predefined, e.g. ID, Name, Alias, Parent.

Author

Alice MacQueen alice.macqueen@utexas.edu