Below we load the file genes.rda
containing a named vector genes
. This vector contains the function descriptions of a set of genes. The names of this vector give the COG codes of these genes:
> head(genes, n = 10)
RCOG3478
"Predicted nucleic-acid-binding protein containing a Zn-ribbon domain "
CCOG1031
"Uncharacterized Fe-S oxidoreductase "
SCOG3771
"Predicted membrane protein "
KCOG1191
"DNA-directed RNA polymerase specialized sigma subunit "
SCOG5230
"Uncharacterized conserved protein "
JCOG0532
"Translation initiation factor 2 (IF-2; GTPase) "
CCOG1905
"NADH:ubiquinone oxidoreductase 24 kD subunit "
FCOG0461
"Orotate phosphoribosyltransferase "
SCOG2904
"Uncharacterized protein conserved in bacteria "
ECOG0509
"Glycine cleavage system H protein (lipoate-binding) "
...
Question: Use this vector to create a character vector geneclass
with the values “Predicted”, “Uncharacterized” and “Described”. These are defined as follows: