What kinds of biases exist in biological databases?
Model species
Coding vs. noncoding
Strongly-expressed genes
Redundancy
Length
cDNA vs. genomic
Sampling error
Automated annotation favors
known protein families
smaller genes with few exons