What we hope to learn:
- Identification of biases in representative databases (e.g. GenBank, SwissProt)
- Development of metrics for measurement of bias
- Creation of datasets
- Real datasets (reflect real data)
- Simulated datasets (allow you to
- Test effects of biases on real and simulated datasets
- Improvement of existing methods
- Which kinds of biases exist?
- Which ones are important and which can we ignore?
- How do we make better datasets?
- How do we improve analytical