Genome Prairie

  Most query systems are oriented toward retrieving small data entries, not generating statistics on the database as a whole.

Example: NCBI Entrez

What we need is a way to run through large sections of a database and tabulate statistics. For example:
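A minimal sketch of such a whole-database pass, assuming the data are available as GenBank flat-file text. It streams through the records once and counts entries per organism. The function names and the three sample records are hypothetical and serve only as illustration.

```python
import io
from collections import Counter


def iter_organisms(handle):
    """Yield the organism name from each ORGANISM line of a GenBank flat file."""
    prefix = "  ORGANISM"  # in the flat-file format this keyword is indented two spaces
    for line in handle:
        if line.startswith(prefix):
            yield line[len(prefix):].strip()


def tabulate(values):
    """Count how often each value occurs, reading the input only once."""
    return Counter(values)


# Hypothetical, heavily shortened records used only for illustration.
SAMPLE = """\
LOCUS       REC1
  ORGANISM  Homo sapiens
//
LOCUS       REC2
  ORGANISM  Mus musculus
//
LOCUS       REC3
  ORGANISM  Homo sapiens
//
"""

counts = tabulate(iter_organisms(io.StringIO(SAMPLE)))
for organism, n in counts.most_common():
    print(organism, n)
```

Because the records are consumed as a stream, the same loop could be pointed at a file of any size without holding the whole database in memory.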


Data pipeline for generating statistics on databases

We use the GDE interface by Steven Smith to call web services which provide the raw data.


In turn, the actual GenBank entries corresponding to the list of GI numbers can be retrieved:
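The slides do not say which web service performs this retrieval. As one possibility, the sketch below builds a request URL for the NCBI E-utilities EFetch service, which returns GenBank flat-file text for a list of identifiers. The helper name efetch_url and the GI numbers are hypothetical, and no network request is made here.

```python
from urllib.parse import urlencode

# Base address of the NCBI E-utilities EFetch service.
EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_url(gi_numbers, db="nucleotide"):
    """Build an EFetch URL requesting GenBank flat-file text for the given IDs."""
    params = {
        "db": db,
        "id": ",".join(str(gi) for gi in gi_numbers),  # comma-separated ID list
        "rettype": "gb",     # GenBank flat-file record
        "retmode": "text",   # plain text rather than XML
    }
    # Keep commas unescaped so the ID list stays readable in the URL.
    return EFETCH + "?" + urlencode(params, safe=",")


# Hypothetical GI numbers, for illustration only.
url = efetch_url([123, 456])
print(url)
```

The resulting URL could then be opened with urllib.request.urlopen to obtain the entries themselves.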

The new GDE window has menus for working with sequence data. In this fashion, the same GDE interface can be used to go back and forth between different types of data.

GDE is designed for rapid addition of new functions. GDE itself does nothing but display data and call external programs. Therefore, any existing program can be added to GDE's functionality, simply by adding a menu specification.
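The idea of extending a program purely through menu specifications can be sketched as a table that maps menu labels to command templates. This is not GDE's actual menu file syntax; the MENU table, the {infile} placeholder, and build_command are hypothetical names chosen for illustration.

```python
import shlex

# Each menu item is only a label plus a command-line template.
# Adding a function means adding one entry; the interface code does not change.
MENU = {
    "Count lines": "wc -l {infile}",
    "Sort records": "sort {infile}",
}


def build_command(menu, item, infile):
    """Expand the template for a menu item into an argument list."""
    template = menu[item]
    # Quote the file name so spaces survive, then split into arguments.
    return shlex.split(template.format(infile=shlex.quote(infile)))


# An existing program becomes available by adding a single specification.
MENU["Reverse lines"] = "rev {infile}"

print(build_command(MENU, "Count lines", "data.txt"))
print(build_command(MENU, "Reverse lines", "my data.txt"))
```

The argument list returned by build_command could be handed to subprocess.run to launch the external program on the selected data.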


 
 
FRISTENSKY LAB