update March 1, 2007
NAME
estano.csh - Identify ESTs using blastx and write a comma-separated value file with information on the best hit for each EST.
SYNOPSIS
estano.csh outfile.csv

DESCRIPTION

estano.csh reads all FASTA-formatted nucleotied sequence files in the current working directory with the file extension .fsa. Each file must contain a single sequence. For each file, blastx is run at NCBI using the blastcl3 client. The GI number of the blastx hit with the highest E-value is used to obtain additional information  from the SeqHound server at SLRI [http://www.blueprint.org/seqhound/]. Output is sent to a .csv file, which can be directly imported by most spreadsheet programs.

outfile.csv contains a set of lines containing comma-separated fields. The following fields are included in the file:

(1) EST name:                  The name of the EST in the .fsa input file
(2) GI number:                NCBI GI number
(3) Taxonomy name:        The NCBI Taxonomy name, listing Genus and Species corresponding to the GI  number.
(4) Protein name:            The NCBI protein name corresponding to the GI number.
(5) 3D Structure IDs:      The semicolon-seperated list of 3D structure IDs retrieved from SeqHound API
                                         corresponding to  the GI number with E-value of 1011 or higher
(6) E-value:                      E-value for the highest blastx hit


BUGS

1. A number of improvements might be useful:
- ability to specify blast parameters
- ability to specify a directory from which EST sequence files are to be read
- ability to read more than one sequence from a file

2. For almost any realistic EST project, it is impractical to do all of the BLAST searches at NCBI. Rather, blastall or fasty3 would be used to search a locally-installed database.

SEE ALSO
blastcl3, leash
 
AUTHORS
Dr. Brian Fristensky
Department of Plant Science
University of Manitoba
Winnipeg, MB  Canada R3T 2N2
frist@cc.umanitoba.ca
http://home.cc.umanitoba.ca/~frist

Dr. Binhua Liang
Department of Medical Microbiology
University of Manitoba
Winnipeg, MB Canada R3E 0W3
liangb0@cc.umanitoba.ca