Binhua Liang

Assignment 3-- Data Pipelines

March 4, 2007

This assignment is to modify "estano.csh" codes and added 3 fields:

(1) Protein name
(2) A semicolon-seperated list of 3D structure IDs
(3) E-value

The following are the major modifications for check in estano.csh file (comments inside):

(1) set DEBUG = 0
(2) 'PROTEINNAME', '3D_IDS', and 'E-VALUE'  are added to the first line of outfile contains colum headings
(3) all codes to "geting protein name, geting 3D Structure IDs, and geting E-Value" are seperated by /==/
(4) add $SEP$ProteinName$SEP$PIDS$SEP$EVALUE to print output line to outfile.tmp
(5) translate $SET into comma character and change outfile to .csv file: tr $SEP ',' <outfile.tmp>outfile.csv

The followings are the required to hand in:

Ben's Modified estano.csh

Ben's Modified estano.html

Output.csv



****please note: You will get different outputs when runing this program set DEBUG = 0 or 1. Here, I run it using 0!




Return to My_Web_Site