Table 1. PIR
flatfiles used in BIRCH |
|
description |
file(s) |
completely merged, classified and
annotated sequences |
pir1.dat.Z |
completely merged, classified and
annotated sequences |
pir2.dat.Z |
unverified and unannotated sequences |
pir3.dat.Z |
sequences neither naturally occurring
nor naturally expressed, but fully annotated. Includes sequences known to be conceptual translations of pseudogenes, mistranslations or otherwise unexpressed potential ORF's that may have mistakenly been assigned identifiers as coding regions by other databases. It also includes engineered or synthetic sequences, sequences resulting from fusion, cross-over or frame-shift mutations, and sequences of natural polypeptides that are not synthesized on ribosomes. |
pir4.dat.Z |
Header files listing current description of
pirx.dat files |
pirx.nam |
Release notes |
0codata.txt |
Formal description of file format |
0protein_doc.codata |
Table 2. A sample filelist |
0codata.txt 0protein_doc.codata pir |
cd $pir
./ pirupdate filelist &
0codata.txtIf these numbers are equal, the name of the file is written to files_received. Otherwise, the name of the file is written to files_missed. By default, files remain in the $PIR directory. Files beginning with "0" are documentation files, and are moved to $doc/PIR. The full listing of files for this division are written, and then the names of each file are echoed to the output. Before beginning the download, will remove the current files for this division, if they exist, as a way of making sure that enough space is available.
ORIGINAL= 3358
RECEIVED= 3358
pir
-rw-r--r-- 1 root system 21091315 Feb 3 06:37 pir1.dat.Z
-rw-r--r-- 1 root system 1243 Feb 3 06:37 pir1.nam
-rw-r--r-- 1 root system 177763121 Feb 3 06:40 pir2.dat.Z
-rw-r--r-- 1 root system 1248 Feb 3 06:40 pir2.nam
-rw-r--r-- 1 root system 11135 Feb 3 06:40 pir3.dat.Z
-rw-r--r-- 1 root system 1233 Feb 3 06:40 pir3.nam
-rw-r--r-- 1 root system 221983 Feb 3 06:40 pir4.dat.Z
-rw-r--r-- 1 root system 1897 Feb 3 06:40 pir4.nam
pir1.dat.Z
pir1.nam
pir2.dat.Z
pir2.nam
pir3.dat.Z
pir3.nam
pir4.dat.Z
pir4.nam
Removing file(s) for pir1, if they exist
No match
ORIGINAL= 21091315
RECEIVED= 21091315
etc ...
How FASTA finds database files
FASTA reads a list of database files from the file 'fastgbs'. The
location of fastgbs is specified by the environment variable
$FASTLIBS, which is set to $BIRCH/dat/fasta/fastgbs. A typical
fastgbs file is shown below:
PIR Protein Identification Resource 75.02 $00@/home/psgendb/BIRCHDEV/dat/fasta/pir.fil
GenPept GenBank 133.0 CDS translations$01/home/psgendb/BIRCHDEV/GenPept/genpept.wrp
GB133 Primate$1P@/home/psgendb/BIRCHDEV/dat/fasta/gbpri.fil
GB133 Rodent$1R@/home/psgendb/BIRCHDEV/dat/fasta/gbrod.fil
GB133 other Mammal$1M@/home/psgendb/BIRCHDEV/dat/fasta/gbmam.fil
GB133 verteBrates$1B@/home/psgendb/BIRCHDEV/dat/fasta/gbvrt.fil
GB133 Invertebrates$1I@/home/psgendb/BIRCHDEV/dat/fasta/gbinv.fil
GB133 pLants$1L@/home/psgendb/BIRCHDEV/dat/fasta/gbpln.fil
GB133 Expressed Sequece Tags$1E@/home/psgendb/BIRCHDEV/dat/fasta/gbest.fil
GB133 Bacteria$1T@/home/psgendb/BIRCHDEV/dat/fasta/gbbct.fil
GB133 Viral$1V@/home/psgendb/BIRCHDEV/dat/fasta/gbvrl.fil
GB133 Phage$1G@/home/psgendb/BIRCHDEV/dat/fasta/gbphg.fil
GB133 Synthetic$1Y@/home/psgendb/BIRCHDEV/dat/fasta/gbsyn.fil
GB133 Unannotated$1U@/home/psgendb/BIRCHDEV/dat/fasta/gbuna.fil
GB133 Patented$1D@/home/psgendb/BIRCHDEV/dat/fasta/gbpat.fil
GB133 STS$1X@/home/psgendb/BIRCHDEV/dat/fasta/gbsts.fil
GB133 HTG$1h@/home/psgendb/BIRCHDEV/dat/fasta/gbhtg.fil
GB133 GSS$1s@/home/psgendb/BIRCHDEV/dat/fasta/gbgss.fil
GB133 All sequences (VERY long!)$1A@/home/psgendb/BIRCHDEV/dat/fasta/genbank.fil
pir.fil lists the files comprising PIR. An example is shown in Table 3.
Table 3. pir.fil |
</home/psgendb/BIRCHDEV/PIR |
This file lists the location and names of the 4 files in PIR. A complete description of syntax for these files can be found in $BIRCH/doc/fasta/fasta3x.asc.
Finally remember to edit fastgbs, using Find/Replace in
your text editor to change the PIR Release number to the current
release.
The default FASTA menus for GDE are located in $dat/GDE/makemenus/menus/Database. These menus only have one database choice for User-created files. The FASTA menus in $birch/local/dat/GDE/makemenus/menus/Database have additional menu choices for each database file listed in $BIRCH/dat/fasta/fastgbs. All we need to do is to edit $birch/local/dat/GDE/makemenus/menus/menulist to choose the local menu item files. This is done by adding lines to menulist, such as
DatabaseNow, re-run makemenus.py to update the .GDEmenus files
FASTAPROTEIN
FASTXY