GENSCAN output

GENSCANW output for sequence 15:28:02



GENSCAN 1.0	Date run: 31-Mar-105	Time: 15:28:04

Sequence 22 : 29319 bp : 58.25% C+G : Isochore 4 (57 - 100 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:


Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1791   1929  139  2  1   89   72   154 0.916  12.34
 1.02 Intr +   3499   3786  288  2  0   75   61   103 0.830   4.78
 1.03 Intr +   4167   4475  309  1  0   33  105   225 0.595  16.25
 1.04 Intr +   5703   6078  376  1  1   55   71   699 0.051  60.28
 1.05 Intr +   6943   7094  152  1  2   96   39    88 0.618   5.70
 1.06 Intr +   8008   8140  133  2  1   14  105   278 0.546  22.98
 1.07 Intr +   8953   9124  172  1  1   86   68   290 0.955  27.94
 1.08 Intr +   9705   9911  207  2  0   96   76   264 0.612  26.49
 1.09 Intr +  10556  10623   68  1  2   83  105   129 0.947  12.95
 1.10 Intr +  10700  10800  101  2  2  112   97   147 0.999  18.92
 1.11 Term +  11187  11665  479  1  2  113   41   594 0.917  53.32
 1.12 PlyA +  12107  12112    6                              -4.33

 2.06 PlyA -  12154  12149    6                               1.05
 2.05 Term -  13152  13047  106  2  1   43   45    80 0.046  -1.99
 2.04 Intr -  15369  15285   85  1  1   98   72    24 0.126   1.83
 2.03 Intr -  15610  15557   54  2  0   80   72    53 0.201   2.99
 2.02 Intr -  18119  18024   96  0  0   78   57    43 0.142   1.34
 2.01 Init -  20643  20562   82  0  1   74   75    71 0.167   3.63
 2.00 Prom -  22343  22304   40                               1.34

 3.00 Prom +  26501  26540   40                              -4.26
 3.01 Init +  29097  29284  188  2  2  109   92   105 0.870   9.70

Click here to view a PDF image of the predicted gene(s)
Click here for a PostScript image of the predicted gene(s)

Predicted peptide sequence(s):


>22|GENSCAN_predicted_peptide_1|807_aa
MEAVEAAAPAGGWDGLALRGLPGPFSRGLLEGPSVEQRGPGVRNQARRSEERALPTLRMY
KRPGQRRAVRASARDARARGLRSLGSVPLLGSPATIVSQLQGAVGHLLGWVLSCCAKEQR
TLRPWGSCQWPVREPKLPSPHPGAPGSEWHHRPPPPDSVGWAGLGAVRLGTRSRRRRRPG
RRAPMDARSPLSPRASAFSIASLVAAEAAERTAHQGSGSSDRVKLRWLPGSPAGMHFSTV
TRDMEAFTASSLSSLGAAGGFPGAASPGADPYGPREPPPPPPRYDPCAAAAPGAPGPPPP
PHAYPFAPAAGAATSAAAEPEGPGASCAAAAKAPVKKNAKVAGVSVQLEMKALWDEFNQL
GTEMIVTKAGRPSLLSPGTQALLRTAAAQPLAQASPARALSALGFTREPTSRSTLQPAEA
SARVTQGLITPRRMFPTFQVKLFGMDPMADYMLLMDFVPVDDKRYRYAFHSSSWLVAGKA
DPATPGRVHYHPDSPAKGAQWMKQIVSFDKLKLTNNLLDDNGHILSPAPPLELIPHLVFQ
IILNSMHRYQPRFHVVYVDPRKDSEKYAEENFKTFVFEETRFTAVTAYQNHRITQLKIAS
NPFAKGFRDCDPEDWPRNHRPGALPLMSAFARSRNPVASPTQPSGTEKDAAEARREFQRD
AGGPAVLGDPAHPPQLLARVLSPSLPGAGGAGGLVPLPGAPGGRPSPPNPELRLEAPGAS
EPLHHHPYKYPAAAYDHYLGAKSRPAPYPLPGLRGHGYHPHAHPHHHHHPVSPAAAAAAA
AAAAAAAANMYSSAGAAPPGSYDYCPR

>22|GENSCAN_predicted_peptide_2|140_aa
MRPHWGVWAHLGRGRSAERALGHRFMPDLPLGPEQQLLSPELGPCRLSQEVAEQAGFQIV
RGEAASALGNCGYVEFSAQGATKQTRLVPQPSPRVLLPLHLQCFPGANMYTKNRSARVDI
WKLLAVPSERWHSGCRYHVG

>22|GENSCAN_predicted_peptide_3|63_aa
MERPAAAAGPGLCSGALPGIISRHRLTPRCLRAKPNLPAGTPTARGGGRFSWSQTQGRLL
ACX


Explanation

Gn.Ex : gene number, exon number (for reference)
Type  : Init = Initial exon (ATG to 5' splice site)
        Intr = Internal exon (3' splice site to 5' splice site)
        Term = Terminal exon (3' splice site to stop codon)
        Sngl = Single-exon gene (ATG to stop)
        Prom = Promoter (TATA box / initation site)
        PlyA = poly-A signal (consensus: AATAAA)
S     : DNA strand (+ = input strand; - = opposite strand)
Begin : beginning of exon or signal (numbered on input strand)
End   : end point of exon or signal (numbered on input strand)

Len   : length of exon or signal (bp)
Fr    : reading frame (a forward strand codon ending at x has frame x mod 3)
Ph    : net phase of exon (exon length modulo 3)
I/Ac  : initiation signal or 3' splice site score (tenth bit units)
Do/T  : 5' splice site or termination signal score (tenth bit units)
CodRg : coding region score (tenth bit units)
P     : probability of exon (sum over all parses containing exon)
Tscr  : exon score (depends on length, I/Ac, Do/T and CodRg scores)

Comments

The SCORE of a predicted feature (e.g., exon or splice site) is a
log-odds measure of the quality of the feature based on local sequence
properties. For example, a predicted 5' splice site with
score > 100 is strong; 50-100 is moderate; 0-50 is weak; and
below 0 is poor (more than likely not a real donor site).

The PROBABILITY of a predicted exon is the estimated probability under
GENSCAN's model of genomic sequence structure that the exon is correct.
This probability depends in general on global as well as local sequence
properties, e.g., it depends on how well the exon fits with neighboring
exons.  It has been shown that predicted exons with higher probabilities
are more likely to be correct than those with lower probabilities.