GENSCAN 1.0 Date run: 31-Mar-105 Time: 15:28:04 Sequence 22 : 29319 bp : 58.25% C+G : Isochore 4 (57 - 100 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1791 1929 139 2 1 89 72 154 0.916 12.34 1.02 Intr + 3499 3786 288 2 0 75 61 103 0.830 4.78 1.03 Intr + 4167 4475 309 1 0 33 105 225 0.595 16.25 1.04 Intr + 5703 6078 376 1 1 55 71 699 0.051 60.28 1.05 Intr + 6943 7094 152 1 2 96 39 88 0.618 5.70 1.06 Intr + 8008 8140 133 2 1 14 105 278 0.546 22.98 1.07 Intr + 8953 9124 172 1 1 86 68 290 0.955 27.94 1.08 Intr + 9705 9911 207 2 0 96 76 264 0.612 26.49 1.09 Intr + 10556 10623 68 1 2 83 105 129 0.947 12.95 1.10 Intr + 10700 10800 101 2 2 112 97 147 0.999 18.92 1.11 Term + 11187 11665 479 1 2 113 41 594 0.917 53.32 1.12 PlyA + 12107 12112 6 -4.33 2.06 PlyA - 12154 12149 6 1.05 2.05 Term - 13152 13047 106 2 1 43 45 80 0.046 -1.99 2.04 Intr - 15369 15285 85 1 1 98 72 24 0.126 1.83 2.03 Intr - 15610 15557 54 2 0 80 72 53 0.201 2.99 2.02 Intr - 18119 18024 96 0 0 78 57 43 0.142 1.34 2.01 Init - 20643 20562 82 0 1 74 75 71 0.167 3.63 2.00 Prom - 22343 22304 40 1.34 3.00 Prom + 26501 26540 40 -4.26 3.01 Init + 29097 29284 188 2 2 109 92 105 0.870 9.70Click here to view a PDF image of the predicted gene(s)
Click here for a PostScript image of the predicted gene(s)
Predicted peptide sequence(s): >22|GENSCAN_predicted_peptide_1|807_aa MEAVEAAAPAGGWDGLALRGLPGPFSRGLLEGPSVEQRGPGVRNQARRSEERALPTLRMY KRPGQRRAVRASARDARARGLRSLGSVPLLGSPATIVSQLQGAVGHLLGWVLSCCAKEQR TLRPWGSCQWPVREPKLPSPHPGAPGSEWHHRPPPPDSVGWAGLGAVRLGTRSRRRRRPG RRAPMDARSPLSPRASAFSIASLVAAEAAERTAHQGSGSSDRVKLRWLPGSPAGMHFSTV TRDMEAFTASSLSSLGAAGGFPGAASPGADPYGPREPPPPPPRYDPCAAAAPGAPGPPPP PHAYPFAPAAGAATSAAAEPEGPGASCAAAAKAPVKKNAKVAGVSVQLEMKALWDEFNQL GTEMIVTKAGRPSLLSPGTQALLRTAAAQPLAQASPARALSALGFTREPTSRSTLQPAEA SARVTQGLITPRRMFPTFQVKLFGMDPMADYMLLMDFVPVDDKRYRYAFHSSSWLVAGKA DPATPGRVHYHPDSPAKGAQWMKQIVSFDKLKLTNNLLDDNGHILSPAPPLELIPHLVFQ IILNSMHRYQPRFHVVYVDPRKDSEKYAEENFKTFVFEETRFTAVTAYQNHRITQLKIAS NPFAKGFRDCDPEDWPRNHRPGALPLMSAFARSRNPVASPTQPSGTEKDAAEARREFQRD AGGPAVLGDPAHPPQLLARVLSPSLPGAGGAGGLVPLPGAPGGRPSPPNPELRLEAPGAS EPLHHHPYKYPAAAYDHYLGAKSRPAPYPLPGLRGHGYHPHAHPHHHHHPVSPAAAAAAA AAAAAAAANMYSSAGAAPPGSYDYCPR >22|GENSCAN_predicted_peptide_2|140_aa MRPHWGVWAHLGRGRSAERALGHRFMPDLPLGPEQQLLSPELGPCRLSQEVAEQAGFQIV RGEAASALGNCGYVEFSAQGATKQTRLVPQPSPRVLLPLHLQCFPGANMYTKNRSARVDI WKLLAVPSERWHSGCRYHVG >22|GENSCAN_predicted_peptide_3|63_aa MERPAAAAGPGLCSGALPGIISRHRLTPRCLRAKPNLPAGTPTARGGGRFSWSQTQGRLL ACX Explanation Gn.Ex : gene number, exon number (for reference) Type : Init = Initial exon (ATG to 5' splice site) Intr = Internal exon (3' splice site to 5' splice site) Term = Terminal exon (3' splice site to stop codon) Sngl = Single-exon gene (ATG to stop) Prom = Promoter (TATA box / initation site) PlyA = poly-A signal (consensus: AATAAA) S : DNA strand (+ = input strand; - = opposite strand) Begin : beginning of exon or signal (numbered on input strand) End : end point of exon or signal (numbered on input strand) Len : length of exon or signal (bp) Fr : reading frame (a forward strand codon ending at x has frame x mod 3) Ph : net phase of exon (exon length modulo 3) I/Ac : initiation signal or 3' splice site score (tenth bit units) Do/T : 5' splice site or termination signal score (tenth bit units) CodRg : coding region score (tenth bit units) P : probability of exon (sum over all parses containing exon) Tscr : exon score (depends on length, I/Ac, Do/T and CodRg scores) Comments The SCORE of a predicted feature (e.g., exon or splice site) is a log-odds measure of the quality of the feature based on local sequence properties. For example, a predicted 5' splice site with score > 100 is strong; 50-100 is moderate; 0-50 is weak; and below 0 is poor (more than likely not a real donor site). The PROBABILITY of a predicted exon is the estimated probability under GENSCAN's model of genomic sequence structure that the exon is correct. This probability depends in general on global as well as local sequence properties, e.g., it depends on how well the exon fits with neighboring exons. It has been shown that predicted exons with higher probabilities are more likely to be correct than those with lower probabilities.