Thermotoga maritima MSB8 pseudogenes

The TIGR database classifies 26 ORFs from the Thermotoga maritima genome as "pseudogenes": 23 are annotated as frameshifts and 3 as point mutations that introduce a premature stop codon. It is assumed that none of these genes yields a protein product, so none of the entries has a corresponding translation in the TIGR database. The Thermotoga genome deposited at NCBI likewise lacks translations for any of the possible truncated products of these genes.

However, the products of some of these predicted "pseudogenes" have been purified, or even crystallized, at JCSG. This suggests that at least some of the TIGR pseudogene predictions are erroneous.

Target selection at the Bioinformatics Core (mainly focused on the C. elegans genome) is protein-driven, whereas the first round of target selection at GNF (focused on Thermotoga) was DNA-driven. The view of the JCSG tracking database served on the JCSG web site shows proteins or protein fragments. Because the "pseudogenes" in the TIGR database do not include a translation, they have not been included in the JCSG target database. This explains why some Thermotoga proteins that are absent from the tracking database display may already be at the crystallization stage at GNF.

Detailed analysis of one of those pseudogenes (TM1781), whose product has been crystallized by JCSG, indicated that TIGR assigned both the wrong start and the wrong reading frame to the ORF. With the wrong start point and the wrong reading frame, a frameshift had to be introduced at some point to align the ORF with orthologous bacterial genes. In this case, we predicted a possible gene product in a different reading frame and with a different start, and searched NR for similar proteins using BLAST. We obtained a hit with an E-value of 0, indicating the existence of a protein in GenBank that corresponds exactly to our prediction. It appears that the correction we were suggesting had also been made at NCBI, but it was not propagated back to the genomic entry for Thermotoga or to the Thermotoga database at TIGR.
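The re-prediction step described above can be illustrated with a minimal sketch. It translates a DNA string in the three forward reading frames and reports the longest stop-free stretch, which is one simple way to see whether a gene product is more plausible in a frame other than the annotated one. The function names (translate, longest_open_stretch) and the toy sequence are illustrative assumptions, not the procedure actually used at JCSG; the sketch ignores the reverse strand and start-codon selection.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, frame=0):
    """Translate dna in a forward frame (0, 1 or 2); '*' marks a stop codon."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for codons with unknown bases
        for i in range(frame, len(dna) - 2, 3)
    )


def longest_open_stretch(dna):
    """Return (frame, first_codon_index, length) of the longest stop-free run.

    Hypothetical helper: it scans the three forward frames only.
    """
    best = (0, 0, 0)
    for frame in range(3):
        start = 0
        for segment in translate(dna, frame).split("*"):
            if len(segment) > best[2]:
                best = (frame, start, len(segment))
            start += len(segment) + 1  # skip past the stop codon
    return best


if __name__ == "__main__":
    toy = "ATGAAATAGCCC"  # made-up sequence, not a Thermotoga gene
    for f in range(3):
        print(f, translate(toy, f))
    print(longest_open_stretch(toy))
```

A candidate product found this way would still need to be checked against a protein database, as was done with BLAST for TM1781.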

Our goal at JCSG is to improve the current annotation of the Thermotoga maritima genome and to distinguish real pseudogenes from annotation errors. Empirical information about the size of the amplified PCR products and of the expressed proteins (obtained from SDS-PAGE and mass spectrometry) will help to clarify which of those genes carry authentic frameshifts or point mutations and which are the result of wrong predictions. The corresponding gene products will then be included in the Bioinformatics Core target tracking database.
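The size comparison described above can be sketched as follows. The sketch estimates the mass expected for a full-length and for a truncated product and checks which one an observed mass matches. The average residue mass of about 110 Da is a common rule of thumb, and the tolerance, function names and example numbers are assumptions made for illustration; none of them are JCSG data or JCSG criteria.

```python
AVG_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average per amino acid (assumption)


def expected_mass_kda(n_residues):
    """Approximate protein mass in kDa from the number of residues."""
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


def classify_product(observed_kda, full_residues, truncated_residues,
                     tolerance=0.15):
    """Label an observed mass as 'full-length', 'truncated' or 'ambiguous'.

    tolerance is the accepted relative deviation from the expected mass;
    0.15 is an arbitrary illustrative value, not a validated threshold.
    """
    def matches(n_residues):
        expected = expected_mass_kda(n_residues)
        return abs(observed_kda - expected) / expected <= tolerance

    full_ok = matches(full_residues)
    truncated_ok = matches(truncated_residues)
    if full_ok and not truncated_ok:
        return "full-length"
    if truncated_ok and not full_ok:
        return "truncated"
    return "ambiguous"


if __name__ == "__main__":
    # Hypothetical ORF: 300 residues if read through, 100 if a premature
    # stop is real. The observed masses below are invented examples.
    for observed in (33.0, 11.5, 20.0):
        print(observed, classify_product(observed, 300, 100))
```

A band or mass-spectrometry peak near the full-length estimate would argue against an authentic frameshift or premature stop, while a match to the truncated estimate would support it.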

TABLE 1

Gene      JCSG Status     Last update
TM0131    Amplified       June 2001
TM0227    Amplified       June 2001
TM0254    Cloned          June 2001
TM0257    Confirmed       June 2001
TM0277    Cloned          June 2001
TM0323    Confirmed       June 2001
TM0378    Crystallized    June 2001
TM0380    Purified        June 2001
TM0621    Amplified       June 2001
TM0674    Amplified       June 2001
TM0680    Crystallized    June 2001
TM0873    Amplified       June 2001
TM1007    Cloned          June 2001
TM1168    Amplified       June 2001
TM1282    Cloned          June 2001
TM1318    Cloned          June 2001
TM1320    Cloned          June 2001
TM1343    Cloned          June 2001
TM1418    Cloned          June 2001
TM1438    Cloned          June 2001
TM1445    Amplified       June 2001
TM1477    Confirmed       June 2001
TM1710    Cloned          June 2001
TM1725    PCR Failed      June 2001
TM1781    Crystallized    January 2002
TM1837    Cloned          June 2001

TM0131 - Transposase, authentic frameshift
[Identification alignment]

>TM0131
TTGCTGAATGTACCGCATCAAATTTACATCGTTCACCCTCATCCTAAATCCCCCCTTTCC
ATAATGAATTTTATCAGAACGTGCAAAGAAAACTACGTGATAAGAATAGTAAAGAACGAC
TGGCTTTACAAGATCATCCGCGATCTTGTCAGCGAAAGCCAAACTGGAAAGATCGTGGTG
AAAGATCTCAACATTTAAGGCAAAGCGCTGTGGGGTCAAAGTGATAAAAGCAAGCAGATA
CTATCCCTCAAGTCAGCTATGCAGTGAATGTGGATACATAAATAAAGAAGCCAAAGAGTG
GAGCGCATCATGATAGGGACGCTGCAAAGAATCTAGCCAGGTATGGCTTGATGCTATCAG
TAGGGCGGGAGCCGTCCGAGTTCATGCCCGTTGACCGTGCTCTGGCGGCGGAACCTAAAA
AGGGTCTACGAGCCATCACGGGT

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to PID:1149666 percent identity: 69.88; identified by sequence similarity; putative;transposase, authentic frameshift".


TM0227 - NADP-reducing hydrogenase, subunit A, authentic frameshift
[Identification alignment]

>TM0227
GTGAAGATGAGGGAGGTAATAGCAGAGATCGTCCAGAAAGCTAAAGAGACAGCAGAAGAG
AGAGATGTTTTGATAAACACTCTGCACGAGATACAGAAGCGCTTTGACAACTTCATACCA
CCGGAGGCTGCTGAGATCGTGGCTGAAGAGCTCGATGTTCCGCTCTCCAGAGTGTACGAG
GTGTTGACGTTCTACACCATGTTTTCGACAAAACCGAAGGAAAATACGTGATAAGGGTTT
GTGAGAGTCTGCCGTGTCATGTTGAAAACGGAAGAGAAGTGGTCAAAGCCCTCAAAGAAA
TTCTGAAAATCGACTTCGGACAGACCACTTCTGATGGTCTTTTCACACTCGAAATGACGA
GTTGTCTGGGTCTTTGTGGTGTTGCACCGGTGATCATGGTGAACGACGAGTATTACGGTA
ACATGACGCCCGATCGTGTGAAGGATCTCATAGATAGACTGAGAGGTGAGTCGCTA

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GB:U07229 PID:466363 percent identity: 63.64; identified by sequence similarity; putative;NADP-reducing hydrogenase, subunit A, authentic frameshift".


TM0254 - Small protein B, authentic frameshift
[Identification alignment]

>TM0254
TTGAAGGTATCTGGCTGGGGTGAGATGGTGAAGGTTGTTGCCACGAACAAAAAAGCCTAC
ACGGACTACGAGATCCTGGAAACTTACGAAGCGGGAATCGTTCTCACAGGAACGGAAGTG
AAATCCCTGAGGAACGGCTCAGTCAATTTCAAGGATTCCTTCTGTAGATTCAAGAACGGG
GAGCTTTACCTTTTGAATTTACACATTCCACCCTACAGTCACGGTGGAGTTTACAACCAC
GATCCTGAAAGGCCGAGGAAACTGCTTCTTCACAAAAGAGAACTGAAGAGACTCATGGGT
AAGGTACAGGAAGAGGGAGTAACGATAGTTCCACTGAAGATATACTTCAACGATCGCGGG
ATTGCAAAAGTGGAGATAGCCGTTGCTCGAGGGAAAAAAGAAGTATGACAAGAGAGAAGC
GATAAAAAAGAGGGAAATGGAAAGAAAGATCAGGGAGTACATGAAGTATTCGAGA

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GB:M90060 SP:P43659 PID:153566 percent identity: 75.33; identified by sequence similarity; putative;small protein B, authentic frameshift".


TM0257 - DNA replication enhancer, putative, authentic frameshift
[Identification alignment]

>TM0257
TTGACGAGAAGGGAAGTGGTTGGTTTGAAACTCGGTGATTTGATGCCAAGAGATCTGCTA
AAACAGGAACTGTTTGTTCAGGATTTGAAAAGTCATATCAACGACAACGTAGAAATTATC
TTGAAGATTAGGAGTAAAAAACTCCAGGAAACAAAAGACAGTAAGAAATTCCTGATCATG
ACACTGGAGGACAGGACGGGAACCGTCAGGGCTGTGGACTGGTACAACGCTGAACTCAAC
GATCAGCGTCTGAAGGAAGGAAACGTCGTCAGAGTGAAGGGCAGAGTGGTATTTTTTGAG
AACAGGATACAGATAAACGTGGATAACGATTACGGTGCCATAAAGATTCTGAAGAGCGAC
GAGTACGATTACACGAAATTCGTCGCACAGTCGAAAAAAGACCTGGAGATTTTGAAGAAA
AAGCTCTTTGTCCTTCTGGATCAAATAAAGGATCAACACTACAAAAAACTTCTGAAGGCA
TTCTTTGAAAATAAAGAGTTCTCCGAAAAATTCTTCAGATCACCGGCGGGTATGAGAGTA
CATCATGCCTACATAGGAGGACTTCTAGAGCACAGCGTCACGGTCGCTGAGATTTGCAAA
GAGATCAGTAAATATTACTCCCTCGATAGAGATCTTCTCATAACTGGTGCCCTCCTCCAT
GATGTTGGAAAGGTGGAAGAATACAGAATCACAGAATCCGGGATCGAAGTGACTACCGAA
GGGAGTTGAAAGGGCATATAGCAATAGGGGCGGCTATGGTTAGAGAAATGGCGAAGAAAT
TGTCTATTCCAGAGCACAAGATCCTCGAGTTAGAGCACATAATTCTTTCGCACCACGGAG
AGCTCGAATGGGGTTCACCCGTGGTTCCAAAGACGATAGAGGCTTTAATAGTTCATCACG
TTGAGAACCTGGATTCCAAGATAGCTCGATTCATCGAGGTTATGGAAAGTTCCGAATCGA
ATCAGGGTTGGACGGAGTACGACAGGAATTTGGGAAGAAGAATCTTTTTAAGAGGTGGAG
GAACTGATGAG

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to PID:2352094 percent identity: 76.90; identified by sequence similarity; putative;DNA replication enhancer, putative, authentic frameshift".



TM0277 - Sugar ABC transporter, periplasmic sugar-binding protein, authentic frameshift
[Identification alignment]

>TM0277
ATGAAGAGATTTTTACTGTTGATCTTTCTCATCATCACCTCGTTGATCTTCTCGGTTAAA
ATCTCCGTTCTCTGTTCTCCAGACAACGCGGACGCCCTGAAGTGGCTTGCCCAGGAGTTC
ATGAAACAGAATCCCGACATTCAGGTTGAAATCGTACCTCTTTCGTGGGAAGTACTTTAT
CCGAAGCTGCTGCAGGATCTCAGATCTCAGGCTGGATCGTTCGACGCTTTCACTTACGAT
GTGATGACCACTGGAGCCGTCTCTTTCCGGACTGGTTGACCTTGGAGAGTTCATGAAACA
ACATCCAGAACTTGTTCCAGAAGATTACGATTTGAACGATTTTATCCCACAGGTTCTGGA
AGAATCTGGAAAGTGGAAGGGAAAACTCATCGGGCTTCCGTTCTACAACAACACAATGCT
CTTCTATTACAGAAAAGATCTCTTTGAAGATCCAAAGATAAAACAAGCGTTCAAAGAAAA
ATACGGTAGAGAACTCACCCTCCCGACCACCTGGGAAGAAGTTGTAGAAATAGCGGAATT
CTTCACCAAAAAATACAACAAGAGCTCTCCAACAGACTACGGAATCGCCCTCATGTTCCC
GAGAACCCACACACTCTTCTACATGTATCTGCTGTTTTTCGGTGAGTACAGGAACGCACC
ACTCGGTATCATGAGGCACGGAACCGCGGATCTTGAATTCGGTGAATACTTCACAGCGGA
TCACAAACCTGCCTTCAACAGTGAAGAGGGATTGAAAGCGCTCGAAATGATGAAAAAACT
CATGCCTTACAGTCCAGATCCGCTCGGCTCTGATTACGGTGAAACGATCGAGTACTTCAA
CCAGGGACTCGTTGCTATGGTACCTCAATGGACGGGACCGTATCTGATCTTCAAGAGCAC
CCTCGGTGAAGATAAAGTCGGGATCATTCCCATGCCGGGTCGATCCGTGAGTGGTCAATG
GGCACTCGGCATCAACAAATTTATACCTGAGGAAAAGAAACTCGCTGCGTTCAAATTCAT
CATTTTCGCCACCAGCAAATGGGCTGACAAGAACAAGTTCCTGAGATTCGCCGTCGCTCC
TGCCAGAATCTCAACACTCCAGGATCCCGAGGTGAGGGCCGCTGATCCGAGAGTTCCCGC
CCTCGAGGTAACGTACGTTTCTCAGACCCACAGGCCAAGGATTCCAGAGGAACCAAGACT
CGAAGACATCACCGTTGAGACCTTCTCCAAGATCCTCTCTGGAGAACTCCCGCTCTCCAT
GGAAACGCTGAACGATCTCGCAAAGAAATGGGAAGAGATTCTTGGAAAA


NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to PID:2293414 percent identity: 44.29; identified by sequence similarity; putative".


TM0323 - Conserved hypothetical protein, authentic frameshift
[Identification alignment]

>TM0323
GTGAAAAAAGTGGACCAAATTCTTTTGAAGTTTGAGAAGCACGCCGCCAAGATTCTCCTT
ATGGCTATGATTGTTCTTGTCTTTGCTTCAGGGGTTGCCAGATTTCTAAAGCACCCAATA
AACTGGGCTGTTGATATGAGTAGTTTTCTCTTTGCCTGGGCTTGTTTCTTTGCAGTCGAT
GTAGCCTGGCGTGAAAACAAAATGATGTCGGTGGATATACTTGTGAAGAAATTCTCCGAA
AGAACTCAGAATAGTGAATTACCTTATCATTCTTGCCTTCATTGTTTATCTCATTGTGTG
GGGCTTTTATCTTTCCTATAAAACAAGATACAGAACCTTCGTAGGGATACCGAACTTTAG
CTACACGTGGGTTACACTTAGTGTTCCTGTTGGTGCGATTCTGCTCTTCAGAACAACTGT
ACTGAAGCTGATAGGAGAATTCAGAGGCAACAAGAAGGAGGAAAGA

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GB:L42023 SP:P44994 PID:1006260 PID:1221145 PID:1205279 percent identity: 50.67; identified by sequence similarity; putative".


TM0378 - Glycerol-3-phosphate dehydrogenase, authentic frameshift
[Identification alignment]

>TM0378
ATGGAGATGAGATTTTTTGTTCTCGGTGCAGGGAGCTGGGGGACAGTTTTTGCACAGATG
CTGCATGAAAACGGAGAAGAAGTGATTCTCTGGGCAAGAAGGAAAGAGATCGTCGATCTC
ATAAATGTTTCACACACGAGCCCTTATGTGGAGGAATCGAAGATCACCGTAAGAGCCACA
AACGATCTTGAAGAAATCAAAAAAGAAGACATTCTCGTTATAGCGATTCCCGTTCAATAC
ATAAGAGAACATCTTCTGAGACTACCTGTGAAGCCTTCCATGGTGCTGAATCTTTCAAAG
GGAATAGAGATCAAAACAGGTAAAAGAGTGTCTGAGATCGTTGAAGAGATACTGGGTTGT
CCTTACGCTGTCCTCTCCGGTCCTTCACATGCCGAAGAGGTCGCAAAAAAACTCCCAACT
GCCGTCACACTCGCTGGAGAAAATTCGAAAGAACTTCAAAAGAGGATCTCCACCGAGTAC
TTCAGGGTGTACACCTGCGAAGATGTGGTGGGTGTGGAAATCGCTGGAGCATTGAAAAAT
GTCATCGTATCGCCGCGGGGATCCTGGATGGATTCGGTGGTTGGGACAACGCAAAAGCAG
CACTCGAAACTCGCGGTATATACGAAATAGCAAGATTTGGAATGTTTTTTGGAGCGGATC
AGAAGACCTTCATGGGTCTTGCAGGAATAGGCGATCTCATGGTCACTTGCAACAGTCGTT
ACAGCAGAAATAGACGCTTCGGAGAATTGATAGCGCGGGGATTCAACCCGCTGAAACTCC
TTGAAAGCTCAAACCAGGTTGTAGAAGGTGCCTTCACTGTGAAAGCTGTGATGAAGATAG
CCAAAGAAAACAAAATAGATATGCCCATCTCCGAAGAGGTTTACCGAGTCGTTTACGAAG
GGAAACCACCTCTTCAGTCGATGAGAGATCTCATGAGAAGAAGCCTGAAAGACGAATTCT
GGGCGAGC


NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to SP:P46919 PID:974332 PID:1146220 GB:AL009126 percent identity: 65.53; identified by sequence similarity; putative".


TM0380 - Conserved hypothetical protein, authentic frameshift
[Identification alignment]

>TM0380
ATGGCAGCGAAAGAAAAACTGGAACAGATTCAGAAGCAGCTTCAGGAAATCATCGGAAAG
GCACCTGAGATCGGAAAGTTCGCAGAATACGTTCACGCGGCTGAAAGCCCGAAAGCACTC
GACACAAAAACGAAGGAACTCATATCTCTTGGAATAGCTGTGGCAGTGAGATGTGAACCG
TGTATCGTCTGGCATGCGGGACCGCTGTGAGAGCAGGAGCAACAGAAGAAGAAATCCTCG
ACACAATCAAAGTGGCAGTCTGTATGGGCGGTGGCCCAGCACTCATGTACGGTTTGAAGG
CATACGAAATCGCCCTCGAATTCCTCGGAAAA

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GB:L42023 PID:1006306 PID:1221170 PID:1205303 SP:Q57498 percent identity: 59.41; identified by sequence similarity; putative".


TM0621 - Lipopolysaccharide biosynthesis protein, putative, authentic frameshift
[Identification alignment]

>TM0621
TTGCAGACAGATATGGATCTGGAAATAATAGTTGTTAACGATGGTTCCACAGATCGAACG
AAAGAAGTGCTTATGAAGTACTTTCCAACAGCGGTTTTAGCAACTACAAAGTCATCACAA
AAGAAAACGGTGGCCCAAGTTCAGCGAGAAACAGAGGATTGAAAGAAGCACAGGGACAGT
ACGTGATCTTCCTCGATGGGGACGACTATGTAGCCCCGATTTTGGTTGAAGAATTGAAAA
AAGCCTTGTCCATAGCCCAGGCAGATGTCTTCTGTTGGAATTTTTTAGTGGTTGACGAAT
CAGGTTCTGCACTGAGCTGGCAGTTCCCATGGAGGCTTACCGATTCGTATGATTTACTGG
ATGGGATTGCCATTCTTCGAAAAATCCTAATTGAAAAGCAGTTGTGGGTGTGGACTGGCA
GCGCTGCATATTCGCGACACTTTCTGAGTCAGAACGATTTCCTTTATGCTGAAAAATATT
ACACTGGTGAGGACCTTGAATTTGAATGGAGGGTTCTTTTGAAGAATCCCAAAGTGCTTG
CGATTAGTAAAACGTTGTCGTACTACGTTCAGCGTCCCGCTTCATTGACTAAGACCATCA
ATTTCAGGCAGTTCGATTTCTACCCTGCTCTCAAGCAACTTTATGAAAAACAGAAAGAAT
TGATCTCGGATGATGAACCCACGAGCGAGTTACTACAAGCTGTTCTTGAGTGGTCAATTT
TCAACTTTCTGGGCTTAGTCTTTTTCCAGCTGAGAAACTCAAAAAAATCTCCGATCAAGG
AATTTCTAAAAGGGCTAGAGCAACATCATCCCGGGTTGTTCCAGATGGTGGTAAGAGATG
CCAGATCTATAACGAGACTGGTTTCAAAGGTCACCAAAAAGGACAGAGTCTACTTTTCAT
TGTTTAGGTTTTCTCCTCGCGTTTTCCTTACCATCTGGCTTCATCTTTCAAGACTCAAAG
CTGCGTGCAGGAGGCTTATAAAAAAACTCATGAGAGATAAAACTCCAGAATAT

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GP:1276882 percent identity: 53.77; identified by sequence similarity; putative".


TM0674 - Flagellar hook protein FlgE, authentic frameshift
[Identification alignment]

>TM0674
GTGATGATGAGGTCACTCTACAGTGGCATAACGGGTCTCAAAAACTTCCAGGTAGCGATG
GATGTTGTAGGAAACAACATCTCCAACGTGAACACAGTTGGGTTCAAAGCATCGCGAGTG
AATTTCGAGACGATGCTCGCTCAGACGATCAAGGCGGGGAGATCTCCTCAGGAAAATGTG
GGCGGTACAAATCCCATGCAGATAGGTCTTGGTTCTCAGGTGTCCAGTATCGACAAAATC
ATGACGCAGGGCTCGTTCCAGAACACTGGTGTGAAAACTGACCTTGCCATTCAGGGAGAT
GGGTTCTTCATCGTCTCGGATGGAAGCTCCTATTATTACACAAGAGCCGGTGCATTCACT
CTGGACAGCAACGGAAACCTGATTCAGACCTCTACAGGTTACAAGGTTCAGGGATGGACA
GCCGTTCAGGATCCAGAGACTGGTGAAAGGTACATCGACACAAACAAACCCATAGGGGAT
CTGGTGATCAGTGCAGGTATGACGATGCCGGCAAAGAAAACTTCGAACGTTCGTTTTGAA
GGAAATTTGAACTCCAAATTGGGACCTGGTTCGTTCGTCATCACTTTAACTGATGAAAAT
GGAATTAATCACGATGTCAGGTTGTGGTTTGAAAAAACACAAAACGATCTGGGAACAGAT
CCGTTCAGTTCTTCTCAGAGGTACACGATGAAGATCGATATCGACAACGATGGAACAGCC
GATGTGGATGGTTACATGGTGTTCAACGAATTTGGAAGAGTGGAAGAGGCTGGAGTATAC
GCTGAATCTCAGGTCATAACGGCAACCACCGATGGAAGTATTTCTGGAAGCACAGCTCTA
CCAGATGGTACCTATCAGGCTATTGTCATGGATTCTTCTGGAAATGTGATATACAACGGC
ACCACAGACGTTTCAGGTGGTAATTTCACGATCAACGATTCTGATATCACCGCTGGAAAC
AATTACACCGTGATACTCTTAACTCCAAACAACACATCAACTTCTCTCACAATAGCACCC
GGTGCTGAACTTGTCATCCCCACTTCCGGTGAGCCCAGGTTCTATGAATCAGACAATCCA
ACGAATTTCGTCCAAACTGATTACACCTCCCCGAGGTACACCACCGCTGTTCAGGTTTAC
GATTCTCTTGGAAACGCGTACACCGTTTACTACGAATTCGTCAGACTTGGAAACACTACA
ATCGGTTCGGAAAGCTTCAAGAACGCCTGGATCTGGAGAGCGTACACGGATAGTGGAGAA
CCGGTTTCACTGATCGATGAAGACGGCAACGGTGGTTATGTGGCGGGGCTGATAGATTTC
GATGAATCAGGAAGGCCTGTTCAATACAGAGGTCTTGACTCTTCATACAATTTAAGCTCC
AAAGAGGTGAGAACGATCCAGTTTGACACGGGCCAGAAAGGAGACGGAGTGGTAACAATA
ACAGCTGATTTTTCCGGAGCAACTCAATTCTCCGGCGACAGCACTCTGAGCATCCCGTGG
CAAGATGGAAATCCGATGGGTGTTCTGGAATCCTTCGCCATAAACGAGCAGGGAGAGATC
ATAGGAACCTTCAGCAACGGACTCACCGACGTTCTTGGACAGATCGCCTCGCGGTGTTTA
ACAATCTTCGGGATTGATGGAAGCAGGTAATTCACTCTATACCATGTCTCCTAACAGCGG
CGTTCCAAAGATAGGAGCTCCTGGCAGCGGAGGAAGGGGTGTTTTGATCCCCGGTGCACT
TGAGATGTCCAACGTGGATCTTGCGGAGGAGTTCACCAAGATGATCGTTGCACAGAGAGG
TTTCCAGGCGAACGCGCGTGTCATCACAACTGCCGATCAGATTCTGAACGAACTTGTGAA
CATCAAGAGA

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to PID:1165267 PID:639926 SP:Q44767 GB:AE000783 percent identity: 61.09; identified by sequence similarity; putative".


TM0680 - Flagellar motor switch protein FliY, authentic frameshift
[Identification alignment]

>TM0680
ATGACGGAAAATGAATTCCTCTCTCAGGAAGAGATAGACAAGCTTCTTAGCGATTCTGAT
ACCGGTGGTGTCCTCACTCCCGAAGAAAAGGACATGATAGGCGAGATAGGAAACATCGCA
ATGGGGAGTGCTGCAACGACGCTCTCGATGATCCTCGGAAGGGATATTCACATCACCGTT
CCAACTGTTCGGGAAGAAAAGATGAAGAACGTCAAGAGTGACTTCAGTGGTGAGCAGGTA
GTGGTGAGTGTGGAATACACGGAGGGGCTCGAAGGTTTGAACGTTCTTGTCCTCGATAAA
AAACTCGTGGCGGTGATAGCTGACCTCATGATGGGGGGAAGTGGAGAGGTAGAAACCGAG
GAATTGGACGAAATCAAACTCAGCGCGGTTGGAGAAGCGATGAATCAGATGATGGGAAGC
GCTGCAACATCGCTTTCTGAACTTCTGGGAATAACCATCAATATATCTCCCCCGAAGTGG
AGATATTGAATTTCGATGATCCTAACACACAGTTTCCACCGGTAACCGACAATCCGGAGA
AAGATGTTGCCGTTGTCGAATTCGAAATGGAGATAGAGGGACTTCCGAAGTCGAAGTTCT
ACCAGGTGATAAGCGCCGATCTGGTGAAGAAGATGTACGAGTATTTCACGAAAAAACAAT
CCGAAGCAGCTGAGAAAAAAGAGAAGAAAGAGGAGAAAAAGGTGAAGGTCGAACCGGTAG
AATTCGCAGAGCTGAAACCTTCTGAAACCAGAAAAACCGAAGTGCCGAGTGACAAGCTGG
AACTGCTTCTCGATATTCCTCTCAAAGTCACAGTGGAACTTGGAAGAACGCGAATGACTC
TGAAACGGGTTCTGGAAATGATCCCTGGCTCCATAATAGAGCTGGACAAACTCACGGGGG
AACCTGTGGATATCCTCGTGAACGGAAAGCTCATCGCCCGCGGAGAAGTTGTCGTGATAG
ATGAAAACTTTGGGGTGAGGATAACAGAGATCGTGAGTCCAAAAGAGAGGCTCGAGCTTC
TCAACGAA

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GB:M86738 SP:P24073 PID:142926 PID:551704 GB:AL009126 percent identity: 65.68; identified by sequence similarity; putative".


TM0873 - ATP-dependent Clp protease, ATPase subunit, authentic frameshift
[Identification alignment]

>TM0873
ATGAGAAACCTGAACGAAAAGATGAAGGATATTTTCGAGAGGAGCGTGGAGGATATCAAA
GAAAGAAATCAGAACCTGCTTAGACCAGAGCACATTCTCCTTCAGATCCTTTACGATGAA
AACGAGGCCACGAAGGTGCTGAAAGATCTCAATGTAAACGTGGAGGAAGTGATTACTGAG
CTGGAGGACTACATCGATTCCCAGTACGGGATATACTACGGATTCTCCGATCAGGTCTAT
GTATCGAAAGAACTTTCTTACATCCTGGAGCTAGCGAGAAAGGAAGCCAGGCTTTTCAAG
CAGAAGGATATAGGTCCTCTTCACTTCCTTCTGGGACTGCTGAGAGATGGTTCCACACAC
GCTGCACGTGTCCTGAAGAAGTACGGTGTTGATTACGAAAAAGTGCTTCAGACGGTTAAG
GAGCACGAAGAAGAATACGCAGCTGAGCAGTCACCACTCACGGCTTTTGCAACCGATCTC
ACAAAATTGGCAAAAGAAGGAAAAGTGGGTCCCATCATAGGAAGAGACAGGGAAATCGAG
CGCGTGATAGAGATTCTCATGAGAAAAACCAAGAACAACCCAATTCTCATCGGTGATCCG
GGTGTTGGAAAAACCGCCATTGTGGAAGGTCTCGCACAGAGAATCGTTGAAGGAAAAGTT
CCTGATCCTCTAAAGGATGTGCGCATACTGATGGTAGATCTCGGAAGGATGATAGCGGGG
ACCAAATACAGGGGAGAATTCGAAGAAAGGTTGAAATCATTCCTCGATGAAGTGATGAAA
CAGAAAGAAAAAACGATTCTCTTCATCGACGAGATACACACTCTGGTTGGTGCTGGCGCA
GCCGAAGGAGCGATGGACGCTGCCAACATGTTGAAACCGGCCCTTGCTCGCGGTGAAATA
CGCGTCATAGGTGCAACACACTGGATGAGTACAGAAAACACATAGAAAAAGACAAAGCGC
TCGCGAGAAGATTCCAGCCGGTGATGGTGAGAGAACCGAGTGTGGAAGAGACCATAGAGA
TTTTGAAGGGGTTGAAGAAAGTCTACGAGGAGCACCACAAAGTGAAGATAGAGGACGAAG
CGATAGAGGCAGCTGCCAAGCTCTCAGCGAGATACATAACGGACAGATTCTTGCCAGACA
AGGCGATTGACCTGCTGGATGAGGCGGCAGCGAGAGTGAGACTCAGCGCTACAAAACAGG
AAAAGGATGAAACGAAGCTTCGCGAGCTCGAGGAAAAAATAAAAGAGCTTGAAACGAAAA
TCGATGAGCTCACTATAAGATCTCAGTACAAAGAAGCGGCTGATCTCAAGAAAGAACTCT
TCAAGCTCAAAAATGAGTATGAAGCTTTGAAGAGTGGGAAACCGGTTGTCACAGCGGAAA
AAATAGCAGAAGTGGTTGAATCCTGGTCTGGTGTTCCAGTTTCCAGAATCGTTGAATCTG
AAAAGGAAAAGCTTCTGAAGCTCGAAGAGATAATCCACCAGAGGCTCGTGGACCAGGAGG
AAGCGGTCAAGGTAGTGGCCGATGCCATCAGGAAGGCAAGGGCTGGAATAAAAGATCCTA
ACAGGCCAGTTGGTACCTTCCTCTTTCTTGGACCAACGGGTGTTGGTAAGACTGAGCTCG
CAAAGACACTCGCTGAAGTGCTCTTTGGAAGCGAAAACGCACTGATACGAATCGACATGA
CCGAATACATGGAAAAGCACGCTGTCTCCAAGCTGATAGGAGCACCTCCGGGATACGTTG
GTTACGAGGAAGGTGGTCAACTGACCGAAGCCGTAAGAAGAAGACCGTACAGTGTTATAT
TGCTCGATGAGATTGAAAAAGCGCATCCGGATGTGTTCAACATACTGCTTCAGATAATGG
ACGATGGAAGACTCACAGACAGCAAAGGAAACGTTGTTGATTTCAAGAACACGATCATCA
TCATGACGAGTAACATAGCAAGCGATCTGATACTGAACTACGTTAAAGAAGGAAAGAGCT
TTGATGAGATTGAAGAAAGAGTCAGAGAAGAGCTGAAACACTACTTCAGACCCGAGTTCA
TAAACAGGATAGACCATGTCGTTGTCTTCAAACCGCTCACAAAAGAACACATGAAACAGA
TCGTGGAAATAATGATCAGAAGACTTGAAGCGCGGTTGAAGGACAAGAATATAAAACTCA
CGATCACTGAAGCGGCGAAAGAATACCTCGCAGAAAAAGGATACGATCCAACGTTCGGTG
CAAGACCGCTGAGGCGGCTGATAGAAAGGGAAATCGAAACACCACTCGCAAGAATGATCA
TAGCAGGTGAGGTGCAGGAAGGTCAGACGGTGAGAGTGGATTACAACGGTGAAAAACTGA
TCCTCGAAGTTGCCAGAGAACTCGAGAAGGTTCAG

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to PID:1001492 percent identity: 73.99; identified by sequence similarity; putative;ATP-dependent Clp protease, ATPase subunit, authentic frameshift".


TM1007 - Conserved hypothetical protein, authentic frameshift
[Identification alignment]

>TM1007
TTGATGGAAAGAACGGGAATTCTTCTTGGAATATGTCTTGGCCTCACCAGTTTTTCAATA
CTTCAGGGATCGGTGTTCGGAGCGGTTCTTCCCTCCATTGTGGAAGAATTCGGCGTGGAT
TGGAGTATCATAGGAGTTGCTATGAGTGTCTGGACGGTCATTTCTGCTCTCTCACCCATG
TTATTTGGAAGATTTGTTCATAGATTATATCCAATGAACTCCATGGCTCTGGTCATGATG
ATGCTCTCTATTCCAACAATTCTTGTTGCTTTCGTGAAAGACTTTTTCTCTTTAAACGTT
GTGAAGATAGTGGGGAGCCTGGCTGTTCCCTTCTCTTATCCTCTTGCTGCAAAAGTGGTG
GAGATGTATGTGGACTCCAGAAAAAGGGGAATCGCAACTGCCATATACAACACTGGTTCT
ATGATCGGACTTGCACTCGGATACGCTGTTGTTGCGTTAGCAGGTGGTTATTGGAAAAGA
TCCATGATCACTGGAGGATTTCTCGGTGTTATTTATGTTCCTGTTGCATACATTCTGTGG
AAAAGCTTGCTGGAGTCAAAGGTACAGAGAAAGCCGGAGTGGAACGATTCTCAAAAGAGA
TCACATGTTTCTTTCAAACGAGTGTTCTCCATCATACTGTGGCTTTCCTTCGGTCATTTT
TCTGCTGTTTACACCTGGAATCTCATGTTCAATTGGCTTTCTACTTTCCTTGTTCGTGAG
ATCCAGCTGGGTTATAGTTTCATAGCCCTTGTGCTTGGAATCATGGCTGTTGTATCGAGC
GTAATGGAGGTTTTCGTTGGATTGTGGTCTGACCGGGTGAGAGGAATGCGTGGAAGGTTA
ATTCCCCTGTATACCGGTTTATTTCCGTCGGCTTTTCTTTTAATACTTTCCACTCTTTCA
ACCAATCCTCTTCTGACATCCATTCTGGTGGGGTTCTCCATCCTCTTCTGAGACTTTCAA
CCCCTTCTTTCTGGGCAATATTTGGAGATCTCATTCCGCAGGAACACTTCGAAAAAGCGA
GTAGTATCTACGTGGGAGCTGTCCTTCTTTCTGGTATTGCTTCTTCTATTATGAACGGTT
ACATAGTCTCGTTGACAGGTTCGATGAAGTACGCCATACTCCTTTCGGCTTTTATACTGA
TTCTTTCTCCGATTTTCTTCACGGTAGCGGGAAAAGTTGGTACGAGAATTTCAGGAGCAT
GGATCCATCTA

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GB:AE000782 percent identity: 49.58; identified by sequence similarity; putative;conserved hypothetical protein, authentic frameshift".


TM1168 - Alpha-glucan phosphorylase, authentic frameshift
[Identification alignment]

>TM1168
GTGCTGGAGAAACTTCCCGAGAACCTGAAAGAGCTCGAGAGCCTTGCCTACAACCTCTGG
TGGAGCTGGTCCAGACCTGCTCAGAGACTCTGGAGAATGATCGATTCAGAAAAGTGGGAG
GAACACAGAAATCCCGTCAAAATACTGAGAGAAGTCTCAAAGGAAAGACTGGAAGAACTA
TCGAAAGACGAGGACTTCATCGCTCTCTACGAACTGACGCTCGAGAGATTCACAGACTAC
ATGGAAAGGGAAGACACCTGGTTCAACGTGAACTATCCCGAATGGGACGAAAAGATAGTT
TACATGTGTATGGAATACGGACTGACGAAAGCACTTCCGATCTACTCTGGAGGACTCGGT
ATCCTTGCCGGAGACCACCTCAAATCAGCCAGTGATCTTGGCCTTCCTCTCATAGCCGTA
GGTCTTCTTTACAAACACGGGTATTTCACTCAACAGATAGACAGTGACGGAAGACAGATC
GAGATCTTTCCAGAATACGACATCGAAGAACTCCCGATGAAACCTCTCAGGGATGAAGAC
GGAAACCAGGTGATCGTAGAAGTACCCATAGACAACGATACTGTAAAAGCGCGTGTGTTC
GAGGTACAGGTCGGAAGGGTGAAACTGTATCTTCTCGACACTGACTTCGAGGAAAACGAG
GATAGATTCAGAAAGATCTGCGACTATCTCTACAATCCCGAGCCTGATGTGAGAGTTTCC
CAGGAAATTCTGCTCGGCATTGGTGGAATGAAACTCCTGAAGACTCTCAAGATAAAACCT
GGAGTCATCCACCTGAACGAAGTCATCCCGCTTTTTCATCCCTCGAAAGGATAAAGAGCT
ACATGGAAGAAGGATATTCCTTCACCGAGGCCCTTGAGATCGTCAGACAGACCACAGTTT
TCACGACACACACCCCCGTCCCCGCAGGTCACGACAGGTTCCCGTTCGATTTCGTGGAAA
AGAAGCTGACAAAGTTCTTCGAAGGATTCGAATCCAAAGAACTGCTTATGAACCTTGGAA
AAGACGAAGACGGAAATTTCAACATGACGTATCTTGCTTTGAGAACCTCCTCCTTTATAA
ACGGAGTGAGCAAACTTCACGCTGACGTATCGAGAAGGATGTTCAAAAATGTCTGGAAGG
GAGTTCCGGTGGAGGAGATCCCCATTGAAGGCATCACGAATGGTGTCCACATGGGAACCT
GGATCAACCGCGAGATGAGAAAACTGTTCGACAGGTACCTCGGTAGAGTCTGGAGGGAAC
ACACTGACCTCGAAGGAATATGGTACGGAGTTGACAGAATACCCGATGAAGAACTCTGGG
AAGCGCATCTGAACGCAAAGAAACGATTCATAGATTACATAAGAGAATCCATCAAAAGGA
GAAACGAAAGGCTTGGAATCAACGAACCACTGCCGGAGATCAGTGAAAACGTGCTCATCA
TAGGTTTTGCCAGAAGGTTCGCAACTTACAAGAGAGCCGTCCTGCTCTTCAGCGATCTGG
AAAGACTCAAGAGAATTGTCAATAATTCCGAGAGGCCGGTTTACATTGTGTACGCTGGAA
AGGCCCACCCGAGAGACGAAGGTGGAAAGGAGTTTCTCAGAAGGATCTACGAAGTTTCAC
AGATGCCCGATTTCAAGAACAAAATCATCGTACTCGAAAACTATGACATCGGAATGGCTC
GACTCATGGTGTCGGGTGTTGACGTGTGGTTGAACAATCCAAGGAGGCCCATGGAGGCAA
GCGGGACAAGTGGTATGAAAGCTGCAGCGAACGGTGTTCTGAACGCGAGTGTATACGATG
GCTGGTGGGTTGAGGGATACAACGGCAGAAACGGATGGGTGATAGGTGATGAAAGCGTGC
TTCCAGAAACAGAAGCGGATGATCCAAAGGATGCCGAGGCTCTGTATGAGCTTCTCGAAA
ACGAAATAATCCCCACCTACTACGAAAACAGAGAAAAGTGGATCTTCATGATGAAAGAAA
GCATAAAGAGCGTGGCTCCAAAATTCAGCACCACCCGCATGCTCAAAGAGTACACGGAGA
AATTCTACATAAAGGGACTTGTGAACAGGGAATGGCTGGAGAGAAGAGAAAACGTCGAAA
AAATCGGAGCCTGGAAAGAAAGAATCCTCAAGAACTGGGAGAATGTTTCCATAGAGCGCA
TTGTTCTTGAAGATTCGAAGAGCGTAGAAGTAACTGTAAAACTGGGCGATCTCACACCGA
ACGACGTGATAGTCGAACTTGTGGCTGGAAGAGGAGAGGGAATGGAAGATCTCGAAGTGT
GGAAAGTGATACACATCAGAAGGTACAGGAAAGAGAACGATCTATTCGTTTACACTTACA
CCAATGGTGTTCTTGGTCATCTTGGATCTCCCGGATGGTTCTACGCGGTGAGAGTCATAC
CGTACCATCCCAGGCTTCCCATCAAGTTCCTGCCCGAAGTACCGGTTGTCTGGAAGAAGG
TTCTC

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GP:2660639 percent identity: 99.27; identified by sequence similarity; putative;alpha-glucan phosphorylase, authentic frameshift".


TM1282 - Conserved hypothetical protein, authentic frameshift
[Identification alignment]

>TM1282
ATGGAACCAAATCTACTACACATTGTGATCGTATTTTCATCTCTCCTGATATCCTTACTT
TACTTTTTTTTCTCCAGAAAGCGGTTCCTTCACACCAACGGTTTTTCCAAGGAACCGGTA
GGAAAAATCTCTGTTATAATACCTGCAAGAAATGAAGAGAAAAACATAGGAAAAATTCTG
AAACTTTTAAGCATACAAAGGGTTAAACCGCACGAAGTTATCGTTGTTGATGACAACTCT
ACAGATAGAACAAGCGCTGTAGCAGAAAATTTCAAAGACGCTTTTGAAAGATTCATATTG
ATCAGACTAACCAAAGATCCGCCAAAAAATTGGGTAGGAAAGACCTGGGCAATCTGGAAC
GGATATCAAAACTCAAGTGGAGAAATTCTGATATTCATGGATGCAGATGTAGAGCCTGAA
GAAGGAGCGATTGAGGTTCTTGTTGAGATCCATAAGAAACATCCTGGATTGATTTCCGTC
TGGCCTTATCAGAGATTCGAGAGGTTCTATGAGCACTTGAATTTGGTGTTCAACTTAATG
ATCGTCTACGCGAGTAACATGCTCGGCTTTCCATCAAAAAGGCCAAAAGGAGCTTTTGGC
CCTGTGATACTGACTTCGCGAAGGGATTATATGAAAACAGGCGGCCATGCAGCTATCAAA
GATTCTGTTCTTGAAGATCATAAAAAACGGGATCAAGGTGATGAATTTCTTAGGAAATGG
GATTATAAAGTTCAGAATGTATCCAGAAGGGTCCAGACAACTGTCTGAGGGATTTTCGAA
AAACATTTCTTCAGGTGCCTTGACCGGCGGGATTTTGAGCTTCCTACTCGCTTTGATATG
GATTTCTGGTTTTTATTACTCCTTCACGTCCTTTAGAACACCGCTTTGGTGGTCTATGAT
ATATTTCATATTTTCCCTGATTGTCTATTTGCTTTCTAAGCCTCTGGGGGACTACAGGTG
GTACGATGCCTTTCTATATCCGTTGCATTTCACCTTCTTCGCTGCCGTCTTTTTCCATTC
ACTTTACAAAACACTGGTGCTAAAAAAGGTCACGTGGCGCGGAAGAGAAATAAAAATAAG
A

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GP:1340131 percent identity: 56.11; identified by sequence similarity; putative;conserved hypothetical protein, authentic frameshift".


TM1318 - ABC transporter, ATP-binding protein, authentic frameshift
[Identification alignment]

>TM1318
ATGAAAACTATGAACCTCTTCTTATTTACAGGTCATCTCAGAGACGATATCAGAAAATTC
GAGAGGAAGTACAAAAAACACTATCCGATTTTCTTCTTACTGGAAGGCCTGTTTTTGACC
ACCAACGTTCTTACTCCACTTCTCATCAAAAAGACCATCGACAGTGCGTTTTACAGAGGT
GATGTGGGAGAAATCGCTCTGTTCTCAGTACTATACTTCATCGTTCTTGTGGCCCAGTCT
TTCATCATGTACAAACTCAACTACTCAGCTGCGAAGTACTTGCTGAACGCATCCCGGGCT
AGGGAATCGAAGAGCGCATACGCAAAAATCCTGGCGCTTCCACTGACTTACGCGAACTCA
CAGAACACCGGTGATTATCTTTCGGTTTTCATGAGGGATGTTCCGAAGTAGCTTCGGGAG
TTTACCTTGGAAGGTTGCAGTTTTTCTTCAACCTGGGATTTTTCCTTGCTGTTCTTTTCC
TTTTGTTCATTCTGAGTGTAAAACTCACTCTTGTGGTCCTGGTGAGCATCGTTCTGTTTT
TTGTGTCAACCTCGATTCTGAGGAAGATGGTCATCAGGGCTTCTCAGAGAGATCAGGAAT
CCTACCAGAGGTTTTTGAAAAGATCCAGAGAAGTGGTGGAAGGAACACCGGTTCTCAAAC
AGTTTTCTGGCCTTTTGTTTCTCAGAGATTTCCTCGATTCAAGTGCCAGAGAGTGGAGCA
GGGCAAGCATCATCCACAGCACGGTAAACGAACTTTCGAACAGGAACATCGAGATGAACA
GATGGGTCGGGAGTACCATTGTCCTTGCCTTCGGTGTTTACCTTCTCTGGAAGGGTGAGA
TAAGTGTTGGAACGCTTCTGGCTTTTGAATCCTATATGAACTGGATATACGACATCGTCA
GAATGGCACTCACAGGTCTGACGACGTTTTTCTCTACAGTTCCAAACTGGGAAAATTTTA
CAAGAGTGTTTTCCCTGCCCTTTGAAAGAGCAAGCGGAATAGATCTTGAAAGGTTCGAAA
AACTGCAGTTGAAAAACGTGCATTTTAAATACGATGAAACACCTGTTCTCACTGGTTTGA
ATTTCGAGATAAACTCCGGGGATAAGATCGCAATTGTGGCAAGATCTGGTGCAGGAAAGA
GCACGCTCGTTTCTCTTTTCAACAGGCTACTGTCTCCCACAGAGGGTGAAATTCTCATAA
ACGGTGTTCCCATCGAACAGTACTCTCTTCAATCACTCAGAAGAAACATCGTTCTTGTTC
GCTCGAACGATATACTCTTTGATACAACAATCAGAAACAACATCACCCTTTTTGAAGACT
TTCCAGAAGATGAAATCGAAAATGTTCTCAAAATGTGTGAGTGTGACTTTGTTGAGAAAC
TGGAAAGCGGAATAGACACGATCGTAGGAGAGAGGGGAACAAAACTCTCAGATGGTCAAA
GGCAGAGAATAGTCCTCGCGAGAGCTTTGATAAGAAAACCACAGGTTCTCATCCTAGACG
AGGCCACCTCTGGTGTTGATGGCAAGACGGAGGAGAGGATTTTCGAGAAAATCTTGAGGG
AAATAGAAACTGTGATTATCATCTCACACAGGCTTTCCACTGTGAGAAAGGCCAAAAAGA
TATACGTCATGGAGAACGGACGGATTCTGGACTCTGGTTCTCACGAGGAGCTGATCGTCA
GGTGTGAAAAATACACAGAGATTCTAAAGGAGCAGTTTGTAGAA

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GB:AL009126 percent identity: 53.93; identified by sequence similarity; putative;ABC transporter, ATP-binding protein, authentic frameshift".


TM1320 - Conserved hypothetical protein, authentic frameshift
[Identification alignment]

>TM1320
TTGAGTCGTCTTCCAATAGTTGATCCAAAAACTATGGAGAAGGTTTTACTGAAACTTGGA
TTTCAACGTGTTCGCCAAAAAGGAAGTCATGTGTTCTACAGGCACAGCAATGGAAAGTAT
ACGACCATCCCATTTCATGCAAGAGACTTGCCCAAGTCACTTAATAAGAAAAATCATCCG
TGAAGCGGGTATATCAGTGGAAGAATTCAAGAAAATACTCGAAAATCTATATCAGTACTT
TCCCCACACAAACATTTCTACTGTTCTTATACCTCCAATTGACTCCTTACAATCTCCCTG
TAAAGGGGAGATTTTTCCA

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to PID:1652005 percent identity: 57.14; identified by sequence similarity; putative;conserved hypothetical protein, authentic frameshift".


TM1343 - Pyrimidine dimer DNA glycosylase, authentic point mutation
[Identification alignment]

>TM1343
GTGGCACTCTGGAGGGAAAGTTTACTGGCTAAAAAAGTGCTCGAAGGAAAGACCAGGGGG
TACAGAAACCACCCGCAGCTTGAACGTTTTAAAAACCATTCAGAGCCGTTAAAGGCCATA
AACGCCTACCTCTTTGAGGTATAGAGGGAAGCGGAAAGGAGGGGCTACCGCTTTGACATC
AGGAAGATAGAGGTAGTGGAACTCAAGGAAAAGATCCCGGTCACGAAGGGCCAACTTGAA
TATGAGTTTCACCACCTCCTCAAGAAACTTCAGAAAAGAGATCCAGAGCGATATGAAACA
TTGAAAAGCGAAAAGGATATCCACGCCAGTCCTGTGTTTCTGATCATCGAGGGAAAAGTA
GAACACTGGGAAAAAGGAGTCTTCCCT

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic point mutation, causing a premature stop, and is not the result of a sequencing artifact; This region contains an authentic frame shift and is not the result of a sequencing artifact.; similar to PID:1088462 PID:1848058".
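As an illustration of what a premature stop looks like in the sequence, the following minimal sketch reports the first in-frame stop codon of a DNA string. It is applied to the first three lines of the TM1343 record above, read in the frame that begins at the listed GTG. Treating that frame as the intended one is an assumption made for the example, and first_in_frame_stop is a hypothetical helper name.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_in_frame_stop(dna, frame=0):
    """Return the 1-based number of the first stop codon in the given frame.

    Returns None when no stop codon is found. Whitespace and line breaks
    in the input are ignored, so a FASTA body can be passed as-is.
    """
    dna = "".join(dna.split()).upper()
    for codon_number, i in enumerate(range(frame, len(dna) - 2, 3), start=1):
        if dna[i:i + 3] in STOP_CODONS:
            return codon_number
    return None


# First 180 nucleotides of the TM1343 record, copied from the listing above.
TM1343_PREFIX = (
    "GTGGCACTCTGGAGGGAAAGTTTACTGGCTAAAAAAGTGCTCGAAGGAAAGACCAGGGGG"
    "TACAGAAACCACCCGCAGCTTGAACGTTTTAAAAACCATTCAGAGCCGTTAAAGGCCATA"
    "AACGCCTACCTCTTTGAGGTATAGAGGGAAGCGGAAAGGAGGGGCTACCGCTTTGACATC"
)

if __name__ == "__main__":
    print(first_in_frame_stop(TM1343_PREFIX))
```

Under these assumptions the first in-frame stop is the TAG at codon 48, which is consistent with the premature stop described in the NCBI annotation. The sketch cannot tell whether that stop is authentic or the result of a sequencing or annotation error.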


TM1418 - Conserved hypothetical protein, authentic frameshift
[Identification alignment]

>TM1418
GTGAGAGAGGCCATTGTGCTGGCTTCCGGAGCTGGAAAAAGACTCAGATCGGTTACCGGA
GACGTTCCGAAGGTCTTTTACAGGTTCGACGGTTGTGAGCTCGTAAAATATCCGATGATC
TCCCTGATGAAAAACGGTGTTGAAAGATTCGTTCTTGTCGTCTCAGAGGGCTATAGAGAT
CTCGGAGAGAAGGTTCTGAATGATCTGGGTGTCGAGGGGATTGTCGTTGAGAACAAAAAG
GTGGAACTCGGCAACGCGTATTCTTTCTTCCTGAGTGAACCTTACGTGGAGAGCGAGAAA
TTCTTTCTCTCGTGTGGCGATTCTCTCTTTCCACCTGAGGCACTGAAAAGTGCTTTCAGT
GAGGATGAATTCCATATAAAACTCGGTGTGAGTAAGAGAAGTGACCTGATAGATCCTGAA
GAGGCGAGTAAAGTATTAGTAAATGAAGATCGGATCGTTAAGATCGGAAAGAGAATCGAT
GAGTACAACTATTTCGACACGGGTGTTTTTGTGATGACGAAAAAGGTTTACAGCCTCAAG
GAGAGTTTTTCGTGGACTGAGGAAATTTCTCTGTACCATGTGCTGCAGAAGGCGGTTGAC
ACGGGCATGATCGTTAAGGTGTTCGATTTTGGAAATGCTCTGTGGACAGAAATAGACTCT
CCCGAGGATTTGAATGAAAAGGTTTATGAGTTGATGAAAAAGATAAAGGAGGGAGTGGCA
TGCTGAGAAAATCCACCGATGGGTGGATTTCTTCTCTGATAAACAGGAGATTTTCCTCAA
GGATCACAAATCTCATTCTCGAAAAAAACTGGCAGATAACACCCAACCAGATGTCTTTCA
TCAGCTTTCTTGTTGGTGTTCTCGCGTTTCCGTTTTATCTTCTCAAGCTTCCGTGGATCG
CAGGAATTCTTGTTCAGGTCTCTTCTGTTCTGGACGGAGTGGATGGAGAGCTCGCACGCG
CAAGAAACATGTCTTCAAACTGGGGGGCTTTTTTCGATACGATGTTGGACAGGTTCGTGG
ATATACTGGCCGTTCTTGGGCTGTCTCTTTACGGTTGCTTGAAAGATGGTCCCTCTCTTT
CCTTACTCCTCTGGTCAGTTCTTGCCGTCAGCGGTTCTCTCATGGTGAGTTACCTTCACA
GTGTTGGGAAGGTGTTTGGTACCCACCCGGCTCTCGTCGGAAAGCTATCCGGTTTTGCCT
CCAGGGATGTTAGGTTGTTCGTTGTATTCGTTTTTTCTCTCTTTGGAATGCATCTTCCTG
CTCTTGTAGTTATCTCGATCCTGTCGTATGTTTACACTACTGGGAAATTCGTGGAACTTC
TGGTGCTCAACAGG

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GB:Pyro_h percent identity: 60.58; identified by sequence similarity; putative;conserved hypothetical protein, authentic frameshift".


TM1438 - Conserved hypothetical protein, authentic point mutation
[Identification alignment]

>TM1438
ATGAGAGGAAAAATACTGATATTTCTGCATGCGCATCTTCCATACGTTCACCATCCTGAA
TACGATCATTTTTTGGAAGAAAGGTGGCTTTTTGAGGCCATAACAGAAACTTACATACCG
CTTTTAATGATGTTCGATGAAATAGAAGATTTCAGGTTGACCATGTCGATCACTCCTCCG
CTGATGGAGATGCTCTCCTCCAGAGACCTTCAGGAGAAGTACGAAAGACACATGGAAAAA
CTGATCGAACTCGCAAACAAGGAAGTGGAGAGAACTAAAAAGGAGCACCCGCTGAAGCAT
AAGATGGCTAAATTCTACCGTGAACATTTTGAAAAAATTCTGAACGTGTTTCGCTCTTAC
GATGGAAACATCTTGGAGGGCTTCAAAAAATACCAGGAGACCGGAAAGCTGGAGATAGTG
ACCTGCAACGCCACACACGCGTTTTTGCCGCTCTATCAGATGTACCCAGAGGTGGTGAAC
GCTCAGATCACAGTTGGCGTGAAGAACTACGCTAAGCACATGACGTAACACCCAAGGGGT
ATTTGGCTTGCGGAATGCGGATACTATCAGGGGCTGGATCTGTACCTTGCCCAGAACAAC
GTTGAGTATTTCTTTGTGGATTCTCATGCCTTCTGGTTCGCCGATGAACAACCCAGATAC
GGTGTCTACAGACCCATCATGACGCCAAGTGGTGTTTTCGCCTTCGCACGAGATCCGGAG
TCGAGCGAACAGGTCTGGAGTGCAGCCGTTGGGTATCCTGGTGATCCAAGGTACAGAGAA
TTCTACAGAGATATAGGTTTCGACAGAGAAATGGAGTACATAAAAGATTACATAGACCCT
TCTGGAGTCAGGATAAACACCGGAATAAAATACCACAGGATAACTTCGAAAAGCTTGGAT
GCTTCGCAGAAAGAATATTACGACATAGATCTGGCCATGGAAGCTGTGGAAGAACACGCG
AGGGACTTCCTTCACAAAAAGGAAAGTCAGGCAAGAAGATTGATGGATATAATGGGTGTC
GAACCGGTCATCGTTGCTCCCTTCGACGCTGAGCTCTTCGGTCACTGGTGGTTCGAGGGT
GTGTTCTTCTTGAAGAGGTTCTTTGAACTGGTGAATGAATCAAAAGACCTGAAGCTCGTC
ACCGCATCCGAAGTTATAGACACTCTCGAAGAGGTTCAGATCGCCACACCCGCCGACTCG
AGCTGGGGTGCCGGAGGATACTACGAAACGTGGCTCAACGGAACGAACGACTGGATCTAC
AGGCATCTCCACGAGATGATCGAAAGAATGATAGATCTTTCGAAAAAGTACTACAACAGT
TCCGATCCACTCGTGGAAAGGGTTTTGAATCAGATGCTGAGAGAACTATTTCTCGCACAA
TCGAGCGACTGGGCTTTCATCATGACTACAAGAACGAGTGTTCAATACGCGGAAAACAGA
ACGAAGCTTCACATAAAGAGGTTTCTGAACCTCTACGATCAACTCGTTTCTGGAAGAATA
GACGAAGAGATGCTAAGATACTACGAGTGGACGGATGCCATCTTTCCAGAGATAAACTTC
AGGGTGATGGCGAGGGATGTGATT

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic point mutation, causing a premature stop, and is not the result of a sequencing artifact; This region contains an authentic frame shift and is not the result of a sequencing artifact.; similar to PID:1653832".


TM1445 - Ribosomal protein S1, authentic frameshift
[Identification alignment]

>TM1445
ATGGAACCATTTGAATTCAACGATGAAATCCTCAGCCAGTACGAACCTGAGGAGTTCAGA
AGAGGACAGATTGTAAAAGGTGTGGTTATCGGCAAGGAAGACGATGGTGTGGTGGTGGAC
TTCGGGGGAAAAAGCGAGGGATTCGTACCGGAGAACGAACTGATCAAATCCTTAGATGAG
TACAAAGTGGGTGAGAATCTCACCCTTCAAATACTGAATCTGAACTACGAAGAGAGATCT
ATTCTCTCGGAGAGGAGACCAGTTCTTCGAAAAACGCTTGAAGAGTTGAGAAAAGATTAC
GAAGAAAAGAAACCTGTGAAAGCCCGCATCGTTTCGCAGACGAAAGGAGGATACAACGTT
TTACTGAAAGGAGTAGTCTCTGCTTTTCTTCCAGGATCACACTCCCTCCTCAGAAGGAAC
GATCCCATGCCTGAAAAAGAAATAGAAGTCATCATTTTGGAAATGGCTCAGACGAGACGG
GGACCAAGAATCGTTGTTTCGCGAAGAGCTCTCCAGGATAAGAAGATCGAGGAGTTCTTC
TCAGAGAAGAAAGTGGGAGACATCGTAGAAGGTACCGTGAAAGGGATCAGTAACGCAGGT
GTCGAAGTTGAAATTTCAGAGGGAGTGAGGGGATTCATTCCCAGGAGCGAACTCAGCTAC
GACACCAGAATATCTCCTGAAGACATAGTGAAACCCGGCCAGAACATAACAGCGAAGATA
ATCGAACTGGACAAGGTGAAAAAGAATGTCATCCTGAGTTTGAAGAAACTCATGCCCGAT
CCGTGGGAAAAAGTGGAAGAGAAGTATCCGGTTGGAAAAGTGGTGAGTGGAGAAGTGACT
TCGATTCATCCGTTCGGATTTTTTGTGAGACTGGAGCCCGGCGTGGAAGGACTTGTGCCA
AGGTCTGAAGTCTTCTGGGGGAACGCAAGAAAAAGTCTTGAAGAAGTGGTGAGTGTTGGA
GATCTGGTGAAGTTGAAGTTATCAATGTAGACAAAGAAAACAGAAAACTCACTTTGAGCT
ACAGAAAGGCAAAGGGAGATCCATGGGAAAACATCGAAGACAGGTACAATGTCAACAACG
TGGTGACAGGAAAGGTGACGGGAATCATAAAACAGGGAGCTTTTGTCGAGTTAGAAGAAG
GTGTTGAAGGATTCGTTCCCGTCTCTGAAATTTCATGGAAAAGAATTGATGAACCTGGAG
AAATTCTAAAGATCGGCGAAAAGGTGAAAGTGAAGATCTTGAAGATAGACAAAGAGAACA
GAAAGATCACTCTCAGCATAAAAAGAACACAGGAAAATCCCTGGGAACGTGCTCTCAAAG
AGTTGAAACCAGATTCCATCGTGAGTGGAACCATAAAGAAGATCGTGAACTCGGGAGTGG
TAGTGGAAGTCGAAGAGTACGATGTGGAGGGTTTCGTGCCGAACAACCATCTTCTCAGCG
AGCCTGAAACGGGAAAAGCTTTAAACCTCGTCGTTCTTAGAATAGATCCCGACGAGGTGT
TCGGTGGAAGAATGATACTGAGCGAAAAGCGGTATGAAGAAAGGAAAAACATAGAGGAGT
ACAAAAAAATGGTGGAGAAGGAGAGCTCTCAGAAATCCATTGGAGACCTTCTCAAAAAGA
ATGGAGAG

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to SP:Q48082 PID:1574150 percent identity: 55.28; identified by sequence similarity; putative;ribosomal protein S1, authentic frameshift".


TM1477 - Translation initiation factor IF-1, authentic frameshift
[Identification alignment]

>TM1477
ATGGGAAAGGAAGATGTCATCCGGATGGAAGGAACCATAATAGAGGCTTTACCTAACGCT
ATGTTTAGAGTAGAATTAGACAATGGGCACAAAGTGCTAGCCCATGTTTCTGAGCAGGAT
GAGAAAAAATTTTATAAGACTGGTTCCTGGAGATCGGGTTATTGTTGAACTCTCTGTGTA
CGATCTCACTCGAGGACGAATCGTTTATAGAAAAAAACCAGAG

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GB:M26414 SP:P20458 PID:142459 PID:1044987 GB:AL009126 percent identity: 84.93; identified by sequence similarity; putative".
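
The frameshift annotated for TM1477 can be checked directly against the sequence above. The following is a minimal Python sketch, not part of the TIGR or NCBI annotation. It translates the entry with the standard genetic code, reports the first in-frame stop, and then translates the downstream stretch in the frame shifted by one nucleotide. The helper names `translate` and `first_stop` are hypothetical. The offset 121 was picked only for illustration; the source does not state where the frameshift lies.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def translate(dna):
    """Translate the complete codons of dna; '*' marks a stop codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def first_stop(dna):
    """Return the 0-based nucleotide index of the first in-frame stop, or -1."""
    idx = translate(dna).find("*")
    return 3 * idx if idx >= 0 else -1


# TM1477 exactly as listed in this entry.
TM1477 = (
    "ATGGGAAAGGAAGATGTCATCCGGATGGAAGGAACCATAATAGAGGCTTTACCTAACGCT"
    "ATGTTTAGAGTAGAATTAGACAATGGGCACAAAGTGCTAGCCCATGTTTCTGAGCAGGAT"
    "GAGAAAAAATTTTATAAGACTGGTTCCTGGAGATCGGGTTATTGTTGAACTCTCTGTGTA"
    "CGATCTCACTCGAGGACGAATCGTTTATAGAAAAAAACCAGAG"
)

stop_at = first_stop(TM1477)
frame0 = translate(TM1477[:stop_at])   # product of the annotated frame
shifted_tail = translate(TM1477[121:])  # illustrative offset, shifted frame

print("length:", len(TM1477), "length mod 3:", len(TM1477) % 3)
print("first in-frame stop at nt index:", stop_at)
print("frame 0 product:", frame0)
print("shifted-frame tail:", shifted_tail, "contains stop:", "*" in shifted_tail)
```

With the sequence as listed, the annotated frame runs into a stop before the end of the entry, whereas the downstream stretch stays open in the shifted frame, which is the pattern expected for a frameshift within the ORF.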


TM1710 - Conserved hypothetical protein, authentic frameshift
[Identification alignment]

>TM1710
TTGAAGAGAATAGTCGTTGTGTCCGGGCTTTCGGGGGCGGGAAAAACCACCGCGATGGGT
TTTCTGGAGGATCTGGGGTATTTTTGTGTTGATAACGTTCCCGGTAACATACTGGAGGAA
CTTTTGAAGCTCTTCATGAGTTCCGATCTTGAAAAGATGGCGATGGCCATTGATGTGCGA
AGCGAACACCTTGGAGATCCCATATCGACGGTTGAGAGAATAAAAGAAAAGACGAATGCT
CTGGTAATTTTTCTGGAGGCGTCCACGGAGGAACTTCTGAGGAGATACGCTCTAACGAGA
AGAAGGCATCCTCTTCAGAAGGACGGAATAGGGCTGGAAGATGCCATAGAGAAGGAGAGG
AAAATCCTGTCTCGTATAAGGGAAATCGCGGACGTTGTGATAGATACAACCAGCATGAAC
ACACATCAACTGCGTGAAACTCTGACGCATTTTCTCGTGAACCAGGCCGGTGGAACCTCT
GTCAGAATAATGAGTTTCGGATTCAAACACGGAATCCCAATGGACGCGGATTTCGTTTTT
GATGCTCGTTTTCTCCCGAACCCACACTACGTACCGGAACTTTCTTCAAAGACGGGTTTG
GATAGCGAGGTAGAAGCGTACTTTAAAAATTATCCAGTGGTGGAAGAGTTCATCGAAGAA
GATCTATGAAGTACTCAAAGTGGCGATAGAAGAGTACCAGAGAACTGGAAGAAGGATAAT
CACTGTGGGAATAGGATGTACCGGCGGCAAGCATAGATCTGTGTATATCGCACACAGGCT
GAAAGAAATGCTGGAAAAAGAAGGTTTTACGGTCATTGAGAAACACAGGGACATAGAGAA
GGTG

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GB:AL009126 percent identity: 64.31; identified by sequence similarity; putative;conserved hypothetical protein, authentic frameshift".


TM1725 - Vacuolar ATP synthase subunit D-related protein, authentic point mutation
[Identification alignment]

>TM1725
TTGCCAAGCACCGACAGACTGTATCTCAGATTGAGATGAAAATAGAAAGTGAAGAAAAGG
GTCAATGCTCTGGAGAACGTGGTGATCCCCCAGCTCAAAGAAACGATAAAATACATTCAG
GATACCCTCGAAGAACAAGAAAGAGAAGAGTTCTTCAAGATCAAGCGATTGAAAGAAAGG
GTGCAGGTTGGAAGAAGA

NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic point mutation, causing a premature stop, and is not the result of a sequencing artifact; This region contains an authentic frame shift and is not the result of a sequencing artifact.; similar to GB:Pyro_h".
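
The premature stop noted for TM1725 can be located by scanning the sequence above in the frame of its first codon (TTG). A minimal Python sketch follows. The function name `in_frame_stops` is hypothetical, and the scan assumes that the first base of the entry is the first base of codon 1.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def in_frame_stops(dna):
    """Return (0-based nucleotide index, codon) for every in-frame stop codon."""
    return [(i, dna[i:i + 3])
            for i in range(0, len(dna) - 2, 3)
            if dna[i:i + 3] in STOP_CODONS]


# TM1725 exactly as listed in this entry.
TM1725 = (
    "TTGCCAAGCACCGACAGACTGTATCTCAGATTGAGATGAAAATAGAAAGTGAAGAAAAGG"
    "GTCAATGCTCTGGAGAACGTGGTGATCCCCCAGCTCAAAGAAACGATAAAATACATTCAG"
    "GATACCCTCGAAGAACAAGAAAGAGAAGAGTTCTTCAAGATCAAGCGATTGAAAGAAAGG"
    "GTGCAGGTTGGAAGAAGA"
)

stops = in_frame_stops(TM1725)
for index, codon in stops:
    # Codon numbers are 1-based, counting the TTG start as codon 1.
    print(f"{codon} at nt index {index} (codon {index // 3 + 1})")
```

On the sequence as listed, the first in-frame stop is a TGA at codon 13, so only a short N-terminal peptide would be produced from the annotated start.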


TM1781 - Argininosuccinate lyase, authentic frameshift
[Identification alignment]

JCSG Status: CRYSTALLIZED

>TM1781
ATGAGTGAAAAACTCTGGGAGAAAGGCTACAAAGTCAACGAAGAAGTAGAAAAATTCACT
GTCGGAGACGATTACATAACGGACATGAAGATCATAGAATACGACATAAAGGCCTCCATA
GTACACTCCAGGATGCTACACAAAATAGGCCTTCTGAGCGCGGAAGAACAAAAGAAAAAG
AAGAAGCGCTCAGTGAACTCCTCAATCTTGTAAAAGAGGGAAAGTTCCAGATAAAACCGG
AGGAGGAAGACTGCCACACCGCCATCGAGAACTTTCTCGTGAAAAAACTTGGAGAGATCG
GAAAAAAGATCCACACCGCTCGCTCAAGGAACGATCAGGTCTTAACCGCGCTGAGACTCA
TGTACAAGGAAGAATTGAAAGAGATAGAAAACCTCATTAGAGAACTTCAAAAAAGCCTGG
AAAGATTCATAGAAAAGTTCGGTGACGTGAAATTTCCAGGATACACCCACACCAGAAAAG
CGATGCCAACCGATTTTGCAACGTGGGCAGGAGCGCTGAAAGACGCCCTCGAAGACGATC
TGAAACTTCTCAAAACGGCTTACGAAATCGTAGATCAATCGCCTCTGGGGACGGGAGCTG
GCTACGGTGTTCCCATCGACATAGACAGAGAGTTTACAGCGAAGGAACTCGGATTCTCGA
AGGTCCAGTGGAATCCCATCTACACCCAGAACAGCAGGGGAAAGTTCGAATATCTGATTC
TTCACACGCTCTCTCAGATATCTTACGATCTGAACCGGTTCGCCTCCGATATCATATTCT
TTTCTCTTCCAGAGATAGGTTATCTCAAACTGCCAAAAGAGCTCTGCACGGGAAGTTCCA
TCATGCCACACAAGATAAATCCGGATCCACTGGAACTCGTAAGGGCCCATCACCACACGA
TAGTTTCGAAGATGCTGATGGCAGTCACTCTGCCGTCAAATCTCATCTTCGGTTACCACA
GAGACTTCCAGCTTTTGAAGAAGCCGGTGATAGAGGCTTTCGAAGTTGTTAAGAATATCG
TAAGAATAATGAAAATAATTTTTGACCATCTTGAAGTTGATAAAGAAAGATCTGAGTCTA
GTATTACTGAGGAAGTACTGGCCACACACAGAGTCTATGAACTGGTGAAACAAGGAGTAC
CGTTCCGCGACGCTTACAGGATGGTAGCGGAAAAGTACGGGAGGGAAAAAGAT

NOTES: The status of this putative pseudogene has been corrected in GenBank. TIGR predicted the wrong reading frame and start codon. The entry was subsequently corrected (September 10, 2001) and now corresponds to protein entry NP_229578 in NCBI. The genomic sequence, however, is still annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to GB:AE000657 percent identity: 58.49; identified by sequence similarity; putative;argininosuccinate lyase, authentic frameshift".

The corrected protein entry is:

>gi|15644632|ref|NP_229578.1| argininosuccinate lyase [Thermotoga maritima]
MGERLQSQRRSRKIHCRRRLHNGHEDHRIRHKGLHSTLQDATQNRPSERGRTKEKEEALS
ELLNLVKEGKFQIKPEEEDCHTAIENFLVKKLGEIGKKIHTARSRNDQVLTALRLMYKEE
LKEIENLIRELQKSLERFIEKFGDVKFPGYTHTRKAMPTDFATWAGALKDALEDDLKLLK
TAYEIVDQSPLGTGAGYGVPIDIDREFTAKELGFSKVQWNPIYTQNSRGKFEYLILHTLS
QISYDLNRFASDIIFFSLPEIGYLKLPKELCTGSSIMPHKINPDPLELVRAHHHTIVSKM
LMAVTLPSNLIFGYHRDFQLLKKPVIEAFEVVKNIVRIMKIIFDHLEVDKERSESSITEE
VLATHRVYELVKQGVPFRDAYRMVAEKYGREKD

The corrected coding sequence (CDS) is not available at NCBI. The JCSG-generated CDS is:

>TM1781 CDS for NP_229578.1
ATGGGAGAAAGGCTACAAAGTCAACGAAGAAGTAGAAAAATTCACTGTCGGAGACGATTA
CATAACGGACATGAAGATCATAGAATACGACATAAAGGCCTCCATAGTACACTCCAGGAT
GCTACACAAAATAGGCCTTCTGAGCGCGGAAGAACAAAAGAAAAAGAAGAAGCGCTCAGT
GAACTCCTCAATCTTGTAAAAGAGGGAAAGTTCCAGATAAAACCGGAGGAGGAAGACTGC
CACACCGCCATCGAGAACTTTCTCGTGAAAAAACTTGGAGAGATCGGAAAAAAGATCCAC
ACCGCTCGCTCAAGGAACGATCAGGTCTTAACCGCGCTGAGACTCATGTACAAGGAAGAA
TTGAAAGAGATAGAAAACCTCATTAGAGAACTTCAAAAAAGCCTGGAAAGATTCATAGAA
AAGTTCGGTGACGTGAAATTTCCAGGATACACCCACACCAGAAAAGCGATGCCAACCGAT
TTTGCAACGTGGGCAGGAGCGCTGAAAGACGCCCTCGAAGACGATCTGAAACTTCTCAAA
ACGGCTTACGAAATCGTAGATCAATCGCCTCTGGGGACGGGAGCTGGCTACGGTGTTCCC
ATCGACATAGACAGAGAGTTTACAGCGAAGGAACTCGGATTCTCGAAGGTCCAGTGGAAT
CCCATCTACACCCAGAACAGCAGGGGAAAGTTCGAATATCTGATTCTTCACACGCTCTCT
CAGATATCTTACGATCTGAACCGGTTCGCCTCCGATATCATATTCTTTTCTCTTCCAGAG
ATAGGTTATCTCAAACTGCCAAAAGAGCTCTGCACGGGAAGTTCCATCATGCCACACAAG
ATAAATCCGGATCCACTGGAACTCGTAAGGGCCCATCACCACACGATAGTTTCGAAGATG
CTGATGGCAGTCACTCTGCCGTCAAATCTCATCTTCGGTTACCACAGAGACTTCCAGCTT
TTGAAGAAGCCGGTGATAGAGGCTTTCGAAGTTGTTAAGAATATCGTAAGAATAATGAAA
ATAATTTTTGACCATCTTGAAGTTGATAAAGAAAGATCTGAGTCTAGTATTACTGAGGAA
GTACTGGCCACACACAGAGTCTATGAACTGGTGAAACAAGGAGTACCGTTCCGCGACGCT
TACAGGATGGTAGCGGAAAAGTACGGGAGGGAAAAAGAT
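
The relationship between the TIGR ORF, the JCSG-generated CDS and NP_229578 can be checked with a short script. The sketch below is a consistency check only and makes no claim about how the correction was originally derived. It uses just the first 60 nt of each nucleotide sequence and the first 20 residues of the protein, copied from this entry. The variable and function names are hypothetical.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def translate(dna):
    """Translate the complete codons of dna; '*' marks a stop codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First FASTA line of each sequence and the first 20 residues of the protein.
TIGR_ORF_HEAD = "ATGAGTGAAAAACTCTGGGAGAAAGGCTACAAAGTCAACGAAGAAGTAGAAAAATTCACT"
JCSG_CDS_HEAD = "ATGGGAGAAAGGCTACAAAGTCAACGAAGAAGTAGAAAAATTCACTGTCGGAGACGATTA"
NP_229578_HEAD = "MGERLQSQRRSRKIHCRRRL"

# Locate the JCSG CDS (without its start codon) inside the TIGR ORF.
probe = JCSG_CDS_HEAD[3:33]
hit = TIGR_ORF_HEAD.find(probe)
start_in_tigr = hit - 3            # position of the new start codon
frame_offset = start_in_tigr % 3   # 0 would mean the same frame as TIGR

print("new start at nt index", start_in_tigr, "of the TIGR ORF")
print("frame offset relative to the TIGR start:", frame_offset)
print("triplet at that position in the TIGR ORF:",
      TIGR_ORF_HEAD[start_in_tigr:start_in_tigr + 3])
print("CDS translation matches NP_229578:",
      translate(JCSG_CDS_HEAD) == NP_229578_HEAD)
```

On these excerpts, the JCSG CDS starts 14 nt downstream of the TIGR start and in a different reading frame, and its translation reproduces the N-terminus of NP_229578. The triplet at that position in the TIGR sequence reads CTG, whereas the JCSG CDS lists ATG.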


TM1837 - Maltose ABC transporter, permease protein, authentic frameshift
[Identification alignment]

>TM1837
TTGATATTACTTTTTGTGCTCGTTCTCTATCCCATATACTTCACGGTGAAAGTTGCGTTC
ACCAATTATGGAACAGGACACCTCATGACGAAACAAGAGGCCATCGAGAGAATTCTTTTC
GACCCGAATTACACTTATGTTCCAGAGAGCGCTGAACCTGTTGAATACATGGTTTTCTCC
GTGTTCAACGGTTTGAATCCTACCGAGGATTTTGTTGTTCTCTTCGAGAAAGATGGGAAC
ATCTACATTGCGGACGCCCCGTTGTCACAAAGAGGAGTGGGAAAGAGGTCCTTTTGAGAG
AGTCAACTCTCTTTCCCGTGAAGGATGGAACCGCCGAAGTGAATGGGAAGGTTTATGAGA
TAGTCCCCTGGCCCGCTTCCATAAAAGAGGTGAACGCCGTCTATTCAGACGGTAAAATTT
ACAAACCGCTTTACTCACCTGAAGAGGTTTCCCTGAAGAGGTACGAACCTTTCTTCAAGG
TAAATGTGGTTCAGAAGTATCTCAACAGGGCAGAGTTTTGGCTCGAGGATCAGAGTTACA
TGTTCAGAATAGGTGAAAACGGTGAGTGGAACTTTTATCCAGTGAAGAGGCTTTATTCTC
TGTCTTTCGAAGAATCCCTCGAAAACGGAAGGATAACAACGAAACTTGTGGTGAAAAACA
ATCTCACTGGAAGGCATCTCGTTGAAAGGGAAGGTGCTTTCTACGATTACGACGAAAATG
GAAGAGAATTTTTCGTGATTGGATACATGGAATACATTGGTTTCAAGAACTTCTCCAGGA
TATTCACCGATCCGAAGATCGCTGGTCCTTTTTTCAAGGTTTTCACCTGGACGTTCACCT
GGGCAGCCCTCAGCGTTCTTTTAACGTTCGCGATAGGTCTTTCCCTCGCTCTTGTTCTGA
ACGATAAAACACTGAAGGGAAAGAACGTGTACAGGACGTTACTCATTATTCCATGGGCTG
TGCCAGCTTTCATTTCCGTTCTCGTCTGGAGAAACGGGATGTTCAACGAGACTTATGGAA
TTCTCAACCGATTTGTTCTTCCGTTCTTGGGATTGGATCCGGTGAAGTGGTTCAACGATC
CGTTCTGGGCGAAGGTCGCAGTTCTCACCGTCAACACGTGGCTGGGATTTCCGTACATGA
TGGCCGTCTCACTCGGAGCTTTGCAGAGCATTCCAGAGGAGCTCTACGAAGCCGCTGCGA
TCGATGGAGCTGGAAGGTGGAGAAGGTTCTGGACTATCACGTTTCCGCTTCTGATGACCA
CTGTGGCTCCTCTGTTGGTCGGAAGTTTCGCTTTCAACTTCAACAACTTTGTGAACATAT
ATCTGCTCACCGGTGGAGGTCCCGCAATGGCTGGAACCACAACTCCTGTTGGACACACGG
ACATTTTGGTCTCCTACGTTTACAAACTCGCGTTCGAAGGAGGAAGAGGACAGGACTTCG
GTTTTGCCAGTGCCATATCCATCATCATATTCTTCCTCGTTGGAGGAATCAGCTTTGTGA
ACTTCAAACTCTCTGGTGCGTTTGAAGAGGTGAGTGAA


NOTES: Annotated in NCBI (AE000512) as "This region contains an authentic frame shift and is not the result of a sequencing artifact; similar to SP:P18812 percent identity: 69.97; identified by sequence similarity; putative;maltose ABC transporter, permease protein, authentic frameshift".