; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g26200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g26200
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionFlagellar attachment zone protein 1
Genome locationchr5:18669621..18674777
RNA-Seq ExpressionMoc05g26200
SyntenyMoc05g26200
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG53679.1 hypothetical protein EZV62_018935 [Acer yangbiense]4.5e-0924.76Show/hide
Query:  TSRVLLEGSPVKSRKPRG-NKRVESPENSGDCS-DFSYSNCPEELLRILRYNYSIPTDIELRIPATSETIDKPSLGCVSFYPQMFEYGM-----------
        TS V LE +   S    G +    S E   D S D   S   +  L  L  +Y IP +I LR+P        P  G V+ +   FE+G+           
Subjt:  TSRVLLEGSPVKSRKPRG-NKRVESPENSGDCS-DFSYSNCPEELLRILRYNYSIPTDIELRIPATSETIDKPSLGCVSFYPQMFEYGM-----------

Query:  ---------------------------------------LAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNW
                                                 ++ LK   K+ G Y LS +PG   +V            ++ P+S KNWK +WF+ SG+W
Subjt:  ---------------------------------------LAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNW

Query:  LMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLAGCATLSAWDRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSRRWEILLGVGTSGFATE
            +E      IP  F    +    PELT E ++ +     + + DR    LL+ KNL+        S   M+ R    K      I +GV   G   +
Subjt:  LMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLAGCATLSAWDRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSRRWEILLGVGTSGFATE

Query:  EEVATSS
         +V  SS
Subjt:  EEVATSS

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.0e-0836.36Show/hide
Query:  MLAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVL
        +LA    K+ +K PG++Y+    G   +V G             PTSIK W  +WFY SG WL     G  + ++P  F  LV I P+PELT+ S   L
Subjt:  MLAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVL

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]1.6e-0931.51Show/hide
Query:  MLAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLA
        +LA    K+ +K PG++Y+    G   +V G             PTSIK W  +WFY SG WL     G  + ++P  F  LV I P+PELT+ S   L 
Subjt:  MLAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLA

Query:  GCATLSAWDRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSR
                 R    L++++ L   GL+      + NP     +SSR
Subjt:  GCATLSAWDRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSR

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]1.2e-0932.19Show/hide
Query:  MLAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLA
        +LA    K+ +K PG++Y+    G   +V G             PTSIK W  +WFY SG WL     G  + ++P  F  LV I P+PELT+ S   L 
Subjt:  MLAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLA

Query:  GCATLSAWDRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSR
                 R    L+++K L   GL+      + NP     +SSR
Subjt:  GCATLSAWDRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSR

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.9e-1531.51Show/hide
Query:  SNCPEELLRILRYNYSIPTDIELRIPATSETIDKPSLGCVSFYPQMFEYGM-LAIHT-----LKKSSKAPGQ----------------------------
        S  PE  L  LR  ++IP +I LR+P   E  D P  G V+ Y +MFEYG+ L +H      L ++  AP Q                            
Subjt:  SNCPEELLRILRYNYSIPTDIELRIPATSETIDKPSLGCVSFYPQMFEYGM-LAIHT-----LKKSSKAPGQ----------------------------

Query:  ----YYLSCFPG--IAKLVNGHVM--EKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLAGCATLSAW
              L+CF    IAK      M   K    IV  PTSIK W  +WFY SG WL     G  + ++P  F  LV I P+PELT+ S   L         
Subjt:  ----YYLSCFPG--IAKLVNGHVM--EKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLAGCATLSAW

Query:  DRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSR
         R    L++++ L   GL+      + NP     +SSR
Subjt:  DRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSR

TrEMBL top hitse value%identityAlignment
A0A5C7HA73 Plus3 domain-containing protein2.2e-0924.76Show/hide
Query:  TSRVLLEGSPVKSRKPRG-NKRVESPENSGDCS-DFSYSNCPEELLRILRYNYSIPTDIELRIPATSETIDKPSLGCVSFYPQMFEYGM-----------
        TS V LE +   S    G +    S E   D S D   S   +  L  L  +Y IP +I LR+P        P  G V+ +   FE+G+           
Subjt:  TSRVLLEGSPVKSRKPRG-NKRVESPENSGDCS-DFSYSNCPEELLRILRYNYSIPTDIELRIPATSETIDKPSLGCVSFYPQMFEYGM-----------

Query:  ---------------------------------------LAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNW
                                                 ++ LK   K+ G Y LS +PG   +V            ++ P+S KNWK +WF+ SG+W
Subjt:  ---------------------------------------LAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNW

Query:  LMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLAGCATLSAWDRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSRRWEILLGVGTSGFATE
            +E      IP  F    +    PELT E ++ +     + + DR    LL+ KNL+        S   M+ R    K      I +GV   G   +
Subjt:  LMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLAGCATLSAWDRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSRRWEILLGVGTSGFATE

Query:  EEVATSS
         +V  SS
Subjt:  EEVATSS

A0A6J1CR42 uncharacterized protein LOC1110138264.9e-0936.36Show/hide
Query:  MLAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVL
        +LA    K+ +K PG++Y+    G   +V G             PTSIK W  +WFY SG WL     G  + ++P  F  LV I P+PELT+ S   L
Subjt:  MLAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVL

A0A6J1DWD2 uncharacterized protein LOC1110246807.6e-1031.51Show/hide
Query:  MLAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLA
        +LA    K+ +K PG++Y+    G   +V G             PTSIK W  +WFY SG WL     G  + ++P  F  LV I P+PELT+ S   L 
Subjt:  MLAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLA

Query:  GCATLSAWDRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSR
                 R    L++++ L   GL+      + NP     +SSR
Subjt:  GCATLSAWDRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSR

A0A6J1DWF1 uncharacterized protein LOC1110251085.8e-1032.19Show/hide
Query:  MLAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLA
        +LA    K+ +K PG++Y+    G   +V G             PTSIK W  +WFY SG WL     G  + ++P  F  LV I P+PELT+ S   L 
Subjt:  MLAIHTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLA

Query:  GCATLSAWDRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSR
                 R    L+++K L   GL+      + NP     +SSR
Subjt:  GCATLSAWDRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSR

A0A6J1DXS5 uncharacterized protein LOC1110255029.2e-1631.51Show/hide
Query:  SNCPEELLRILRYNYSIPTDIELRIPATSETIDKPSLGCVSFYPQMFEYGM-LAIHT-----LKKSSKAPGQ----------------------------
        S  PE  L  LR  ++IP +I LR+P   E  D P  G V+ Y +MFEYG+ L +H      L ++  AP Q                            
Subjt:  SNCPEELLRILRYNYSIPTDIELRIPATSETIDKPSLGCVSFYPQMFEYGM-LAIHT-----LKKSSKAPGQ----------------------------

Query:  ----YYLSCFPG--IAKLVNGHVM--EKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLAGCATLSAW
              L+CF    IAK      M   K    IV  PTSIK W  +WFY SG WL     G  + ++P  F  LV I P+PELT+ S   L         
Subjt:  ----YYLSCFPG--IAKLVNGHVM--EKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLAGCATLSAW

Query:  DRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSR
         R    L++++ L   GL+      + NP     +SSR
Subjt:  DRYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSR

SwissProt top hitse value%identityAlignment
C9ZN16 Flagellar attachment zone protein 11.6e-0427.01Show/hide
Query:  SAQLANYNILMKELRKEQKFLAKEKEEFKTLKCETLNVVAASEKCILENNKFKVDKIKLEEEVSRLQVENSELKAEADQVGPLVAQLREELAELKTSTGN
        S QL N   + +EL +E++  +    + + L  E     A +E+ +LENNK + D   L  +V RL +E  ELKA  ++      +L EEL ELK +   
Subjt:  SAQLANYNILMKELRKEQKFLAKEKEEFKTLKCETLNVVAASEKCILENNKFKVDKIKLEEEVSRLQVENSELKAEADQVGPLVAQLREELAELKTSTGN

Query:  VLRAEVEKRKTIETELTRTLDAKVVRRESIEAELVNTKLKLSQTESHLATIESKFREFEKVVEDSVSKLADTESRLANTEAKISNFDLLYEIMSNFPEFR
         L  E+E +     +L   LD K    E +  EL   +LK+++ E     +E K  E EK+ E+   K A+ E      E K +  + L E +    E +
Subjt:  VLRAEVEKRKTIETELTRTLDAKVVRRESIEAELVNTKLKLSQTESHLATIESKFREFEKVVEDSVSKLADTESRLANTEAKISNFDLLYEIMSNFPEFR

Query:  QLEKDLEYFDLEYAVDWLKRFAKEVDLINSSDGPLISCIEISSDHSGDFSSSSSYSSSVPTSPVAPNPSLALKQEKESWEKEKLVLEEESRAMERRITFL
          E +    +LE      ++ A+ +DL  + +  L   +++ +  +   +             VA N  LA + E ++ E EKL  E E +A E      
Subjt:  QLEKDLEYFDLEYAVDWLKRFAKEVDLINSSDGPLISCIEISSDHSGDFSSSSSYSSSVPTSPVAPNPSLALKQEKESWEKEKLVLEEESRAMERRITFL

Query:  ERQLSMERKHK
        E +L      K
Subjt:  ERQLSMERKHK

Q585H6 Flagellar attachment zone protein 19.2e-0525.28Show/hide
Query:  EMELSHRDTAG--HKNEIKLTEATRRA----------NVCSAQLANYNILMKELRKEQKFLAKEKEEFKTLKCETLNVVAASEKCILENNKFKV-DKIKL
        E+E   RD +G   +NE    E  R+           N   + + N N+ ++ L +E +  A E E+      E L + AA  + + E  + KV +  KL
Subjt:  EMELSHRDTAG--HKNEIKLTEATRRA----------NVCSAQLANYNILMKELRKEQKFLAKEKEEFKTLKCETLNVVAASEKCILENNKFKV-DKIKL

Query:  EEEVSRLQVENSELKAEADQVGPLVAQLREELAELKTSTGNVLRAEVEKRKTIETELTRTLDAKVVRRESIEAELVNTKLKLSQTESHLATIESKFREFE
         EE+     EN +L  E +       +L EEL ELK +    L  E+E +     +L   L+ K    E +  EL   +LK ++ E     +E K  E E
Subjt:  EEEVSRLQVENSELKAEADQVGPLVAQLREELAELKTSTGNVLRAEVEKRKTIETELTRTLDAKVVRRESIEAELVNTKLKLSQTESHLATIESKFREFE

Query:  KVVEDSVSKLADTESRLANTEAKISNFDLLYEIMSNFPEFRQLEKDLEYFDLEYAVDWLKRFAKEVDLINSSDGPLISCIEISSDHSGDFSSSSSYSSSV
        K+ E+   K A+ E      E K++  + L E +    E +  E +    +LE  V   ++ A+E++L  + +  L   +E+ +  +   +      +  
Subjt:  KVVEDSVSKLADTESRLANTEAKISNFDLLYEIMSNFPEFRQLEKDLEYFDLEYAVDWLKRFAKEVDLINSSDGPLISCIEISSDHSGDFSSSSSYSSSV

Query:  PTSPVAPNPSLALKQEKESWEKEKLVLEEESRAMERRITFLERQLSMERKHK
             A N  LA + E ++ E EKL  E E +  E      E +L      K
Subjt:  PTSPVAPNPSLALKQEKESWEKEKLVLEEESRAMERRITFLERQLSMERKHK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTAGGACTAAATCATTCGATCCTAGCTCCCATAGGGTCTCAACTTCTCGAGTCCTTTTAGAGGGTTCTCCAGTCAAAAGCAGAAAACCTCGCGGGAATAAA
CGTGTTGAATCGCCGGAGAATAGCGGCGATTGCTCTGACTTTTCATATTCCAACTGTCCCGAAGAACTATTGAGGATTCTTAGGTATAATTACTCGATCCCGACC
GATATTGAGTTGAGAATCCCTGCGACGAGCGAAACAATCGACAAACCTTCTCTAGGGTGTGTCAGCTTCTACCCCCAAATGTTTGAGTACGGAATGTTAGCCATC
CATACCCTCAAAAAGTCGTCAAAAGCCCCAGGTCAGTACTATCTGAGTTGCTTTCCCGGTATTGCAAAACTAGTCAATGGGCATGTGATGGAAAAAGTTTTTTTC
TCAATAGTGAACGACCCAACTTCCATAAAGAACTGGAAACCAAGATGGTTTTATGTCTCAGGAAACTGGTTGATGACCACTAGCGAAGGGGCCCCTTATTGTGAG
ATCCCTATGGAATTCGATAGATTGGTTTTGATCGACCCCCTACCAGAATTGACAAAAGAGTCTCGGTCGGTTCTGGCTGGCTGTGCAACCCTTTCTGCTTGGGAT
CGCTATAGCCCCAATCTTCTGTCCAACAAGAACTTGAGGAACTGTGGACTCGTAGCCGAACTTTCTGAGGAGGAAATGAATCCTCGTTCAACGAAGTTTAAAAGT
AGCCGCCGCTGGGAGATCCTGCTGGGAGTTGGTACCTCAGGTTTTGCAACAGAAGAAGAAGTGGCTACGAGCTCTCCCTCCAAGAGGGTACAACAAATAAGAGGC
CGTGTAACACAGGTGCAATTTGTGCCTGAAGATAGGTCTGAGGAACAGTTCAAGCATCCCCAACCTCCTACGACCAAGTCACAGACTCTGAGAGATGTGTCTGGA
AAGCAACAAAGAAACACTAAAATTGATCAAGAGAAGGAAGCAAGACTTGATGATTTTGGTAGAGAAGCTGCTGGAATTTATAGCTGTGCAGAATTGTGGAACACT
TCTTCTACACGTATGCATCGGTTGGAACCACCAAAACGCATTGGAAGACGCGTTTCATTAAGGGCAAAGTTGGAAGAAAACCACGTTTTCTGCAGCAGCCCAGGC
GCCTGGCGCCTCCTGAGCGACACTGAGGGCTCGCCTTTGAGAAGAAAAAGGCTTCGACGTGATGAAGAATTAGCCGACCTGGTCAGCACACAGATAACTGATCCT
CAGACAGAGGCTCCGCCCTCTGTTCTTCCTGAGGTCAGTATCCCTTCGTCCGGAGGTCGAAAAGAGACCGCCTCTGCACTTGAAGCCCTTGGAACTGGACCTGGG
ATATCCCTCCAGGAGCTCGACGAGGCTCAATCGCGCTCCCCTTTAACTGGCAATGCGGGTTTCACCACTGGGCCTTCCGATGCTAGGGAAGGCTTTTTTGAGGTT
TCTGGAACTCCACTCGTCTCTGCGGATTTGGTGCCCCATTTCTTGACTTATCTATGTGCGGGTGAGACTTTATCTCTCACAGACAGGCTTTACCCTGTTTTTTCG
GATGAATCGGAGAAGCATATCCGAAGTATCAACCCTTCATCATCCCATGAACTCTTCCACGAGACCACTACGTGCATTGCAAGGGCCTTAGCTCTTTCTTGTAGC
GGAGTGGCTTCCATCGAAATGGAACTCTCTCACCGTGACACGGCCGGTCATAAAAATGAAATCAAGCTCACAGAGGCCACCAGAAGAGCCAATGTCTGTTCGGCT
CAGCTCGCCAATTACAATATTTTGATGAAGGAGCTTCGTAAAGAACAAAAGTTTTTGGCCAAAGAGAAGGAAGAGTTTAAAACCTTGAAGTGTGAGACTCTCAAT
GTTGTTGCTGCCTCTGAAAAATGCATCCTTGAAAACAATAAGTTTAAAGTTGATAAGATCAAATTGGAAGAAGAAGTGTCTCGTCTACAAGTCGAAAATTCCGAG
CTGAAGGCCGAGGCTGATCAAGTCGGACCTCTTGTCGCTCAACTGCGGGAGGAACTCGCTGAGCTAAAGACCTCTACAGGAAACGTTCTGAGAGCAGAGGTCGAG
AAAAGGAAAACAATCGAGACCGAGCTCACCCGAACCTTGGATGCGAAGGTTGTCAGGAGAGAATCGATTGAAGCAGAATTGGTGAATACCAAATTGAAACTCAGC
CAAACTGAGTCGCATCTAGCCACAATTGAGTCTAAATTTCGTGAATTTGAAAAAGTGGTTGAAGATTCTGTGTCAAAGCTTGCGGACACAGAATCCAGACTTGCG
AACACTGAGGCGAAAATCAGCAATTTTGACCTTCTATACGAAATAATGTCTAACTTCCCTGAATTCAGGCAACTCGAGAAAGATCTGGAGTACTTCGATCTCGAA
TATGCGGTCGACTGGTTAAAAAGATTTGCTAAAGAAGTCGACCTGATCAACTCAAGTGACGGTCCTCTCATATCTTGTATAGAGATTTCCTCCGATCATTCTGGA
GATTTTTCATCTTCTTCCTCCTATTCGTCTTCTGTTCCGACCTCTCCAGTTGCCCCCAATCCTTCACTGGCCCTGAAGCAGGAGAAGGAGAGTTGGGAGAAAGAA
AAGCTGGTCCTCGAGGAGGAGAGTCGTGCCATGGAACGCCGTATTACCTTCTTGGAGCGGCAGCTGTCCATGGAGAGGAAGCACAAACGGAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTAGGACTAAATCATTCGATCCTAGCTCCCATAGGGTCTCAACTTCTCGAGTCCTTTTAGAGGGTTCTCCAGTCAAAAGCAGAAAACCTCGCGGGAATAAA
CGTGTTGAATCGCCGGAGAATAGCGGCGATTGCTCTGACTTTTCATATTCCAACTGTCCCGAAGAACTATTGAGGATTCTTAGGTATAATTACTCGATCCCGACC
GATATTGAGTTGAGAATCCCTGCGACGAGCGAAACAATCGACAAACCTTCTCTAGGGTGTGTCAGCTTCTACCCCCAAATGTTTGAGTACGGAATGTTAGCCATC
CATACCCTCAAAAAGTCGTCAAAAGCCCCAGGTCAGTACTATCTGAGTTGCTTTCCCGGTATTGCAAAACTAGTCAATGGGCATGTGATGGAAAAAGTTTTTTTC
TCAATAGTGAACGACCCAACTTCCATAAAGAACTGGAAACCAAGATGGTTTTATGTCTCAGGAAACTGGTTGATGACCACTAGCGAAGGGGCCCCTTATTGTGAG
ATCCCTATGGAATTCGATAGATTGGTTTTGATCGACCCCCTACCAGAATTGACAAAAGAGTCTCGGTCGGTTCTGGCTGGCTGTGCAACCCTTTCTGCTTGGGAT
CGCTATAGCCCCAATCTTCTGTCCAACAAGAACTTGAGGAACTGTGGACTCGTAGCCGAACTTTCTGAGGAGGAAATGAATCCTCGTTCAACGAAGTTTAAAAGT
AGCCGCCGCTGGGAGATCCTGCTGGGAGTTGGTACCTCAGGTTTTGCAACAGAAGAAGAAGTGGCTACGAGCTCTCCCTCCAAGAGGGTACAACAAATAAGAGGC
CGTGTAACACAGGTGCAATTTGTGCCTGAAGATAGGTCTGAGGAACAGTTCAAGCATCCCCAACCTCCTACGACCAAGTCACAGACTCTGAGAGATGTGTCTGGA
AAGCAACAAAGAAACACTAAAATTGATCAAGAGAAGGAAGCAAGACTTGATGATTTTGGTAGAGAAGCTGCTGGAATTTATAGCTGTGCAGAATTGTGGAACACT
TCTTCTACACGTATGCATCGGTTGGAACCACCAAAACGCATTGGAAGACGCGTTTCATTAAGGGCAAAGTTGGAAGAAAACCACGTTTTCTGCAGCAGCCCAGGC
GCCTGGCGCCTCCTGAGCGACACTGAGGGCTCGCCTTTGAGAAGAAAAAGGCTTCGACGTGATGAAGAATTAGCCGACCTGGTCAGCACACAGATAACTGATCCT
CAGACAGAGGCTCCGCCCTCTGTTCTTCCTGAGGTCAGTATCCCTTCGTCCGGAGGTCGAAAAGAGACCGCCTCTGCACTTGAAGCCCTTGGAACTGGACCTGGG
ATATCCCTCCAGGAGCTCGACGAGGCTCAATCGCGCTCCCCTTTAACTGGCAATGCGGGTTTCACCACTGGGCCTTCCGATGCTAGGGAAGGCTTTTTTGAGGTT
TCTGGAACTCCACTCGTCTCTGCGGATTTGGTGCCCCATTTCTTGACTTATCTATGTGCGGGTGAGACTTTATCTCTCACAGACAGGCTTTACCCTGTTTTTTCG
GATGAATCGGAGAAGCATATCCGAAGTATCAACCCTTCATCATCCCATGAACTCTTCCACGAGACCACTACGTGCATTGCAAGGGCCTTAGCTCTTTCTTGTAGC
GGAGTGGCTTCCATCGAAATGGAACTCTCTCACCGTGACACGGCCGGTCATAAAAATGAAATCAAGCTCACAGAGGCCACCAGAAGAGCCAATGTCTGTTCGGCT
CAGCTCGCCAATTACAATATTTTGATGAAGGAGCTTCGTAAAGAACAAAAGTTTTTGGCCAAAGAGAAGGAAGAGTTTAAAACCTTGAAGTGTGAGACTCTCAAT
GTTGTTGCTGCCTCTGAAAAATGCATCCTTGAAAACAATAAGTTTAAAGTTGATAAGATCAAATTGGAAGAAGAAGTGTCTCGTCTACAAGTCGAAAATTCCGAG
CTGAAGGCCGAGGCTGATCAAGTCGGACCTCTTGTCGCTCAACTGCGGGAGGAACTCGCTGAGCTAAAGACCTCTACAGGAAACGTTCTGAGAGCAGAGGTCGAG
AAAAGGAAAACAATCGAGACCGAGCTCACCCGAACCTTGGATGCGAAGGTTGTCAGGAGAGAATCGATTGAAGCAGAATTGGTGAATACCAAATTGAAACTCAGC
CAAACTGAGTCGCATCTAGCCACAATTGAGTCTAAATTTCGTGAATTTGAAAAAGTGGTTGAAGATTCTGTGTCAAAGCTTGCGGACACAGAATCCAGACTTGCG
AACACTGAGGCGAAAATCAGCAATTTTGACCTTCTATACGAAATAATGTCTAACTTCCCTGAATTCAGGCAACTCGAGAAAGATCTGGAGTACTTCGATCTCGAA
TATGCGGTCGACTGGTTAAAAAGATTTGCTAAAGAAGTCGACCTGATCAACTCAAGTGACGGTCCTCTCATATCTTGTATAGAGATTTCCTCCGATCATTCTGGA
GATTTTTCATCTTCTTCCTCCTATTCGTCTTCTGTTCCGACCTCTCCAGTTGCCCCCAATCCTTCACTGGCCCTGAAGCAGGAGAAGGAGAGTTGGGAGAAAGAA
AAGCTGGTCCTCGAGGAGGAGAGTCGTGCCATGGAACGCCGTATTACCTTCTTGGAGCGGCAGCTGTCCATGGAGAGGAAGCACAAACGGAGCTAG
Protein sequenceShow/hide protein sequence
MTRTKSFDPSSHRVSTSRVLLEGSPVKSRKPRGNKRVESPENSGDCSDFSYSNCPEELLRILRYNYSIPTDIELRIPATSETIDKPSLGCVSFYPQMFEYGMLAI
HTLKKSSKAPGQYYLSCFPGIAKLVNGHVMEKVFFSIVNDPTSIKNWKPRWFYVSGNWLMTTSEGAPYCEIPMEFDRLVLIDPLPELTKESRSVLAGCATLSAWD
RYSPNLLSNKNLRNCGLVAELSEEEMNPRSTKFKSSRRWEILLGVGTSGFATEEEVATSSPSKRVQQIRGRVTQVQFVPEDRSEEQFKHPQPPTTKSQTLRDVSG
KQQRNTKIDQEKEARLDDFGREAAGIYSCAELWNTSSTRMHRLEPPKRIGRRVSLRAKLEENHVFCSSPGAWRLLSDTEGSPLRRKRLRRDEELADLVSTQITDP
QTEAPPSVLPEVSIPSSGGRKETASALEALGTGPGISLQELDEAQSRSPLTGNAGFTTGPSDAREGFFEVSGTPLVSADLVPHFLTYLCAGETLSLTDRLYPVFS
DESEKHIRSINPSSSHELFHETTTCIARALALSCSGVASIEMELSHRDTAGHKNEIKLTEATRRANVCSAQLANYNILMKELRKEQKFLAKEKEEFKTLKCETLN
VVAASEKCILENNKFKVDKIKLEEEVSRLQVENSELKAEADQVGPLVAQLREELAELKTSTGNVLRAEVEKRKTIETELTRTLDAKVVRRESIEAELVNTKLKLS
QTESHLATIESKFREFEKVVEDSVSKLADTESRLANTEAKISNFDLLYEIMSNFPEFRQLEKDLEYFDLEYAVDWLKRFAKEVDLINSSDGPLISCIEISSDHSG
DFSSSSSYSSSVPTSPVAPNPSLALKQEKESWEKEKLVLEEESRAMERRITFLERQLSMERKHKRS