CuGenDBv2

Spg003229 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg003229
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold4:32001150..32004378
RNA-Seq Expression: Spg003229
Synteny: Spg003229
Gene Ontology terms: NA
InterPro domains: NA
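
The genome location above is given as a `scaffold:start..end` string. As a minimal sketch, the following Python splits such a string into its parts and computes the span it covers. The names `Location` and `parse_location` are hypothetical helpers introduced only for illustration, and the code assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings shaped like "scaffold4:32001150..32004378".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


if __name__ == "__main__":
    loc = parse_location("scaffold4:32001150..32004378")
    print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the location listed for Spg003229 covers 3229 bases on scaffold4.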


Homology
GenBank top hits (e value, % identity, alignment)
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus] | e value: 2.2e-26 | % identity: 27.65
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDF--PHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKR
        ++ L W +F   P  VN+++V+EFYAN+    +  + VRG  + ++  AIN  F+LQ+    HA F E      +++ +  + ++  E  +    +T + 
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDF--PHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKR

Query:  TFQAAYLKSEANVWMGFIKLRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIID
        +     L+  A +W  F+K +++PT+H++TVS  R+LL  +++ S  IDVG+II  ++ DC  KK   L FPN IT LCR+  V E+  D  L     I 
Subjt:  TFQAAYLKSEANVWMGFIKLRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIID

Query:  TPNLARL-------------QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL
           L  L             +++    +      +  ++E +    +++  +   ++ F+ YVK RD  +    Q          P FPD++L
Subjt:  TPNLARL-------------QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] | e value: 1.2e-32 | % identity: 42.86
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTF
        I    W QFCA PE     +VREFYANL D  E  V VRG+ V WS EAIN +F L D P    +E +   +   L   +  V + GA+  +S     T 
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTF

Query:  QAAYLKSEANVWMGFIKLRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIIDTP
          + L   A VW  F+K  +LPTTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID  
Subjt:  QAAYLKSEANVWMGFIKLRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIIDTP

Query:  NLARLQRTQE
         +AR+  TQE
Subjt:  NLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | e value: 1.4e-39 | % identity: 35.7
Query:  TTDSERENTERVEREKKEAEDKAREEEAKKAEEEILLK-RRAEKGKSVAEASEEPDEIEESRFPYNRFVNNRARAKTGIVNLGWSQFCAKPEPVNSNIVR
        +T   R   +RV R+  +A     E  A + E  I  +   AEKG  V + SE   ++     P+   V         I    W QFCA PE     +VR
Subjt:  TTDSERENTERVEREKKEAEDKAREEEAKKAEEEILLK-RRAEKGKSVAEASEEPDEIEESRFPYNRFVNNRARAKTGIVNLGWSQFCAKPEPVNSNIVR

Query:  EFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTFQAAYLKSEANVWMGFIKLRMLP
        EFYANL D EE  V VRG+ V WS EAIN +F L D P    +E +   +   L   +  V   GA+  +S     T   + L   A VW  F+K R+LP
Subjt:  EFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTFQAAYLKSEANVWMGFIKLRMLP

Query:  TTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIIDTPNLARLQR---TQEARQ------
        TTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q      
Subjt:  TTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIIDTPNLARLQR---TQEARQ------

Query:  ---------GGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL
                 G ++  +  ++++L      Q H  S ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L
Subjt:  ---------GGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] | e value: 1.7e-26 | % identity: 39.34
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTF
        I+   W  FCA PE     +VREFY N+ + ++  V +RG+ V  S EAIN +F+L D P    +E V   +  +L   +  V I GA+  +S     T 
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTF

Query:  QAAYLKSEANVWMGFIKLRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVP
          + L   A VW  F+K R+LPTTH  TVS++ V L +++L   SI+VG++I  EI  C  +K G LFFP+ IT +CR    P
Subjt:  QAAYLKSEANVWMGFIKLRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] | e value: 3.9e-39 | % identity: 39.27
Query:  IVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTFQAAYLKSEANVWMGFIK
        +VREFYANL D EE  + VRG+ V WS EAIN +F L D    H+ F E +  P   +L   +  V   GA+  +S     T   + L   A VW  F+K
Subjt:  IVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTFQAAYLKSEANVWMGFIK

Query:  LRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIIDTPNLARL------------
         R+LPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+            
Subjt:  LRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIIDTPNLARL------------

Query:  --QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL
           R   A        + Q  + L+   S+ E   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L
Subjt:  --QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) | e value: 5.9e-33 | % identity: 42.86
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTF
        I    W QFCA PE     +VREFYANL D  E  V VRG+ V WS EAIN +F L D P    +E +   +   L   +  V + GA+  +S     T 
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTF

Query:  QAAYLKSEANVWMGFIKLRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIIDTP
          + L   A VW  F+K  +LPTTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID  
Subjt:  QAAYLKSEANVWMGFIKLRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIIDTP

Query:  NLARLQRTQE
         +AR+  TQE
Subjt:  NLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment) | e value: 6.6e-40 | % identity: 35.7
Query:  TTDSERENTERVEREKKEAEDKAREEEAKKAEEEILLK-RRAEKGKSVAEASEEPDEIEESRFPYNRFVNNRARAKTGIVNLGWSQFCAKPEPVNSNIVR
        +T   R   +RV R+  +A     E  A + E  I  +   AEKG  V + SE   ++     P+   V         I    W QFCA PE     +VR
Subjt:  TTDSERENTERVEREKKEAEDKAREEEAKKAEEEILLK-RRAEKGKSVAEASEEPDEIEESRFPYNRFVNNRARAKTGIVNLGWSQFCAKPEPVNSNIVR

Query:  EFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTFQAAYLKSEANVWMGFIKLRMLP
        EFYANL D EE  V VRG+ V WS EAIN +F L D P    +E +   +   L   +  V   GA+  +S     T   + L   A VW  F+K R+LP
Subjt:  EFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTFQAAYLKSEANVWMGFIKLRMLP

Query:  TTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIIDTPNLARLQR---TQEARQ------
        TTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q      
Subjt:  TTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIIDTPNLARLQR---TQEARQ------

Query:  ---------GGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL
                 G ++  +  ++++L      Q H  S ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L
Subjt:  ---------GGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL

A0A2P5DAQ2 Uncharacterized protein | e value: 8.3e-27 | % identity: 39.34
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTF
        I+   W  FCA PE     +VREFY N+ + ++  V +RG+ V  S EAIN +F+L D P    +E V   +  +L   +  V I GA+  +S     T 
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTF

Query:  QAAYLKSEANVWMGFIKLRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVP
          + L   A VW  F+K R+LPTTH  TVS++ V L +++L   SI+VG++I  EI  C  +K G LFFP+ IT +CR    P
Subjt:  QAAYLKSEANVWMGFIKLRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVP

A0A2P5DXM3 Uncharacterized protein | e value: 1.9e-39 | % identity: 39.27
Query:  IVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTFQAAYLKSEANVWMGFIK
        +VREFYANL D EE  + VRG+ V WS EAIN +F L D    H+ F E +  P   +L   +  V   GA+  +S     T   + L   A VW  F+K
Subjt:  IVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FPHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKRTFQAAYLKSEANVWMGFIK

Query:  LRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIIDTPNLARL------------
         R+LPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+            
Subjt:  LRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIIDTPNLARL------------

Query:  --QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL
           R   A        + Q  + L+   S+ E   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L
Subjt:  --QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL

A0A6A3BU96 Uncharacterized protein | e value: 1.1e-26 | % identity: 27.65
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDF--PHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKR
        ++ L W +F   P  VN+++V+EFYAN+    +  + VRG  + ++  AIN  F+LQ+    HA F E      +++ +  + ++  E  +    +T + 
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDF--PHAAFNEMVVAPSNDQLNAAVREVGIEGAQGRLSKTEKR

Query:  TFQAAYLKSEANVWMGFIKLRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIID
        +     L+  A +W  F+K +++PT+H++TVS  R+LL  +++ S  IDVG+II  ++ DC  KK   L FPN IT LCR+  V E+  D  L     I 
Subjt:  TFQAAYLKSEANVWMGFIKLRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVSLIDKGIID

Query:  TPNLARL-------------QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL
           L  L             +++    +      +  ++E +    +++  +   ++ F+ YVK RD  +    Q          P FPD++L
Subjt:  TPNLARL-------------QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGCAAAGACAAGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGAGGTACCTGTTACCCCTATAGTACAGAAAGCGAAAACGAAAAAGAAGAAGACGCCAGAAGAGAA
AGAAGCGAAGAGGAGGAGGAAGCAACAGAGGGCCGAGGATCAGGAGGCTCTACAGAAAGCAACGGATGACGTGGCTGCCACTGTAGTTGAAGAAAACCCGCAAGAACCAG
AAGAACAGAACCTAGAACAGACTGAGCAGAGAGTTGCGGGTACAGAAGAAGTTCAAGAGGAGCAAACAGAGGAAGTTCAAGAGGATCGGACCGAGGAAGTTCGAGAAGAA
ATTACAGAGGAAGTTCAAGAACAGCAGGCCGAGGATGTTCAAATGCAACAGGCAGAAGAGGTTCAGGTACCGGATAATGAGCCAGTGCAGGACGCTCAAGTAGAGGTGAT
CATGCCGGAGGTGCCAAAGCGTCGTCGAGTTAAGAGGAAAGCAGGCCGCGCTAGGGTTGTTCGAACTGATACTCCTTCGCCTCTGACCACGGATTCTGAAAGAGAGAATA
CAGAGAGAGTAGAGCGTGAAAAGAAGGAAGCTGAGGACAAGGCAAGAGAAGAAGAAGCGAAGAAAGCGGAAGAGGAGATTTTGCTCAAGCGAAGGGCGGAAAAGGGTAAA
AGCGTGGCTGAAGCTTCAGAAGAACCTGACGAGATTGAGGAGTCGAGATTTCCGTACAACCGCTTCGTCAATAACCGTGCTCGGGCAAAGACTGGAATAGTGAACCTCGG
CTGGAGTCAATTTTGTGCGAAGCCGGAGCCTGTTAATTCCAACATTGTTCGGGAGTTTTACGCAAATCTTGACGATCAAGAAGAATTTCAGGTTATAGTTCGAGGAATGC
CAGTGGACTGGAGCCCAGAAGCCATTAATGATTTGTTCAATCTCCAGGATTTTCCGCATGCGGCCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAATTAAATGCG
GCTGTCCGAGAGGTTGGCATTGAGGGGGCCCAGGGGAGACTGTCGAAGACGGAAAAGCGCACATTTCAGGCTGCTTATTTGAAAAGCGAGGCCAATGTATGGATGGGTTT
CATCAAGCTGCGCATGTTGCCGACAACTCACGACTCAACGGTATCTCGAGACCGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGGGTAAGATAA
TTTCTTCTGAGATTCTGGATTGCTGGAGGAAAAAGGTGGGGAAGTTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAAGATGAGGATGAT
GTGTCGTTAATAGACAAGGGGATAATTGACACACCAAATCTGGCTAGGCTTCAGAGGACGCAGGAAGCACGCCAAGGAGGTTTGGTGTGCGGCATCCACCAAATGCAGGA
GCAATTACAGCTGCATTCCAGCAGGATGGAATTTGTGGAAAGACAATTGCAGACTTTCTGGAGCTATGTGAAAAGGAGGGATGCCGCGTTGAGGGTAGCCTTGCAGTCGA
ATTTTTCCAAGCCATATCCGGCTTTACCCGTATTCCCTGACGACCTACTGAACCCCTGGATCCCACCCCCACCTGTTGAACGAGAGGAAGTTGATGAAGAGCAGGAAACC
TTTTGCTTGAGCATTTTCTTTGGCCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGGTAGTGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCGCGCTTAC
TAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTACTCAGAATCTG
TTGCTGGGCGACTTGAGGGAGCAAAATCTGTGCTGCAGCAAAGCTGGGAGCAAAACTGCCACGTCACAGCTCGTTAG
mRNA sequence
ATGGCAAAGACAAGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGAGGTACCTGTTACCCCTATAGTACAGAAAGCGAAAACGAAAAAGAAGAAGACGCCAGAAGAGAA
AGAAGCGAAGAGGAGGAGGAAGCAACAGAGGGCCGAGGATCAGGAGGCTCTACAGAAAGCAACGGATGACGTGGCTGCCACTGTAGTTGAAGAAAACCCGCAAGAACCAG
AAGAACAGAACCTAGAACAGACTGAGCAGAGAGTTGCGGGTACAGAAGAAGTTCAAGAGGAGCAAACAGAGGAAGTTCAAGAGGATCGGACCGAGGAAGTTCGAGAAGAA
ATTACAGAGGAAGTTCAAGAACAGCAGGCCGAGGATGTTCAAATGCAACAGGCAGAAGAGGTTCAGGTACCGGATAATGAGCCAGTGCAGGACGCTCAAGTAGAGGTGAT
CATGCCGGAGGTGCCAAAGCGTCGTCGAGTTAAGAGGAAAGCAGGCCGCGCTAGGGTTGTTCGAACTGATACTCCTTCGCCTCTGACCACGGATTCTGAAAGAGAGAATA
CAGAGAGAGTAGAGCGTGAAAAGAAGGAAGCTGAGGACAAGGCAAGAGAAGAAGAAGCGAAGAAAGCGGAAGAGGAGATTTTGCTCAAGCGAAGGGCGGAAAAGGGTAAA
AGCGTGGCTGAAGCTTCAGAAGAACCTGACGAGATTGAGGAGTCGAGATTTCCGTACAACCGCTTCGTCAATAACCGTGCTCGGGCAAAGACTGGAATAGTGAACCTCGG
CTGGAGTCAATTTTGTGCGAAGCCGGAGCCTGTTAATTCCAACATTGTTCGGGAGTTTTACGCAAATCTTGACGATCAAGAAGAATTTCAGGTTATAGTTCGAGGAATGC
CAGTGGACTGGAGCCCAGAAGCCATTAATGATTTGTTCAATCTCCAGGATTTTCCGCATGCGGCCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAATTAAATGCG
GCTGTCCGAGAGGTTGGCATTGAGGGGGCCCAGGGGAGACTGTCGAAGACGGAAAAGCGCACATTTCAGGCTGCTTATTTGAAAAGCGAGGCCAATGTATGGATGGGTTT
CATCAAGCTGCGCATGTTGCCGACAACTCACGACTCAACGGTATCTCGAGACCGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGGGTAAGATAA
TTTCTTCTGAGATTCTGGATTGCTGGAGGAAAAAGGTGGGGAAGTTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAAGATGAGGATGAT
GTGTCGTTAATAGACAAGGGGATAATTGACACACCAAATCTGGCTAGGCTTCAGAGGACGCAGGAAGCACGCCAAGGAGGTTTGGTGTGCGGCATCCACCAAATGCAGGA
GCAATTACAGCTGCATTCCAGCAGGATGGAATTTGTGGAAAGACAATTGCAGACTTTCTGGAGCTATGTGAAAAGGAGGGATGCCGCGTTGAGGGTAGCCTTGCAGTCGA
ATTTTTCCAAGCCATATCCGGCTTTACCCGTATTCCCTGACGACCTACTGAACCCCTGGATCCCACCCCCACCTGTTGAACGAGAGGAAGTTGATGAAGAGCAGGAAACC
TTTTGCTTGAGCATTTTCTTTGGCCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGGTAGTGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCGCGCTTAC
TAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTACTCAGAATCTG
TTGCTGGGCGACTTGAGGGAGCAAAATCTGTGCTGCAGCAAAGCTGGGAGCAAAACTGCCACGTCACAGCTCGTTAG
Protein sequence
MAKTRARKERENEEEEVPVTPIVQKAKTKKKKTPEEKEAKRRRKQQRAEDQEALQKATDDVAATVVEENPQEPEEQNLEQTEQRVAGTEEVQEEQTEEVQEDRTEEVREE
ITEEVQEQQAEDVQMQQAEEVQVPDNEPVQDAQVEVIMPEVPKRRRVKRKAGRARVVRTDTPSPLTTDSERENTERVEREKKEAEDKAREEEAKKAEEEILLKRRAEKGK
SVAEASEEPDEIEESRFPYNRFVNNRARAKTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFPHAAFNEMVVAPSNDQLNA
AVREVGIEGAQGRLSKTEKRTFQAAYLKSEANVWMGFIKLRMLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDD
VSLIDKGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREEVDEEQET
FCLSIFFGLVVAAAKKILEVVLTYVIRFKLRSSPALTKLWQVLRIELKVVIICPCRKNYFAAAELGFAEYSESVAGRLEGAKSVLQQSWEQNCHVTAR
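
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The sketch below is one way to do that in Python. It assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly, and the function name `translate` is a hypothetical helper rather than part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence lines can
    be pasted in as-is. Codons containing ambiguous bases become 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the CDS listed above.
    print(translate("ATGGCAAAGACAAGAGCGAGG"))
```

To check the whole record, pass the full CDS text to `translate` and compare the result with the protein sequence after removing its line breaks.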