; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg001971 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg001971
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold10:2609449..2615234
RNA-Seq ExpressionSpg001971
SyntenySpg001971
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]1.0e-2930.23Show/hide
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDF--SHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC
        ++ L W +F   P  VN+++V+EFYAN+    +  + VRG  + ++  AIN  F+LQ+    HA F E      +++ +  + ++  E  +W   +T + 
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDF--SHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC

Query:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPED-------------
        +     L+  A  W  F+K +LMPT+H++TVS  R+LL  +++ S  IDVG+II  ++ DC  KK   L FPN+IT LCR+  V E+             
Subjt:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPED-------------

Query:  EDDVPLI-------DKGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQ-LHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDL
         D +PL+        K  +   ++   +   E R   L   + Q Q QL  LH          ++ F+ YVK RD  +    Q          P FPD++
Subjt:  EDDVPLI-------DKGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQ-LHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDL

Query:  L
        L
Subjt:  L

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.5e-3342.92Show/hide
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC
        I    W QFCA PE     +VREFYANL D  E  V VRG+ V WS EAIN +F L D    H+ F E +   +   L   +  V + GA+W +S     
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC

Query:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPEDEDDVPLIDKGIID
        T   + L   A  W  F+K  L+PTTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP++IT LCR A  P   ++  L + G ID
Subjt:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPEDEDDVPLIDKGIID

Query:  TPNLARLQRTQE
           +AR+  TQE
Subjt:  TPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.1e-4438.36Show/hide
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC
        I    W QFCA PE     +VREFYANL D EE  V VRG+ V WS EAIN +F L D    H+ F + +   +   L   +  V   GA+W +S     
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC

Query:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPEDEDDVPLIDKGIID
        T   + L   A  W  F+K RL+PTTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP++IT LCR A  P   ++  L + G ID
Subjt:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPEDEDDVPLIDKGIID

Query:  TPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVF
           +AR+ +   T+  +Q               G ++  +  ++++L      Q H  S ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P F
Subjt:  TPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVF

Query:  PDDLL
        P ++L
Subjt:  PDDLL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]4.3e-2839.46Show/hide
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC
        I+   W  FCA PE     +VREFY N+ + ++  V +RG+ V  S EAIN +F+L D    H+ F E +  P   +L   +  V I GA+W +S     
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC

Query:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVP
        T   + L   A  W  F+K RL+PTTH  TVS++ V L +++L   SI+VG++I  EI  C  +K G LFFP++IT +CR    P
Subjt:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.3e-4039.27Show/hide
Query:  IVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKCTFQAAYLKSEANAWMGFIK
        +VREFYANL D EE  + VRG+ V WS EAIN +F L D    H+ F E +  P   +L   +  V   GA+W +S     T   + L   A  W  F+K
Subjt:  IVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKCTFQAAYLKSEANAWMGFIK

Query:  LRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL------------
         RL+PTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP++IT LCR A    +E+   L + G ID   +AR+            
Subjt:  LRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL------------

Query:  --QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL
           R   A        + Q  + L+   S+ E   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L
Subjt:  --QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)7.4e-3442.92Show/hide
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC
        I    W QFCA PE     +VREFYANL D  E  V VRG+ V WS EAIN +F L D    H+ F E +   +   L   +  V + GA+W +S     
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC

Query:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPEDEDDVPLIDKGIID
        T   + L   A  W  F+K  L+PTTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP++IT LCR A  P   ++  L + G ID
Subjt:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPEDEDDVPLIDKGIID

Query:  TPNLARLQRTQE
           +AR+  TQE
Subjt:  TPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)5.5e-4538.36Show/hide
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC
        I    W QFCA PE     +VREFYANL D EE  V VRG+ V WS EAIN +F L D    H+ F + +   +   L   +  V   GA+W +S     
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC

Query:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPEDEDDVPLIDKGIID
        T   + L   A  W  F+K RL+PTTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP++IT LCR A  P   ++  L + G ID
Subjt:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPEDEDDVPLIDKGIID

Query:  TPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVF
           +AR+ +   T+  +Q               G ++  +  ++++L      Q H  S ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P F
Subjt:  TPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVF

Query:  PDDLL
        P ++L
Subjt:  PDDLL

A0A2P5DAQ2 Uncharacterized protein2.1e-2839.46Show/hide
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC
        I+   W  FCA PE     +VREFY N+ + ++  V +RG+ V  S EAIN +F+L D    H+ F E +  P   +L   +  V I GA+W +S     
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC

Query:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVP
        T   + L   A  W  F+K RL+PTTH  TVS++ V L +++L   SI+VG++I  EI  C  +K G LFFP++IT +CR    P
Subjt:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVP

A0A2P5DXM3 Uncharacterized protein6.3e-4139.27Show/hide
Query:  IVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKCTFQAAYLKSEANAWMGFIK
        +VREFYANL D EE  + VRG+ V WS EAIN +F L D    H+ F E +  P   +L   +  V   GA+W +S     T   + L   A  W  F+K
Subjt:  IVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQD--FSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKCTFQAAYLKSEANAWMGFIK

Query:  LRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL------------
         RL+PTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP++IT LCR A    +E+   L + G ID   +AR+            
Subjt:  LRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL------------

Query:  --QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL
           R   A        + Q  + L+   S+ E   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L
Subjt:  --QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLL

A0A6A3BU96 Uncharacterized protein5.0e-3030.23Show/hide
Query:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDF--SHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC
        ++ L W +F   P  VN+++V+EFYAN+    +  + VRG  + ++  AIN  F+LQ+    HA F E      +++ +  + ++  E  +W   +T + 
Subjt:  IVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDF--SHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKC

Query:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPED-------------
        +     L+  A  W  F+K +LMPT+H++TVS  R+LL  +++ S  IDVG+II  ++ DC  KK   L FPN+IT LCR+  V E+             
Subjt:  TFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPED-------------

Query:  EDDVPLI-------DKGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQ-LHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDL
         D +PL+        K  +   ++   +   E R   L   + Q Q QL  LH          ++ F+ YVK RD  +    Q          P FPD++
Subjt:  EDDVPLI-------DKGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQ-LHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDL

Query:  L
        L
Subjt:  L

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACGAGAGCTAGAAAAGAGAGGGAGAATGAGGACGAAGAGGTACATGTTACCCCTGTAGCACAGAAAGTGAAAACGAAAAAGAAGAAGACGCCAGAGGAGAA
AGAAGCGAAAAGGAGGAGAAAGCAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGCGACTGTTACTGCCACAGTAGAAGAAGAAAGCCCGAAACAACCAGAGG
AAAATACCGAGCAGAGGGTCACGGATACAGAGGAAGAACGAACAGAAGAGGTGCAAGAAGATCGGACCGAGGAAGTTCGAGAAGAACTTACAGAGGAAGTTCAAGAACAG
AAGGCCGAGGATGTTCAAATGCAACAGGCAGAAGAGGTTCAGAGAGATGCAGAGAGAGTAGAGCGTGAAAAGAAGGAAGCTGAGGACAAGGCAAGAGAAGAAGAAGCGAA
GAAAGCAGAAGAGGAGATTTTGCTCAAGCGAAGGGCGGAAAAAGGCAAAAGCGTGGCTGAAGCATCGGAAGAACCTGACGAGATTGAGGAATCGAGATTTCCGTACAATC
GCTTCGTCAATAACCTTGCTCGGGCAAAGACTGGAATAGTGAACCTCGGCTGGAGTCAATTTTGTGCAAAGCCGGAGCCTGTTAATTCCAACATTGTTCGAGAGTTTTAC
GCAAATCTTGACGATCAGGAAGAATTTCAGGTTATAGTTCGAGGAATGCCAGTGGACTGGAGCCCAGAAGCCATTAATGATTTGTTCAATCTCCAGGATTTTTCGCATGC
GGCCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAACTAAATGCGGCTGTCCGAGAGGTTGGCATTGAGGGGGCCCAGTGGAGATTGTCGAAGACGGAAAAGTGCA
CATTTCAGGCTGCTTATTTGAAAAGCGAGGCCAATGCATGGATGGGTTTCATCAAGCTGCGCTTAATGCCGACAACTCACGACTCAACGGTATCTCGAGACCGGGTTTTG
CTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGGGTAAGATAATTTCTTCTGAGATTCTGGATTGCTGGAGGAAAAAGGTGGGGAAGTTGTTTTTCCCCAACAT
TATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAAGATGAGGATGATGTGCCGTTAATAGACAAGGGGATAATTGACACACCAAATTTGGCTAGGCTTCAGAGGACGC
AGGAAGCACGCCAAGGAGGTTTGGTGTGCGGCATCCACCAAATGCAGGAGCAATTACAGCTGCATTCCAGTAGGATGGAATTTGTTGAAAGGCAATTGCAGACTTTCTGG
AGCTATGTGAAAAGGAGGGATGCCGCGTTGAGGGTGGCCTTGCAGTCGAATTTTTCCAAGCCATATCCGGCTTTACCCGTATTCCCTGACGACCTACTGAACCCCTGGAT
CCCACCCCCACCTGTTGAACGAGAGGAAGTTGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGACCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGGTAG
TGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCGCGCTTATTTGTCTTCGCGTCAAAAGAGTATTTAGCCTAATTGGTGATGAGTTTGAGGCATGGGTATAC
TGCACCATAAAGTGGGTCATCCAGTGCTTAAGAGCTTATGACTGTAGGGCTGCTTTAAGTCTGAAAAACAAGAATTTGAACCCCTTGAAAATGTGTTTTGATATGTCTGA
AAATAGAGCTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAATGGTGATTATTTGTCCATGCTGGAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCT
CAGAATCTGTTGTTGGGCGACTTGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGCAGAACTGCCACGTCACAGCTCTCCCACTCAGCTGCAAAGGGTATACTGAA
CAAGCCATTTCCTCATTACGAGGAATGTTCGGCAAGGATCAGGCTAGTGGATCGGGGTGCGATGTACCAGCAGAACAAGCTGCAGAAAGCCCCCCAGTGGACGCAGAGGG
GGATTCCAACTTCCAGGGGACACCGGATTATTATGTTCCCATCCCTCCAGTAGAAAATTTGGCCACGGACGTCGAGTTTGAGGACGTCCCCATAACGCCCACGAGCCGAC
CGAGCACAGCGGGTTCCTCGCAGGGTCGGAAGAGGAGTAGAGCATCATATGAAGCAGAAACCCTGGAAATAATGAGGATCACCGATCAGATGTCATCAAATGTTACGGAC
AATCTGGTGGACGATGTGTCTGACACAAGCAGCAGTAATGTGGGGCCAGCTGAGAGATCAACAGGATCAATTAGTAGGAGACGAACTTCATATAACAGAGAGATGATGGA
GGTCGTGACAGCTGCAATGGATAGCCAAATTACCAGCCTTCAAAAGATCGCATCCTGGCTAGAACAAAAACACGAACGGGAGGCTGCGCGACGGAAGTTGGTGGTCGAGC
AGATTCAAGAAATGGAACAGTTTGAAGAAGGAGAGAAAAAACAACTTCTGAATGTCTTGCTCGCTGATATGCAGACAATTGAATTTTTCCTTGCCCTGCCCATGCCTTTC
AAAAAAAAAAAACATTGCATGGATGTGCTTGGAAGAAACGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACGAGAGCTAGAAAAGAGAGGGAGAATGAGGACGAAGAGGTACATGTTACCCCTGTAGCACAGAAAGTGAAAACGAAAAAGAAGAAGACGCCAGAGGAGAA
AGAAGCGAAAAGGAGGAGAAAGCAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGCGACTGTTACTGCCACAGTAGAAGAAGAAAGCCCGAAACAACCAGAGG
AAAATACCGAGCAGAGGGTCACGGATACAGAGGAAGAACGAACAGAAGAGGTGCAAGAAGATCGGACCGAGGAAGTTCGAGAAGAACTTACAGAGGAAGTTCAAGAACAG
AAGGCCGAGGATGTTCAAATGCAACAGGCAGAAGAGGTTCAGAGAGATGCAGAGAGAGTAGAGCGTGAAAAGAAGGAAGCTGAGGACAAGGCAAGAGAAGAAGAAGCGAA
GAAAGCAGAAGAGGAGATTTTGCTCAAGCGAAGGGCGGAAAAAGGCAAAAGCGTGGCTGAAGCATCGGAAGAACCTGACGAGATTGAGGAATCGAGATTTCCGTACAATC
GCTTCGTCAATAACCTTGCTCGGGCAAAGACTGGAATAGTGAACCTCGGCTGGAGTCAATTTTGTGCAAAGCCGGAGCCTGTTAATTCCAACATTGTTCGAGAGTTTTAC
GCAAATCTTGACGATCAGGAAGAATTTCAGGTTATAGTTCGAGGAATGCCAGTGGACTGGAGCCCAGAAGCCATTAATGATTTGTTCAATCTCCAGGATTTTTCGCATGC
GGCCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAACTAAATGCGGCTGTCCGAGAGGTTGGCATTGAGGGGGCCCAGTGGAGATTGTCGAAGACGGAAAAGTGCA
CATTTCAGGCTGCTTATTTGAAAAGCGAGGCCAATGCATGGATGGGTTTCATCAAGCTGCGCTTAATGCCGACAACTCACGACTCAACGGTATCTCGAGACCGGGTTTTG
CTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGGGTAAGATAATTTCTTCTGAGATTCTGGATTGCTGGAGGAAAAAGGTGGGGAAGTTGTTTTTCCCCAACAT
TATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAAGATGAGGATGATGTGCCGTTAATAGACAAGGGGATAATTGACACACCAAATTTGGCTAGGCTTCAGAGGACGC
AGGAAGCACGCCAAGGAGGTTTGGTGTGCGGCATCCACCAAATGCAGGAGCAATTACAGCTGCATTCCAGTAGGATGGAATTTGTTGAAAGGCAATTGCAGACTTTCTGG
AGCTATGTGAAAAGGAGGGATGCCGCGTTGAGGGTGGCCTTGCAGTCGAATTTTTCCAAGCCATATCCGGCTTTACCCGTATTCCCTGACGACCTACTGAACCCCTGGAT
CCCACCCCCACCTGTTGAACGAGAGGAAGTTGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGACCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGGTAG
TGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCGCGCTTATTTGTCTTCGCGTCAAAAGAGTATTTAGCCTAATTGGTGATGAGTTTGAGGCATGGGTATAC
TGCACCATAAAGTGGGTCATCCAGTGCTTAAGAGCTTATGACTGTAGGGCTGCTTTAAGTCTGAAAAACAAGAATTTGAACCCCTTGAAAATGTGTTTTGATATGTCTGA
AAATAGAGCTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAATGGTGATTATTTGTCCATGCTGGAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCT
CAGAATCTGTTGTTGGGCGACTTGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGCAGAACTGCCACGTCACAGCTCTCCCACTCAGCTGCAAAGGGTATACTGAA
CAAGCCATTTCCTCATTACGAGGAATGTTCGGCAAGGATCAGGCTAGTGGATCGGGGTGCGATGTACCAGCAGAACAAGCTGCAGAAAGCCCCCCAGTGGACGCAGAGGG
GGATTCCAACTTCCAGGGGACACCGGATTATTATGTTCCCATCCCTCCAGTAGAAAATTTGGCCACGGACGTCGAGTTTGAGGACGTCCCCATAACGCCCACGAGCCGAC
CGAGCACAGCGGGTTCCTCGCAGGGTCGGAAGAGGAGTAGAGCATCATATGAAGCAGAAACCCTGGAAATAATGAGGATCACCGATCAGATGTCATCAAATGTTACGGAC
AATCTGGTGGACGATGTGTCTGACACAAGCAGCAGTAATGTGGGGCCAGCTGAGAGATCAACAGGATCAATTAGTAGGAGACGAACTTCATATAACAGAGAGATGATGGA
GGTCGTGACAGCTGCAATGGATAGCCAAATTACCAGCCTTCAAAAGATCGCATCCTGGCTAGAACAAAAACACGAACGGGAGGCTGCGCGACGGAAGTTGGTGGTCGAGC
AGATTCAAGAAATGGAACAGTTTGAAGAAGGAGAGAAAAAACAACTTCTGAATGTCTTGCTCGCTGATATGCAGACAATTGAATTTTTCCTTGCCCTGCCCATGCCTTTC
AAAAAAAAAAAACATTGCATGGATGTGCTTGGAAGAAACGAATGA
Protein sequenceShow/hide protein sequence
MAKTRARKERENEDEEVHVTPVAQKVKTKKKKTPEEKEAKRRRKQQRAEEQEKATEVATVTATVEEESPKQPEENTEQRVTDTEEERTEEVQEDRTEEVREELTEEVQEQ
KAEDVQMQQAEEVQRDAERVEREKKEAEDKAREEEAKKAEEEILLKRRAEKGKSVAEASEEPDEIEESRFPYNRFVNNLARAKTGIVNLGWSQFCAKPEPVNSNIVREFY
ANLDDQEEFQVIVRGMPVDWSPEAINDLFNLQDFSHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKCTFQAAYLKSEANAWMGFIKLRLMPTTHDSTVSRDRVL
LAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNIITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFW
SYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREEVDEEQETFCLSIFSDLVVAAAKKILEVVLTYVIRFKLRSSPALICLRVKRVFSLIGDEFEAWVY
CTIKWVIQCLRAYDCRAALSLKNKNLNPLKMCFDMSENRAKLWQVLRIELKMVIICPCWKNYFAAAELGFAECSESVVGRLEGANSVLQQNWEQNCHVTALPLSCKGYTE
QAISSLRGMFGKDQASGSGCDVPAEQAAESPPVDAEGDSNFQGTPDYYVPIPPVENLATDVEFEDVPITPTSRPSTAGSSQGRKRSRASYEAETLEIMRITDQMSSNVTD
NLVDDVSDTSSSNVGPAERSTGSISRRRTSYNREMMEVVTAAMDSQITSLQKIASWLEQKHEREAARRKLVVEQIQEMEQFEEGEKKQLLNVLLADMQTIEFFLALPMPF
KKKKHCMDVLGRNE