; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022015 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022015
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr7:15941630..15943611
RNA-Seq ExpressionLag0022015
SyntenyLag0022015
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]1.4e-2828.92Show/hide
Query:  RFVNDLARAKYQEVLKRDFLFERGF-------GSDLPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ
        +F ND A+A++Q    R+  FE GF       G   P  ++  ++ L W +F   P  VN+++V+EFYAN+   N   + VRG  ++++  AIN  F LQ
Subjt:  RFVNDLARAKYQEVLKRDFLFERGF-------GSDLPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ

Query:  DF--PHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        +    HA+F E      S++    + ++  E  +W   QT +++     L+  A  W  F++ +L+PT+H++TVS  R+LL  +++ S  IDVG+II  +
Subjt:  DF--PHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY----GVNQILEQLSVLASRLEFAERQAQ---------
        + DC  KK   L FP     LC +  V     D I+     I    L  L  ++  +    V+    G  +   ++ +LA      + QAQ         
Subjt:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY----GVNQILEQLSVLASRLEFAERQAQ---------

Query:  TYWTYAKRRDDALRGPCKPISQHRIRPFQCFP
         ++ Y K RD  +    + I  +  R F  FP
Subjt:  TYWTYAKRRDDALRGPCKPISQHRIRPFQCFP

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.7e-3237.6Show/hide
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD
        ++F  + A  +Y+  +  R    E+GF  D       LP F+   I    W+QFCA PE     +VREFYANL    +  V VRGV V WS EAIN +F 
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD

Query:  LQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L   +  V + GA+W VS    +T   + L   A  W  F++  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARL
        I  C  +K G LFFP+    LC  A  P +  +  + + G ID   +AR+
Subjt:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.8e-3734.3Show/hide
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD
        ++F  + A  +Y+  +  R    E+GF  D       LP F+   I    W+QFCA PE     +VREFYANL    +  V VRGV V WS EAIN +F 
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD

Query:  LQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L   +  V   GA+W VS    +T   + L   A  W  F++ RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARLQR-------MQEVRQGGLVYGVNQ----ILEQLSVLASR-----------
        I  C  +K G LFFP+    LC  A  P +  +  + + G ID   +AR+ +        Q           N+    IL+QL  L  R           
Subjt:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARLQR-------MQEVRQGGLVYGVNQ----ILEQLSVLASR-----------

Query:  ---LEFAERQAQTYWTYAKRRDDALRGPCKPISQHRIRPFQCFP
           L+   +Q Q +W Y+K RD AL+   K +  +  RP   FP
Subjt:  ---LEFAERQAQTYWTYAKRRDDALRGPCKPISQHRIRPFQCFP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]9.7e-3036.53Show/hide
Query:  IVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQD--FPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR
        +VREFYANL    +  + VRGV V WS EAIN +F L D    H+ F E +  P   +L   +  V   GA+W VS    +T   + L   A  W  F++
Subjt:  IVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQD--FPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR

Query:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARL---------QRM
         RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+    LC  A  P +  +  + + G ID   +AR+         Q+ 
Subjt:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARL---------QRM

Query:  QEVRQGGLVYG--VNQILEQLSVLASRL---EFAERQAQTYWTYAKRRDDALRGPCKPISQHRIRPFQCFP
           R            +L+QL  L  RL   E   +Q Q +W Y+K RD AL+   K +  +  RP   FP
Subjt:  QEVRQGGLVYG--VNQILEQLSVLASRL---EFAERQAQTYWTYAKRRDDALRGPCKPISQHRIRPFQCFP

XP_024995361.1 formin-like protein 3 [Cynara cardunculus var. scolymus]3.1e-2830.39Show/hide
Query:  RSKTSRRKKKGKLRIRKKKKRQERQKKSVCVQRESKGKGIAEASGEIEEPRAPFIR----FVNDLARAKYQEVLKRDFLFERGFGSDL---PRFLESGIV
        R +  R+K+  ++  R  K+R+E  ++S   +R S+ +  +E   + EE    F++    F   LAR  Y +  ++  + ++GF   L      +++ I 
Subjt:  RSKTSRRKKKGKLRIRKKKKRQERQKKSVCVQRESKGKGIAEASGEIEEPRAPFIR----FVNDLARAKYQEVLKRDFLFERGFGSDL---PRFLESGIV

Query:  NLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQA
        +LGW   C  P   N + VREF   + V +  E+ VR  PV +SP AIN+L +L    ++  + +    + ++L   + +V  EG  W   + R    + 
Subjt:  NLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQA

Query:  AYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPT-HYMLCSRAGVPTVPEDMIMTDKGIIDTPNL
         YLK EAN W  F+R  + P +HD+ +  +RVL+ + IL +   DVG++I S I  C ++  GKL +P+  + L  +A V  +P+D++  +K  ID  NL
Subjt:  AYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPT-HYMLCSRAGVPTVPEDMIMTDKGIIDTPNL

Query:  ARLQRM
         RL R+
Subjt:  ARLQRM

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.3e-3237.6Show/hide
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD
        ++F  + A  +Y+  +  R    E+GF  D       LP F+   I    W+QFCA PE     +VREFYANL    +  V VRGV V WS EAIN +F 
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD

Query:  LQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L   +  V + GA+W VS    +T   + L   A  W  F++  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARL
        I  C  +K G LFFP+    LC  A  P +  +  + + G ID   +AR+
Subjt:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment)1.4e-3734.3Show/hide
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD
        ++F  + A  +Y+  +  R    E+GF  D       LP F+   I    W+QFCA PE     +VREFYANL    +  V VRGV V WS EAIN +F 
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD

Query:  LQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L   +  V   GA+W VS    +T   + L   A  W  F++ RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARLQR-------MQEVRQGGLVYGVNQ----ILEQLSVLASR-----------
        I  C  +K G LFFP+    LC  A  P +  +  + + G ID   +AR+ +        Q           N+    IL+QL  L  R           
Subjt:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARLQR-------MQEVRQGGLVYGVNQ----ILEQLSVLASR-----------

Query:  ---LEFAERQAQTYWTYAKRRDDALRGPCKPISQHRIRPFQCFP
           L+   +Q Q +W Y+K RD AL+   K +  +  RP   FP
Subjt:  ---LEFAERQAQTYWTYAKRRDDALRGPCKPISQHRIRPFQCFP

A0A2P5DXM3 Uncharacterized protein4.7e-3036.53Show/hide
Query:  IVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQD--FPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR
        +VREFYANL    +  + VRGV V WS EAIN +F L D    H+ F E +  P   +L   +  V   GA+W VS    +T   + L   A  W  F++
Subjt:  IVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQD--FPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR

Query:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARL---------QRM
         RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+    LC  A  P +  +  + + G ID   +AR+         Q+ 
Subjt:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARL---------QRM

Query:  QEVRQGGLVYG--VNQILEQLSVLASRL---EFAERQAQTYWTYAKRRDDALRGPCKPISQHRIRPFQCFP
           R            +L+QL  L  RL   E   +Q Q +W Y+K RD AL+   K +  +  RP   FP
Subjt:  QEVRQGGLVYG--VNQILEQLSVLASRL---EFAERQAQTYWTYAKRRDDALRGPCKPISQHRIRPFQCFP

A0A6A2WM54 Uncharacterized protein1.7e-2730.38Show/hide
Query:  RFVNDLARAKYQEVLKRDFLFERGF-------GSDLPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ
        +F ND A+A++Q    R   FE GF       G   P  ++  +  L W++F   P  VN+++V+EFYAN+   N + + VRG  ++++  AIN  F LQ
Subjt:  RFVNDLARAKYQEVLKRDFLFERGF-------GSDLPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ

Query:  DF--PHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        D    HA F E      S++    + ++  E  +W   QT +++     L+  A  W  F++ +L+PT+H++TVS  R+LL  +I+ S  IDVG II  +
Subjt:  DF--PHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY----GVNQILEQLSVLASRLEFAERQAQTY
        + DC  KK   L FP     LC +  V     D I+     I+   L  L  ++  +    V+    G  +   +  +LA        QAQ +
Subjt:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY----GVNQILEQLSVLASRLEFAERQAQTY

A0A6A3BU96 Uncharacterized protein6.8e-2928.92Show/hide
Query:  RFVNDLARAKYQEVLKRDFLFERGF-------GSDLPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ
        +F ND A+A++Q    R+  FE GF       G   P  ++  ++ L W +F   P  VN+++V+EFYAN+   N   + VRG  ++++  AIN  F LQ
Subjt:  RFVNDLARAKYQEVLKRDFLFERGF-------GSDLPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ

Query:  DF--PHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        +    HA+F E      S++    + ++  E  +W   QT +++     L+  A  W  F++ +L+PT+H++TVS  R+LL  +++ S  IDVG+II  +
Subjt:  DF--PHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY----GVNQILEQLSVLASRLEFAERQAQ---------
        + DC  KK   L FP     LC +  V     D I+     I    L  L  ++  +    V+    G  +   ++ +LA      + QAQ         
Subjt:  IVDCWKKKVGKLFFPTHY-MLCSRAGVPTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY----GVNQILEQLSVLASRLEFAERQAQ---------

Query:  TYWTYAKRRDDALRGPCKPISQHRIRPFQCFP
         ++ Y K RD  +    + I  +  R F  FP
Subjt:  TYWTYAKRRDDALRGPCKPISQHRIRPFQCFP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGAGCAAAATATTCCGAATTAGAAGGGTTTCCGTTGGAATTTATTATTTTTACCGTTGGATTTGTTTTGAATTCTCGCAGAGAGGAGAGTGAAGAGGAGGAGAT
ACCGGTCACACCGGAAGTGCAAAAAGGGAAAACTAAGAAGAAAAGAACGCCAGAAGAGAAGGAGGCGAAGCGAAGACGGAGGCAGCAGAGGGCTGCGGAACAAGAAGTCA
TCCAAGAAACAGAGGCTGTTCAGAATCTTGAGACAGAACTGATAGTTTCTGCTACGGTTCAGGAAGGAATACTGAGAAGGAACATGAGACAGAGGTCGAAGACAAGTCGC
AGGAAGAAAAAAGGGAAGCTGAGAATAAGGAAAAAGAAGAAGAGGCAAGAAAGGCAGAAGAAGAGCGTTTGCGTACAGAGAGAAAGCAAGGGCAAAGGAATTGCCGAAGC
ATCGGGAGAAATTGAGGAGCCGAGGGCACCATTCATTCGCTTCGTCAACGATCTTGCTCGAGCAAAATACCAGGAGGTGCTGAAACGGGACTTCTTGTTCGAACGAGGAT
TTGGCAGTGATTTGCCCAGGTTCTTGGAGTCTGGAATAGTGAACCTCGGATGGAGGCAATTTTGTGCGAAACCAGAACCTGTCAATTCCAACATTGTTCGAGAATTTTAT
GCCAATCTTGACGTTAAGAATGATTTTGAGGTTATCGTTCGAGGAGTGCCTGTACAGTGGAGTCCTGAGGCCATTAATGAATTGTTCGATCTCCAGGATTTTCCGCATGC
CGTTTTTAATGAGATGGTGGTTGCACCATCTAGTGATCAACTGAGTGCGGCTGTCCGGGAGGTAGGCATTGAGGGGGCTCAATGGAGGGTGTCGCAGACGCGGAAGCATA
CGTTTCAAGCTGCTTATTTGAAGAGTGAAGCCAACACTTGGATGGGTTTCATCAGGCTACGCTTGCTGCCGACAACACACGACTCCACAGTATCTCGGGACAGGGTATTG
CTTGCTTTTGCCATTCTTCGCTCGATGAGTATTGATGTAGGAAAAATTATTTCTTCTGAGATTGTTGATTGCTGGAAAAAGAAGGTGGGGAAGCTGTTCTTTCCAACACA
TTACATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCCAGAAGATATGATCATGACTGATAAGGGAATCATTGACACACCTAATCTGGCGCGGCTTCAGCGTATGCAAG
AGGTTCGCCAGGGTGGGCTTGTGTATGGCGTTAATCAGATCCTAGAGCAACTGTCAGTGTTGGCCAGTAGGTTAGAATTTGCTGAAAGGCAAGCTCAGACCTACTGGACT
TATGCTAAAAGGAGAGATGATGCGCTCAGGGGGCCTTGCAAACCAATTTCTCAACACCGTATCCGGCCTTTCCAGTGTTTCCCGATGATTTGTTTAATCTTTGGATACCA
CCCCCGCCTGTTGAACGAGAAGAGGATGTTGATGAGGAGCAGGGTCAGGAAGACTGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGGAGCAAAATATTCCGAATTAGAAGGGTTTCCGTTGGAATTTATTATTTTTACCGTTGGATTTGTTTTGAATTCTCGCAGAGAGGAGAGTGAAGAGGAGGAGAT
ACCGGTCACACCGGAAGTGCAAAAAGGGAAAACTAAGAAGAAAAGAACGCCAGAAGAGAAGGAGGCGAAGCGAAGACGGAGGCAGCAGAGGGCTGCGGAACAAGAAGTCA
TCCAAGAAACAGAGGCTGTTCAGAATCTTGAGACAGAACTGATAGTTTCTGCTACGGTTCAGGAAGGAATACTGAGAAGGAACATGAGACAGAGGTCGAAGACAAGTCGC
AGGAAGAAAAAAGGGAAGCTGAGAATAAGGAAAAAGAAGAAGAGGCAAGAAAGGCAGAAGAAGAGCGTTTGCGTACAGAGAGAAAGCAAGGGCAAAGGAATTGCCGAAGC
ATCGGGAGAAATTGAGGAGCCGAGGGCACCATTCATTCGCTTCGTCAACGATCTTGCTCGAGCAAAATACCAGGAGGTGCTGAAACGGGACTTCTTGTTCGAACGAGGAT
TTGGCAGTGATTTGCCCAGGTTCTTGGAGTCTGGAATAGTGAACCTCGGATGGAGGCAATTTTGTGCGAAACCAGAACCTGTCAATTCCAACATTGTTCGAGAATTTTAT
GCCAATCTTGACGTTAAGAATGATTTTGAGGTTATCGTTCGAGGAGTGCCTGTACAGTGGAGTCCTGAGGCCATTAATGAATTGTTCGATCTCCAGGATTTTCCGCATGC
CGTTTTTAATGAGATGGTGGTTGCACCATCTAGTGATCAACTGAGTGCGGCTGTCCGGGAGGTAGGCATTGAGGGGGCTCAATGGAGGGTGTCGCAGACGCGGAAGCATA
CGTTTCAAGCTGCTTATTTGAAGAGTGAAGCCAACACTTGGATGGGTTTCATCAGGCTACGCTTGCTGCCGACAACACACGACTCCACAGTATCTCGGGACAGGGTATTG
CTTGCTTTTGCCATTCTTCGCTCGATGAGTATTGATGTAGGAAAAATTATTTCTTCTGAGATTGTTGATTGCTGGAAAAAGAAGGTGGGGAAGCTGTTCTTTCCAACACA
TTACATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCCAGAAGATATGATCATGACTGATAAGGGAATCATTGACACACCTAATCTGGCGCGGCTTCAGCGTATGCAAG
AGGTTCGCCAGGGTGGGCTTGTGTATGGCGTTAATCAGATCCTAGAGCAACTGTCAGTGTTGGCCAGTAGGTTAGAATTTGCTGAAAGGCAAGCTCAGACCTACTGGACT
TATGCTAAAAGGAGAGATGATGCGCTCAGGGGGCCTTGCAAACCAATTTCTCAACACCGTATCCGGCCTTTCCAGTGTTTCCCGATGATTTGTTTAATCTTTGGATACCA
CCCCCGCCTGTTGAACGAGAAGAGGATGTTGATGAGGAGCAGGGTCAGGAAGACTGACTGA
Protein sequenceShow/hide protein sequence
MAGAKYSELEGFPLEFIIFTVGFVLNSRREESEEEEIPVTPEVQKGKTKKKRTPEEKEAKRRRRQQRAAEQEVIQETEAVQNLETELIVSATVQEGILRRNMRQRSKTSR
RKKKGKLRIRKKKKRQERQKKSVCVQRESKGKGIAEASGEIEEPRAPFIRFVNDLARAKYQEVLKRDFLFERGFGSDLPRFLESGIVNLGWRQFCAKPEPVNSNIVREFY
ANLDVKNDFEVIVRGVPVQWSPEAINELFDLQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVL
LAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPTHYMLCSRAGVPTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVYGVNQILEQLSVLASRLEFAERQAQTYWT
YAKRRDDALRGPCKPISQHRIRPFQCFPMICLIFGYHPRLLNEKRMLMRSRVRKTD