CuGenDBv2

PI0005330 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0005330
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: DNA-binding WRKY
Genome location: chr09:2988182..2990422
RNA-Seq Expression: PI0005330
Synteny: PI0005330
Gene Ontology terms: GO:0003677 - DNA binding (molecular function)
InterPro domains: IPR008581 - Protein of unknown function DUF863, plant


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039690.1 DNA-binding WRKY [Cucumis melo var. makuwa] (e value: 8.8e-255; % identity: 82.03)
Query:  MLSQEVMFRKQ------------------VHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKL
        MLSQEVMFRKQ                  VHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYA PTRYDPF KETVVSSI MLEKHPAKNHKL
Subjt:  MLSQEVMFRKQ------------------VHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKL

Query:  RHRPLDLQLPPDQYVSLI--DLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLP
        RH PLDLQLPPDQYVSLI  DLEELDLSLDLKIGNPKKE DEEILSYKKSR            SVDGDAEN+YSLDLNVPTIQSIEFET  NH SSDNLP
Subjt:  RHRPLDLQLPPDQYVSLI--DLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLP

Query:  IKNE-LRPREARYLDLNEAQSDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVT
        IKNE LRPREARYLDLNEAQSDDMIT HYSTS SS G KEADSKGQQANCSSQIWV +KNNYCSTESST EQDANLDVMDCGSGNER ETHSTESK K  
Subjt:  IKNE-LRPREARYLDLNEAQSDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVT

Query:  STGEMNNCQY--------------FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGE----------
        STGEMNN Q               FSNCYDEESKKLEAVI PPADIHARLQKSEVCSDCSHAVEDGCNSILTVT+SG+STC AENDS GE          
Subjt:  STGEMNNCQY--------------FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGE----------

Query:  --KKELHSTETNFSSGQDHRSSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDE
          +KELHSTET FSSGQDHRSSGSIESEHGEESS+M+VLLQNAVETLICMSLNDSAFDHDC TKTESSEM KDQVDQPQHSCDSFELLVL QTEN+EDDE
Subjt:  --KKELHSTETNFSSGQDHRSSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDE

Query:  FSISSSQLSE------------LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRII
        FSISSSQLSE            LRRGRRLKDFRREILPGLSCLSRHEICEDINI+E VLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRII
Subjt:  FSISSSQLSE------------LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRII

Query:  L
        L
Subjt:  L

XP_004147616.1 uncharacterized protein LOC101221869 isoform X2 [Cucumis sativus] (e value: 1.6e-256; % identity: 83.91)
Query:  MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKLRHRPLDLQLPPDQYVSLI
        MLSQEVMFRKQVHQLHQLYSVQRILMQNFGF+ELDRCRFKKAGIIPTFMPYA PTRYDPFMKETVVSSI M EKHPAKNHKLRH PLDLQLPPDQYVSLI
Subjt:  MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKLRHRPLDLQLPPDQYVSLI

Query:  DLEELDLSLDLKIGNPKKENDEEILSYKKSRH-----------SVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLPIKNE-LRPREARYLDLNEAQ
        DLEELDLSLDLKIGNPKKEND+EILSYKKSR            SVDGDAENVYSLDLNVPTIQ +EFET  NH SSDNL +KNE LRPREARYLDLNEAQ
Subjt:  DLEELDLSLDLKIGNPKKENDEEILSYKKSRH-----------SVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLPIKNE-LRPREARYLDLNEAQ

Query:  SDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTSTGEMNNCQ------YFSNC
        SDDMIT HYSTS SSPGIKEAD KGQQANCSS+IWVR+KNNYCS ESSTLEQDANLDV DCGSGNERNETHSTESK K TSTGEMNNCQ        S  
Subjt:  SDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTSTGEMNNCQ------YFSNC

Query:  YDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGEK-----------------KELHSTETNFSSGQDHRSSG
        + +ESKKLEAVIEPPAD+HARLQKSEVCSDCSHAVEDGCNSILT TVSG STCNAENDS GEK                 KELHSTET FSSGQDHRSSG
Subjt:  YDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGEK-----------------KELHSTETNFSSGQDHRSSG

Query:  SIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE------------L
        SIESEHGEESS+MKVLLQNAVETLI MSLNDSAFDHDC+TKTESSEM KDQVDQPQHSCDSFELLVLKQTEN+EDDEFS+SSSQLSE            L
Subjt:  SIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE------------L

Query:  RRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL
        RRGRRLKDFR+EILPGLSCLSRHEICEDINI+EAVLRSREYRKN+AKI+DGQKVCSP KSKRSQSRSRLNNTRRRIIL
Subjt:  RRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL

XP_008437121.1 PREDICTED: uncharacterized protein LOC103482637 isoform X1 [Cucumis melo] (e value: 2.7e-256; % identity: 82.3)
Query:  MLSQEVMFRKQ------------------VHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKL
        MLSQEVMFRKQ                  VHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYA PTRYDPF KETVVSSI MLEKHPAKNHKL
Subjt:  MLSQEVMFRKQ------------------VHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKL

Query:  RHRPLDLQLPPDQYVSLIDLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLPIK
        RH PLDLQLPPDQYVSLIDLEELDLSLDLKIGNPKKE DEEILSYKKSR            SVDGDAEN+YSLDLNVPTIQSIEFET  NH SSDNLPIK
Subjt:  RHRPLDLQLPPDQYVSLIDLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLPIK

Query:  NE-LRPREARYLDLNEAQSDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTST
        NE LRPREARYLDLNEAQSDDMIT HYSTS SS G KEADSKGQQANCSSQIWV +KNNYCSTESST EQDANLDVMDCGSGNER ETHSTESK K  ST
Subjt:  NE-LRPREARYLDLNEAQSDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTST

Query:  GEMNNCQY--------------FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGE------------
        GEMNN Q               FSNCYDEESKKLEAVI PPADIHARLQKSEVCSDCSHAVEDGCNSILTVT+SG+STC AENDS GE            
Subjt:  GEMNNCQY--------------FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGE------------

Query:  KKELHSTETNFSSGQDHRSSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFS
        +KELHSTET FSSGQDHRSSGSIESEHGEESS+M+VLLQNAVETLICMSLNDSAFDHDC TKTESSEM KDQVDQPQHSCDSFELLVL QTEN+EDDEFS
Subjt:  KKELHSTETNFSSGQDHRSSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFS

Query:  ISSSQLSE------------LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL
        ISSSQLSE            LRRGRRLKDFRREILPGLSCLSRHEICEDINI+E VLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL
Subjt:  ISSSQLSE------------LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL

XP_008437122.1 PREDICTED: uncharacterized protein LOC103482637 isoform X2 [Cucumis melo] (e value: 1.2e-259; % identity: 84.85)
Query:  MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKLRHRPLDLQLPPDQYVSLI
        MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYA PTRYDPF KETVVSSI MLEKHPAKNHKLRH PLDLQLPPDQYVSLI
Subjt:  MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKLRHRPLDLQLPPDQYVSLI

Query:  DLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLPIKNE-LRPREARYLDLNEAQ
        DLEELDLSLDLKIGNPKKE DEEILSYKKSR            SVDGDAEN+YSLDLNVPTIQSIEFET  NH SSDNLPIKNE LRPREARYLDLNEAQ
Subjt:  DLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLPIKNE-LRPREARYLDLNEAQ

Query:  SDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTSTGEMNNCQY----------
        SDDMIT HYSTS SS G KEADSKGQQANCSSQIWV +KNNYCSTESST EQDANLDVMDCGSGNER ETHSTESK K  STGEMNN Q           
Subjt:  SDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTSTGEMNNCQY----------

Query:  ----FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGE------------KKELHSTETNFSSGQDHR
            FSNCYDEESKKLEAVI PPADIHARLQKSEVCSDCSHAVEDGCNSILTVT+SG+STC AENDS GE            +KELHSTET FSSGQDHR
Subjt:  ----FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGE------------KKELHSTETNFSSGQDHR

Query:  SSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE----------
        SSGSIESEHGEESS+M+VLLQNAVETLICMSLNDSAFDHDC TKTESSEM KDQVDQPQHSCDSFELLVL QTEN+EDDEFSISSSQLSE          
Subjt:  SSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE----------

Query:  --LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL
          LRRGRRLKDFRREILPGLSCLSRHEICEDINI+E VLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL
Subjt:  --LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL

XP_031742261.1 uncharacterized protein LOC101221869 isoform X1 [Cucumis sativus] (e value: 5.2e-255; % identity: 83.62)
Query:  MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKLRHRPLDLQLPPDQYVSLI
        MLSQEVMFRKQVHQLHQLYSVQRILMQNFGF+ELDRCRFKKAGIIPTFMPYA PTRYDPFMKETVVSSI M EKHPAKNHKLRH PLDLQLPPDQYVSLI
Subjt:  MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKLRHRPLDLQLPPDQYVSLI

Query:  DLEELDLSLDLKIGNPKKENDEEILSYKKSRH-----------SVDGDAENVYSLDLNVPTIQSI--EFETRHNHSSSDNLPIKNE-LRPREARYLDLNE
        DLEELDLSLDLKIGNPKKEND+EILSYKKSR            SVDGDAENVYSLDLNVPTIQ +  EFET  NH SSDNL +KNE LRPREARYLDLNE
Subjt:  DLEELDLSLDLKIGNPKKENDEEILSYKKSRH-----------SVDGDAENVYSLDLNVPTIQSI--EFETRHNHSSSDNLPIKNE-LRPREARYLDLNE

Query:  AQSDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTSTGEMNNCQ------YFS
        AQSDDMIT HYSTS SSPGIKEAD KGQQANCSS+IWVR+KNNYCS ESSTLEQDANLDV DCGSGNERNETHSTESK K TSTGEMNNCQ        S
Subjt:  AQSDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTSTGEMNNCQ------YFS

Query:  NCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGEK-----------------KELHSTETNFSSGQDHRS
          + +ESKKLEAVIEPPAD+HARLQKSEVCSDCSHAVEDGCNSILT TVSG STCNAENDS GEK                 KELHSTET FSSGQDHRS
Subjt:  NCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGEK-----------------KELHSTETNFSSGQDHRS

Query:  SGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE-----------
        SGSIESEHGEESS+MKVLLQNAVETLI MSLNDSAFDHDC+TKTESSEM KDQVDQPQHSCDSFELLVLKQTEN+EDDEFS+SSSQLSE           
Subjt:  SGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE-----------

Query:  -LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL
         LRRGRRLKDFR+EILPGLSCLSRHEICEDINI+EAVLRSREYRKN+AKI+DGQKVCSP KSKRSQSRSRLNNTRRRIIL
Subjt:  -LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KMU7 Uncharacterized protein (e value: 7.8e-257; % identity: 83.91)
Query:  MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKLRHRPLDLQLPPDQYVSLI
        MLSQEVMFRKQVHQLHQLYSVQRILMQNFGF+ELDRCRFKKAGIIPTFMPYA PTRYDPFMKETVVSSI M EKHPAKNHKLRH PLDLQLPPDQYVSLI
Subjt:  MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKLRHRPLDLQLPPDQYVSLI

Query:  DLEELDLSLDLKIGNPKKENDEEILSYKKSRH-----------SVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLPIKNE-LRPREARYLDLNEAQ
        DLEELDLSLDLKIGNPKKEND+EILSYKKSR            SVDGDAENVYSLDLNVPTIQ +EFET  NH SSDNL +KNE LRPREARYLDLNEAQ
Subjt:  DLEELDLSLDLKIGNPKKENDEEILSYKKSRH-----------SVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLPIKNE-LRPREARYLDLNEAQ

Query:  SDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTSTGEMNNCQ------YFSNC
        SDDMIT HYSTS SSPGIKEAD KGQQANCSS+IWVR+KNNYCS ESSTLEQDANLDV DCGSGNERNETHSTESK K TSTGEMNNCQ        S  
Subjt:  SDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTSTGEMNNCQ------YFSNC

Query:  YDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGEK-----------------KELHSTETNFSSGQDHRSSG
        + +ESKKLEAVIEPPAD+HARLQKSEVCSDCSHAVEDGCNSILT TVSG STCNAENDS GEK                 KELHSTET FSSGQDHRSSG
Subjt:  YDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGEK-----------------KELHSTETNFSSGQDHRSSG

Query:  SIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE------------L
        SIESEHGEESS+MKVLLQNAVETLI MSLNDSAFDHDC+TKTESSEM KDQVDQPQHSCDSFELLVLKQTEN+EDDEFS+SSSQLSE            L
Subjt:  SIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE------------L

Query:  RRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL
        RRGRRLKDFR+EILPGLSCLSRHEICEDINI+EAVLRSREYRKN+AKI+DGQKVCSP KSKRSQSRSRLNNTRRRIIL
Subjt:  RRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL

A0A1S3ASW5 uncharacterized protein LOC103482637 isoform X2 (e value: 5.8e-260; % identity: 84.85)
Query:  MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKLRHRPLDLQLPPDQYVSLI
        MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYA PTRYDPF KETVVSSI MLEKHPAKNHKLRH PLDLQLPPDQYVSLI
Subjt:  MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKLRHRPLDLQLPPDQYVSLI

Query:  DLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLPIKNE-LRPREARYLDLNEAQ
        DLEELDLSLDLKIGNPKKE DEEILSYKKSR            SVDGDAEN+YSLDLNVPTIQSIEFET  NH SSDNLPIKNE LRPREARYLDLNEAQ
Subjt:  DLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLPIKNE-LRPREARYLDLNEAQ

Query:  SDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTSTGEMNNCQY----------
        SDDMIT HYSTS SS G KEADSKGQQANCSSQIWV +KNNYCSTESST EQDANLDVMDCGSGNER ETHSTESK K  STGEMNN Q           
Subjt:  SDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTSTGEMNNCQY----------

Query:  ----FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGE------------KKELHSTETNFSSGQDHR
            FSNCYDEESKKLEAVI PPADIHARLQKSEVCSDCSHAVEDGCNSILTVT+SG+STC AENDS GE            +KELHSTET FSSGQDHR
Subjt:  ----FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGE------------KKELHSTETNFSSGQDHR

Query:  SSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE----------
        SSGSIESEHGEESS+M+VLLQNAVETLICMSLNDSAFDHDC TKTESSEM KDQVDQPQHSCDSFELLVL QTEN+EDDEFSISSSQLSE          
Subjt:  SSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE----------

Query:  --LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL
          LRRGRRLKDFRREILPGLSCLSRHEICEDINI+E VLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL
Subjt:  --LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL

A0A1S3ATS7 uncharacterized protein LOC103482637 isoform X1 (e value: 1.3e-256; % identity: 82.3)
Query:  MLSQEVMFRKQ------------------VHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKL
        MLSQEVMFRKQ                  VHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYA PTRYDPF KETVVSSI MLEKHPAKNHKL
Subjt:  MLSQEVMFRKQ------------------VHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKL

Query:  RHRPLDLQLPPDQYVSLIDLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLPIK
        RH PLDLQLPPDQYVSLIDLEELDLSLDLKIGNPKKE DEEILSYKKSR            SVDGDAEN+YSLDLNVPTIQSIEFET  NH SSDNLPIK
Subjt:  RHRPLDLQLPPDQYVSLIDLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLPIK

Query:  NE-LRPREARYLDLNEAQSDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTST
        NE LRPREARYLDLNEAQSDDMIT HYSTS SS G KEADSKGQQANCSSQIWV +KNNYCSTESST EQDANLDVMDCGSGNER ETHSTESK K  ST
Subjt:  NE-LRPREARYLDLNEAQSDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTST

Query:  GEMNNCQY--------------FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGE------------
        GEMNN Q               FSNCYDEESKKLEAVI PPADIHARLQKSEVCSDCSHAVEDGCNSILTVT+SG+STC AENDS GE            
Subjt:  GEMNNCQY--------------FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGE------------

Query:  KKELHSTETNFSSGQDHRSSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFS
        +KELHSTET FSSGQDHRSSGSIESEHGEESS+M+VLLQNAVETLICMSLNDSAFDHDC TKTESSEM KDQVDQPQHSCDSFELLVL QTEN+EDDEFS
Subjt:  KKELHSTETNFSSGQDHRSSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFS

Query:  ISSSQLSE------------LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL
        ISSSQLSE            LRRGRRLKDFRREILPGLSCLSRHEICEDINI+E VLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL
Subjt:  ISSSQLSE------------LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL

A0A5A7T8W5 DNA-binding WRKY (e value: 4.3e-255; % identity: 82.03)
Query:  MLSQEVMFRKQ------------------VHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKL
        MLSQEVMFRKQ                  VHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYA PTRYDPF KETVVSSI MLEKHPAKNHKL
Subjt:  MLSQEVMFRKQ------------------VHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKL

Query:  RHRPLDLQLPPDQYVSLI--DLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLP
        RH PLDLQLPPDQYVSLI  DLEELDLSLDLKIGNPKKE DEEILSYKKSR            SVDGDAEN+YSLDLNVPTIQSIEFET  NH SSDNLP
Subjt:  RHRPLDLQLPPDQYVSLI--DLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLP

Query:  IKNE-LRPREARYLDLNEAQSDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVT
        IKNE LRPREARYLDLNEAQSDDMIT HYSTS SS G KEADSKGQQANCSSQIWV +KNNYCSTESST EQDANLDVMDCGSGNER ETHSTESK K  
Subjt:  IKNE-LRPREARYLDLNEAQSDDMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVT

Query:  STGEMNNCQY--------------FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGE----------
        STGEMNN Q               FSNCYDEESKKLEAVI PPADIHARLQKSEVCSDCSHAVEDGCNSILTVT+SG+STC AENDS GE          
Subjt:  STGEMNNCQY--------------FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGE----------

Query:  --KKELHSTETNFSSGQDHRSSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDE
          +KELHSTET FSSGQDHRSSGSIESEHGEESS+M+VLLQNAVETLICMSLNDSAFDHDC TKTESSEM KDQVDQPQHSCDSFELLVL QTEN+EDDE
Subjt:  --KKELHSTETNFSSGQDHRSSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDE

Query:  FSISSSQLSE------------LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRII
        FSISSSQLSE            LRRGRRLKDFRREILPGLSCLSRHEICEDINI+E VLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRII
Subjt:  FSISSSQLSE------------LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRII

Query:  L
        L
Subjt:  L

A0A6J1KAE2 uncharacterized protein LOC111491562 isoform X3 (e value: 1.4e-173; % identity: 63.59)
Query:  MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKLRHRPLDLQLPPDQYVSLI
        MLSQEVMFR QV +LH+LY VQRILMQNFG EE DR  F+KAG+  TFMPYA P RYDPFMKET V SIRML+K PA+N KL+ + L+LQLP DQY+SLI
Subjt:  MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKLRHRPLDLQLPPDQYVSLI

Query:  DLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSI--EFETRHNHSSSDNLPIKN-ELRPREARYLDLNE
        DLEELDLSLDL +GN +KE+D+EIL ++KSR            S+D DAENVYSLDLNVPTI+    E E  H H SSD LPIKN +    EA YLDLNE
Subjt:  DLEELDLSLDLKIGNPKKENDEEILSYKKSR-----------HSVDGDAENVYSLDLNVPTIQSI--EFETRHNHSSSDNLPIKN-ELRPREARYLDLNE

Query:  AQSD-------DMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTSTGEMNNCQY-
        AQ+D       D+I   YSTS SS G K A  K QQANC+S I V+EKNN CSTESSTL+QDA             NETH+TESKFK T+T EM N Q+ 
Subjt:  AQSD-------DMITIHYSTSCSSPGIKEADSKGQQANCSSQIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTSTGEMNNCQY-

Query:  -------------FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGEK----------------KELH
                     F NC D ES+KLEA+IEPPAD H R+QKSEVCSDC+HA EDGCN +L  T+SG+S CNAENDS GEK                K LH
Subjt:  -------------FSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVSGVSTCNAENDSSGEK----------------KELH

Query:  STETNFSSGQDHRSSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQ
        STET  S+ QDH+SS SIESEH EESSEMK+LLQNA E+L+ MSL DS   HDCNTKTES+ MGK +VDQPQHS DSFELLVLKQ EN ED+EFS+ SSQ
Subjt:  STETNFSSGQDHRSSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQ

Query:  LSE------------LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRL
        LSE            LRRGRRLKDF+REILP LSCLSRHEICEDINI+EAVLRSREYRK RAK+QDGQK   PTKSKRS+ R+ L
Subjt:  LSE------------LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G12120.1 Plant protein of unknown function (DUF863) (e value: 9.1e-16; % identity: 35.04)
Query:  SGVSTCNAENDSSGEKKELHS---------TETNFSSGQDHRSSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTK----TESSEMGKDQ
        S  S C  EN+S  E +   S         T   F++ +D        +E  ++SSE   ++Q A E+L+ +S   S  + D  +K    T SS   +D 
Subjt:  SGVSTCNAENDSSGEKKELHS---------TETNFSSGQDHRSSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTK----TESSEMGKDQ

Query:  VDQPQH-------SCDSFELLVLKQTENEEDDEFSISSSQLSE---------------LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREY
         D+P+        S DS+E   L  +E   +++F +SS  L E               LRRGRR+K+F++EILP L+ LSRHEI ED+NILEAVLRSREY
Subjt:  VDQPQH-------SCDSFELLVLKQTENEEDDEFSISSSQLSE---------------LRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREY

Query:  RKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRR
        +K + K +D +   +P ++KRS  +  +   RR+
Subjt:  RKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRR

AT1G26620.1 Plant protein of unknown function (DUF863) (e value: 7.7e-07; % identity: 30.69)
Query:  ESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDD--------------EFSISSSQLSELRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSRE
        E+++   ++ D      D FE + L   E +E+D              +  I+  +  + RRGR  +DF+R+ LPGLS LSRHE+ EDI +   ++++ +
Subjt:  ESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDD--------------EFSISSSQLSELRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSRE

Query:  Y
        Y
Subjt:  Y

AT1G62530.1 Plant protein of unknown function (DUF863) (e value: 7.7e-15; % identity: 36.53)
Query:  GEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE----------LRRGRRLKD
        GE+S E   ++Q A E L+ +S       H               V +P  SCDSFEL  L+  E   ++   +SS  + +          LRRGRR+K+
Subjt:  GEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE----------LRRGRRLKD

Query:  FRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPT-KSKRSQSRSRLNNTRR
        F++EILP L  LSRHEI EDIN+LE V RSR+Y+K + K +DG+  C P  ++ +  +++ +   RR
Subjt:  FRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPT-KSKRSQSRSRLNNTRR

AT1G62530.2 Plant protein of unknown function (DUF863) (e value: 7.7e-15; % identity: 36.53)
Query:  GEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE----------LRRGRRLKD
        GE+S E   ++Q A E L+ +S       H               V +P  SCDSFEL  L+  E   ++   +SS  + +          LRRGRR+K+
Subjt:  GEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEEDDEFSISSSQLSE----------LRRGRRLKD

Query:  FRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPT-KSKRSQSRSRLNNTRR
        F++EILP L  LSRHEI EDIN+LE V RSR+Y+K + K +DG+  C P  ++ +  +++ +   RR
Subjt:  FRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPT-KSKRSQSRSRLNNTRR


Sequences
CDS sequence
ATGCTGAGCCAGGAGGTTATGTTTAGGAAGCAGGTTCATCAATTACATCAATTGTACAGTGTACAAAGGATACTAATGCAGAATTTTGGCTTCGAGGAGCTTGATCGATG
CCGTTTTAAGAAAGCAGGGATAATACCAACATTCATGCCATATGCATATCCCACAAGATATGATCCATTCATGAAAGAAACCGTAGTTTCCTCAATTCGCATGCTTGAAA
AGCACCCAGCTAAAAACCACAAGCTTCGGCACAGGCCACTCGATTTACAGCTTCCTCCAGATCAGTACGTTAGCCTCATTGACTTGGAAGAGTTAGATCTTTCCCTTGAT
CTCAAAATTGGGAACCCAAAAAAGGAGAATGACGAAGAAATACTTTCATACAAGAAGTCTCGTCATTCAGTTGATGGGGATGCAGAAAATGTATATTCTTTAGATCTAAA
TGTTCCAACAATTCAATCCATAGAATTTGAAACCCGCCATAATCATAGCTCAAGTGACAATCTCCCTATCAAGAATGAGTTGAGACCCCGTGAAGCAAGATATCTTGACC
TTAATGAAGCTCAGAGCGATGATATGATTACGATTCACTATTCAACCTCATGTTCGTCACCTGGCATCAAGGAAGCAGATAGCAAGGGACAACAAGCCAATTGCTCCTCA
CAAATTTGGGTCAGAGAGAAGAATAACTACTGTTCTACTGAATCTTCCACACTTGAACAAGATGCAAATCTAGATGTGATGGATTGTGGAAGTGGAAATGAGAGAAATGA
AACTCACTCAACAGAGTCCAAATTTAAGGTAACAAGTACAGGTGAGATGAATAACTGTCAATACTTCAGTAATTGTTACGATGAAGAAAGTAAAAAATTGGAGGCAGTAA
TTGAGCCTCCTGCTGACATCCATGCAAGGCTTCAAAAGAGTGAAGTATGCTCTGATTGTAGTCATGCTGTGGAAGATGGTTGCAACAGCATATTGACGGTGACTGTATCT
GGCGTGTCTACCTGTAACGCAGAAAATGACTCAAGTGGGGAGAAGAAGGAGCTGCATAGTACAGAAACCAACTTTTCCAGTGGGCAAGATCATAGGTCTTCTGGTAGCAT
TGAATCAGAACATGGTGAAGAATCTTCTGAAATGAAAGTTCTACTTCAAAACGCGGTCGAAACACTTATTTGTATGTCTTTAAATGATTCAGCCTTTGATCATGATTGCA
ACACAAAAACAGAATCTAGTGAGATGGGAAAAGATCAGGTGGATCAACCACAACACTCTTGTGATTCCTTTGAGTTACTAGTCTTAAAGCAAACAGAAAACGAAGAAGAC
GACGAGTTCTCCATATCATCATCACAGTTATCTGAATTAAGAAGAGGACGAAGACTGAAAGACTTCCGAAGAGAGATACTTCCTGGTCTATCCTGTCTTTCAAGACATGA
GATTTGTGAAGATATTAACATTTTGGAGGCTGTTTTACGGTCAAGAGAATACCGAAAAAACCGAGCTAAAATCCAAGATGGACAGAAAGTTTGCAGTCCCACAAAAAGCA
AACGATCTCAATCAAGATCTCGGCTAAACAACACAAGACGAAGAATCATTTTGTGA
mRNA sequence
ATGCTGAGCCAGGAGGTTATGTTTAGGAAGCAGGTTCATCAATTACATCAATTGTACAGTGTACAAAGGATACTAATGCAGAATTTTGGCTTCGAGGAGCTTGATCGATG
CCGTTTTAAGAAAGCAGGGATAATACCAACATTCATGCCATATGCATATCCCACAAGATATGATCCATTCATGAAAGAAACCGTAGTTTCCTCAATTCGCATGCTTGAAA
AGCACCCAGCTAAAAACCACAAGCTTCGGCACAGGCCACTCGATTTACAGCTTCCTCCAGATCAGTACGTTAGCCTCATTGACTTGGAAGAGTTAGATCTTTCCCTTGAT
CTCAAAATTGGGAACCCAAAAAAGGAGAATGACGAAGAAATACTTTCATACAAGAAGTCTCGTCATTCAGTTGATGGGGATGCAGAAAATGTATATTCTTTAGATCTAAA
TGTTCCAACAATTCAATCCATAGAATTTGAAACCCGCCATAATCATAGCTCAAGTGACAATCTCCCTATCAAGAATGAGTTGAGACCCCGTGAAGCAAGATATCTTGACC
TTAATGAAGCTCAGAGCGATGATATGATTACGATTCACTATTCAACCTCATGTTCGTCACCTGGCATCAAGGAAGCAGATAGCAAGGGACAACAAGCCAATTGCTCCTCA
CAAATTTGGGTCAGAGAGAAGAATAACTACTGTTCTACTGAATCTTCCACACTTGAACAAGATGCAAATCTAGATGTGATGGATTGTGGAAGTGGAAATGAGAGAAATGA
AACTCACTCAACAGAGTCCAAATTTAAGGTAACAAGTACAGGTGAGATGAATAACTGTCAATACTTCAGTAATTGTTACGATGAAGAAAGTAAAAAATTGGAGGCAGTAA
TTGAGCCTCCTGCTGACATCCATGCAAGGCTTCAAAAGAGTGAAGTATGCTCTGATTGTAGTCATGCTGTGGAAGATGGTTGCAACAGCATATTGACGGTGACTGTATCT
GGCGTGTCTACCTGTAACGCAGAAAATGACTCAAGTGGGGAGAAGAAGGAGCTGCATAGTACAGAAACCAACTTTTCCAGTGGGCAAGATCATAGGTCTTCTGGTAGCAT
TGAATCAGAACATGGTGAAGAATCTTCTGAAATGAAAGTTCTACTTCAAAACGCGGTCGAAACACTTATTTGTATGTCTTTAAATGATTCAGCCTTTGATCATGATTGCA
ACACAAAAACAGAATCTAGTGAGATGGGAAAAGATCAGGTGGATCAACCACAACACTCTTGTGATTCCTTTGAGTTACTAGTCTTAAAGCAAACAGAAAACGAAGAAGAC
GACGAGTTCTCCATATCATCATCACAGTTATCTGAATTAAGAAGAGGACGAAGACTGAAAGACTTCCGAAGAGAGATACTTCCTGGTCTATCCTGTCTTTCAAGACATGA
GATTTGTGAAGATATTAACATTTTGGAGGCTGTTTTACGGTCAAGAGAATACCGAAAAAACCGAGCTAAAATCCAAGATGGACAGAAAGTTTGCAGTCCCACAAAAAGCA
AACGATCTCAATCAAGATCTCGGCTAAACAACACAAGACGAAGAATCATTTTGTGA
Protein sequence
MLSQEVMFRKQVHQLHQLYSVQRILMQNFGFEELDRCRFKKAGIIPTFMPYAYPTRYDPFMKETVVSSIRMLEKHPAKNHKLRHRPLDLQLPPDQYVSLIDLEELDLSLD
LKIGNPKKENDEEILSYKKSRHSVDGDAENVYSLDLNVPTIQSIEFETRHNHSSSDNLPIKNELRPREARYLDLNEAQSDDMITIHYSTSCSSPGIKEADSKGQQANCSS
QIWVREKNNYCSTESSTLEQDANLDVMDCGSGNERNETHSTESKFKVTSTGEMNNCQYFSNCYDEESKKLEAVIEPPADIHARLQKSEVCSDCSHAVEDGCNSILTVTVS
GVSTCNAENDSSGEKKELHSTETNFSSGQDHRSSGSIESEHGEESSEMKVLLQNAVETLICMSLNDSAFDHDCNTKTESSEMGKDQVDQPQHSCDSFELLVLKQTENEED
DEFSISSSQLSELRRGRRLKDFRREILPGLSCLSRHEICEDINILEAVLRSREYRKNRAKIQDGQKVCSPTKSKRSQSRSRLNNTRRRIIL