; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005243 (gene) of Snake gourd v1 genome

Gene IDTan0005243
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionEnhancer of polycomb-like protein
Genome locationLG11:7826059..7830525
RNA-Seq ExpressionTan0005243
SyntenyTan0005243
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0016573 - histone acetylation (biological process)
GO:0032777 - Piccolo NuA4 histone acetyltransferase complex (cellular component)
GO:0004402 - histone acetyltransferase activity (molecular function)
InterPro domainsIPR019542 - Enhancer of polycomb-like, N-terminal
IPR024943 - Enhancer of polycomb protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602262.1 Organelle RRM domain-containing protein 1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0080.95Show/hide
Query:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE--DACVAEVPE
        MPTVGMRRTRV GLKGVDG RVLRSGRRLC E  EAKLKK+KDV+DWYPV++ RGNGGGSGQVRFH KWQG+RNVKPKRVVVVNIREEE  DACVAEVP+
Subjt:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE--DACVAEVPE

Query:  PVKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSIS----PPPR
        PVKV  RINGDGE G VDRM+G+VYRRKRKRGLS+NGDVFDEM        DRMFGLRFIRRQRSRK  V HWEPTAGG S KL FH+QSIS    PPP+
Subjt:  PVKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSIS----PPPR

Query:  DRVLTIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHS
        DRVLT+F+GS+ DG GCFSDFMLSVLRH KSPELG+AK SAFLLS PIH VFAS+GMRFLQSYP  GSSGMCVIFGA+QSIPMFHLDFSAVP+ FM+LHS
Subjt:  DRVLTIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHS

Query:  KMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRM
         M  R TWIQA LVYNN QLDVDMSSD+E+D+ EE  VSS PG   LE K+M  GVDHTK+RSISHPS+RASRLGSRT+QYRNGFS RG++KRRSS R M
Subjt:  KMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRM

Query:  RRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWL
        RRPRSHSLAAMQKA GSL A+DMK  VSFPSAASC RH NS  R+SAGRIRE SS AL SAMDV SSCCNANILIVESDKCLR EG  IVLEFS+SCEWL
Subjt:  RRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWL

Query:  LVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIYV
        LVVKKDGSTRYT KA+ VMKP+SCNRFTHAILWSSDNGWKLEF NRRDWFIFKDLYKECSDRNIPCS AKAIPVP VSEVPGYVDSSG +F+RPD YI V
Subjt:  LVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIYV

Query:  NDDEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRK
         DDEVCRA+AK+ ANYDMDSDDE+WLSKFNDEL+ATD HHE +SVD+FELMVDAFEKGFFCNPDAFS+E+AP DIC HLGS+SIVESLF YWMKKRKQRK
Subjt:  NDDEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRK

Query:  TSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLAT
        +SLIRVFQAHQAKRKPPVIPK IMRRRRSFKRQPSQ G GR TQSS+LE I S+RDAME  QN VQKY+E KA AD+CVE+AV KRQRAQLLL+NADLAT
Subjt:  TSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLAT

Query:  YKAMTALRIAEAIQASELLEAETDADPATAAATAASCFLE
        YKAMTALRIAEAIQASELLEAE     A AAA  ASCFLE
Subjt:  YKAMTALRIAEAIQASELLEAETDADPATAAATAASCFLE

KAG7032944.1 hypothetical protein SDJN02_06995, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0080.95Show/hide
Query:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE--DACVAEVPE
        MPTVGMRRTRV GLKGVDG RVLRSGRRLC E  EAKLKK+KDV+DWYPV++ RGNGGGSGQVRFH KWQG+RNVKPKRVVVVNIREEE  DACVAEVP+
Subjt:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE--DACVAEVPE

Query:  PVKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSIS----PPPR
        PVKV  RINGDGE G VDRM+G+VYRRKRKRGLS+NGDVFDEM        DRMFGLRFIRRQRSRK  V HWEPTAGG S KL FH+QSIS    PPP+
Subjt:  PVKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSIS----PPPR

Query:  DRVLTIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHS
        DRVLT+F+GS+ DG GCFSDFMLSVLRH KSPELG+AK SAFLLS PIH VFAS+GMRFLQSYP  GSSGMCVIFGA+QSIPMFHLDFSAVP+ FM+LHS
Subjt:  DRVLTIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHS

Query:  KMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRM
         M  R TWIQA LVYNN QLDVDMSSD+E+D+ EE  VSS PG   LE K+M  GVDHTK+RSISHPS+RASRLGSRT+QYRNGFS RG++KRRSS R M
Subjt:  KMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRM

Query:  RRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWL
        RRPRSHSLAAMQKA GSL A+DMK  VSFPSAASC RH NS  R+SAGRIRE SS AL SAMDV SSCCNANILIVESDKCLR EG  IVLEFS+SCEWL
Subjt:  RRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWL

Query:  LVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIYV
        LVVKKDGSTRYT KA+ VMKP+SCNRFTHAILWSSDNGWKLEF NRRDWFIFKDLYKECSDRNIPCS AKAIPVP VSEVPGYVDSSG +F+RPD YI V
Subjt:  LVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIYV

Query:  NDDEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRK
         DDEVCRA+AK+ ANYDMDSDDE+WLSKFNDEL+ATD HHE +SVD+FELMVDAFEKGFFCNPDAFS+E+AP DIC HLGS+SIVESLF YWMKKRKQRK
Subjt:  NDDEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRK

Query:  TSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLAT
        +SLIRVFQAHQAKRKPPVIPK IMRRRRSFKRQPSQ G GR TQSS+LE I S+RDAME  QN VQKY+E KA AD+CVE+AV KRQRAQLLL+NADLAT
Subjt:  TSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLAT

Query:  YKAMTALRIAEAIQASELLEAETDADPATAAATAASCFLE
        YKAMTALRIAEAIQASELLEAE     A AAA  ASCFLE
Subjt:  YKAMTALRIAEAIQASELLEAETDADPATAAATAASCFLE

XP_022960781.1 uncharacterized protein LOC111461478 [Cucurbita moschata]0.0e+0080.83Show/hide
Query:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE--DACVAEVPE
        MPTVGMRRTRV GLKGVDG RVLRSGRRLC E  EAKLKK+KDV+DWYPV++ RGNGGGSGQVRFH KWQG+RNVKPKRVVVVNIREEE  DACVAEVP+
Subjt:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE--DACVAEVPE

Query:  PVKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSIS----PPPR
        PVKV  RINGDGE G VDRM+G+VYRRKRKRGLS+NGDVFDEM        DRMFGLRFIRRQRSRK  V HWEPTAGG S KL FH+QSIS    PPPR
Subjt:  PVKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSIS----PPPR

Query:  DRVLTIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHS
        DRVLT+F+GS+ DG GCFSDFMLSVLRH KSPELG+AK SAFLLS PIH VFAS+GMRFLQSYP  GSSGMCVIFGA+QSIPMFHLDFSAVP+ FM+LHS
Subjt:  DRVLTIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHS

Query:  KMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRM
         M  R TWIQA LVYNN QLDVDMSSD+E+D+ EE  VSS PG   LE K+M  GVDHTK+RSISHPS+RASRLGSRT+QYRNGFS RG++KRRSS R M
Subjt:  KMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRM

Query:  RRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWL
        R+PRSHSLAAMQKA GSL A+DMK  VSFPSAASC RH NS  R+SAG IRE SS AL SAMDV SSCCNANILIVESDKCLR EG  IVLEFS+SCEWL
Subjt:  RRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWL

Query:  LVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIYV
        LVVKKDGSTRYT KA+ VMKP+SCNRFTHAILWSSDNGWKLEF NRRDWFIFKDLYKECSDRNIPCS AKAIPVP VSEVPGYVDSSG +F+RPD YI V
Subjt:  LVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIYV

Query:  NDDEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRK
         DDEVCRA+AK+ ANYDMDSDDE+WLSKFNDELIATD HHE +SVD+FELMVDAFEKGFFCNPDAFS+E+AP DIC HLGS+SIVESLF YWMKKRKQRK
Subjt:  NDDEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRK

Query:  TSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLAT
        +SLIRVFQAHQAKRKPPVIPK IMRRRRSFKRQPSQ G GR TQSS+LE I SR+DAME  QN VQKY+E KAAAD+CVE+AV KRQRAQLLL+NADLAT
Subjt:  TSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLAT

Query:  YKAMTALRIAEAIQASELLEAETDADPATAAATAASCFLE
        YKAMTALRIAEAIQASELLE       A AAA  ASCFLE
Subjt:  YKAMTALRIAEAIQASELLEAETDADPATAAATAASCFLE

XP_022990242.1 uncharacterized protein LOC111487183 isoform X1 [Cucurbita maxima]0.0e+0081.83Show/hide
Query:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE-DACVAEVPEP
        MPTVGMRRTRV GLKGVDG RVLRSGRRLC E  EAKLKK+KDV+DW+PV+D RGNGGGSGQV FH KWQG+RNVKP RVVVVNIREEE DACVAEVP+P
Subjt:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE-DACVAEVPEP

Query:  VKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSIS------PPP
        VKV  RINGDGE G VDRMFG+VYRRKRKRGLS+NGDVFDEM        DRMFGLRFIRRQRSRK  V HWEPTAGG S KL FH+QSIS      PPP
Subjt:  VKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSIS------PPP

Query:  RDRVLTIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLH
        RDRVLTIFSGS+LDG GCFSDFMLSVLR+LKSPELG+AK SAFLLS+PIH VFAS+GMRFLQSYPP GSSGMCVIFGA+QSIPMFHLDFSAVP+ FM+LH
Subjt:  RDRVLTIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLH

Query:  SKMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRR
        S M  R TWIQA LVYNN QLDVDMSSD+E+D+ EE  VSS PG   LE K+M  GVDHTK+RSISHPS+RASRLGSRT+QYRNGFS RG++KRRSS R 
Subjt:  SKMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRR

Query:  MRRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEW
        MRRPRSHSLAAMQKA GSL A+DMK  VSFPSAASC RHKN   R+SAGRIRE SSTAL SAMDV SSCCNANILIVESDKCLR EGA IVLEFS+SCEW
Subjt:  MRRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEW

Query:  LLVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIY
        LLVVKKDGSTRYT KA+ VMKP+SCNRFTHAILWSSDNGWKLEF NRRDWFIFKDLYKECSDRNIPCS AKAIPVP VSEVPGYVDSSG +F+RPD YI 
Subjt:  LLVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIY

Query:  VNDDEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQR
        VNDDEVCRA+AK+ ANYDMDSDDE+WLSKFNDEL+ATD HHE +SVDNFELMVDAFEKGFF NPDAFS+E+APADIC HLGS+SIVESLF YW KKRKQR
Subjt:  VNDDEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQR

Query:  KTSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSG-SGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADL
        K+SLIRVFQAHQAKRKPPVIPK IMRRRRSFKRQPSQSG  GR TQSS+LE I SRRDAME  QN VQKY+E KAAAD+CVE+AV KRQRAQLLL+NADL
Subjt:  KTSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSG-SGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADL

Query:  ATYKAMTALRIAEAIQASELLEAETDADPATAAATAASCFLE
        ATYKAMTALRIAEAIQASELLEAE     A AAA  ASC+LE
Subjt:  ATYKAMTALRIAEAIQASELLEAETDADPATAAATAASCFLE

XP_023516479.1 uncharacterized protein LOC111780334 [Cucurbita pepo subsp. pepo]0.0e+0081.5Show/hide
Query:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE--DACVAEVPE
        MPTVGMRRTRV GLKGVDG RVLRSGRRLC E  EAKLKK+KDV+DWYPV++ RGNGGGSGQVRFH KWQG+RNVKPK VVVVNIREEE  DACVAEVP+
Subjt:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE--DACVAEVPE

Query:  PVKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSISPPPRDRVL
        PVKV  RINGDGE G VDRMFG+VYRRKRKRGLS+NGDVFDEM        DRMFGLRFIRRQRSRK  V HWEPTAGG S KL FH+QSISPPPRDRVL
Subjt:  PVKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSISPPPRDRVL

Query:  TIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHSKMFL
        T+F+GS+ DG GCFSDFMLSVLRH KSPELG+AK SAFLLS PIH VFAS+GMRFLQSYP  GSSGMCVIFGA+QSIPMFHLDFSAVP+ FM+LHS M  
Subjt:  TIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHSKMFL

Query:  RATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRMRRPR
        R TWIQA LVYNN QLDVDMSSD+ +D+ EE  VSS PG   LE K+M  GVDHTK+RSISHPS+RASRLGSRT+QYRNGFS RG++KRRSS R MRRPR
Subjt:  RATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRMRRPR

Query:  SHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWLLVVK
        SHSLAAMQKA GSL A+DMK  VSFPSAASC RHKN   R+SAGRIRE SSTAL SAMDV SSCCNANILIVESDKCLR EGA IVLEFS+SCEWLLVVK
Subjt:  SHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWLLVVK

Query:  KDGSTRYTHKAENV-MKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIYVNDD
        KDGSTRYT KA+ V MKP+SCNRFTHAILWSSDNGWKLEF NRRDWF+FKDLYKECSDRNIPCS AKAIPVP VSEVPGYVDSSG +F+RPD YI VNDD
Subjt:  KDGSTRYTHKAENV-MKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIYVNDD

Query:  EVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRKTSL
        EVCRA+AK+ ANYDMDSDDE+WLSKFNDEL+ATD HHE +S DNFELMVDAFEKGFFCNPDAFS+E+APADIC HLGS+SIVESLF YW KKRKQRK+SL
Subjt:  EVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRKTSL

Query:  IRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSG-SGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLATYK
        IRVFQAHQAKRKPPVIPK IMRRRRSFKRQPSQSG  GR TQSS+LE   SRRDAME  QN +QKY+E KAAAD+CVE+AV KRQRAQLLL+NADLATYK
Subjt:  IRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSG-SGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLATYK

Query:  AMTALRIAEAIQASELLEAETDADPATAAATAASCFLE
        AMTALRIAEAIQASELLEAE     A AAA  ASCFLE
Subjt:  AMTALRIAEAIQASELLEAETDADPATAAATAASCFLE

TrEMBL top hitse value%identityAlignment
A0A1S3C3K7 Enhancer of polycomb-like protein0.0e+0077.06Show/hide
Query:  MPTVGMRRTRVFGL-KGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEEDACVAEVPEP
        MP+ GMRRTRVFGL KG+DGARVLRSGRRL PE GE KLKKSKD +DWYP+ID RGNGGGSG  R H KW  VRNVKPKRVVVVNIRE++DACV +VPEP
Subjt:  MPTVGMRRTRVFGL-KGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEEDACVAEVPEP

Query:  VKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSISPPPRDRVLT
        VKV PRI  D +S  VDRMFGKVY RKRKRG  ++G+VFDEM+ DN +SGDRMFGLRFIRRQRSRK+DV HWE TAGGRST L FHRQSI P PRD  LT
Subjt:  VKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSISPPPRDRVLT

Query:  IFSGSNLDGGCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHSKMFLRA
        IF+GS++DGGCFSDF+L+VLRHLKSP L VAK SAFLLS+PI+GVFA +GMRFLQ YPPTGS GMC IFGA QSIPMFHLDFSAVP+ FM+L+S+MFLR 
Subjt:  IFSGSNLDGGCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHSKMFLRA

Query:  TWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRMRRPRSH
        TWIQA LVYNN QLDVD+SSDSE+++VEE   SS P    LE K MAF  D  KTRS+SHPS+R++RLG+RTMQYRNGFSSRG++KRRSS  R+RRPRSH
Subjt:  TWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRMRRPRSH

Query:  SLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKN----STLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWLLV
        SLAAMQK   SLA +D+K GVSFPS ASCNRHK+     ++R+SAGRI+E SST LGSAMDVDSSCCNANILIVE+DKCLR EGA IVLEFS+SCEWLLV
Subjt:  SLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKN----STLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWLLV

Query:  VKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSGA-FQRPDKYIYVND
        VKKDGSTRYTHKAE VMKPSS NRFTHAILWS DNGWKLEF NRRDWFIFKDLYKECSDRNIPCS AKAIPVPRVSEVP YV SSGA FQR D YI VND
Subjt:  VKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSGA-FQRPDKYIYVND

Query:  DEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRKTS
        DEVCRA+ K+ ANYDMDS+DE WL +FN+ LIATDKH E +S DNFEL VDAFEKGF+CNPDAFSDEKAPADIC  LGS SIVESL+TYW KKRKQRK+S
Subjt:  DEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRKTS

Query:  LIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAMEEQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLATYKA
        LIRVFQA+Q+KRKPP++PKP+MRR+RS KRQPSQSGS RT Q S+LEAI  RRDA+E+QNAVQKY+E+KAAA+KC+E+AV KRQRAQLLLENADLA YKA
Subjt:  LIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAMEEQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLATYKA

Query:  MTALRIAEAIQASELLEAETDADPATAAATAASCFLE
        M+ALRIAEAI+AS         D A AAATAA CFLE
Subjt:  MTALRIAEAIQASELLEAETDADPATAAATAASCFLE

A0A6J1FH41 Enhancer of polycomb-like protein0.0e+0078.32Show/hide
Query:  MPTVGMRRTRVFGL-KGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEEDACVAEVPEP
        MP+ GMRRTRVFGL KG+DGARVLRSGRRL PE GE KLKKSKD +DWYPVID+RGNGGGSGQVR H KW  VRNVKPKRVVVVNIREEEDACVA+VPEP
Subjt:  MPTVGMRRTRVFGL-KGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEEDACVAEVPEP

Query:  VKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSISPP-PRDRVL
        +K+ PRI  DGESG VDRMFGKVY RKRKRG  +NG  FDEM+GDNA+SGDRMFGLRFIRRQRSRKTD++HWEPTA GRSTKL FHR S+SPP P DRVL
Subjt:  VKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSISPP-PRDRVL

Query:  TIFSGSNLDGGCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHSKMFLR
        TIF+GS+++ GCFSDF+ SVLRHL SPEL VAK+++FLLS+ I+GVFAS GM FLQ YPPTGSSGMCVIFG+ Q IPMFHLDFSAVP  FMYLHS+MFLR
Subjt:  TIFSGSNLDGGCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHSKMFLR

Query:  ATWIQAHLVYNNKQLDVDMSSDSEDDN-VEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRMRRPR
         TWIQA LVYNN+QLDVDMSSDSE+D+ VEEQ VS+ P    L+ K +AFGVDHT  RS S  S+RASRLGSR +QYRNGFSSRG++KRRSS  RMRRPR
Subjt:  ATWIQAHLVYNNKQLDVDMSSDSEDDN-VEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRMRRPR

Query:  SHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWLLVVK
        SHSLAAMQK  G    +DMK  VSFPS ASCNRHKNS LR+S+G    VSSTALGSAMDVDSSCCNANILIVE+D+C+R EGA IVLEFS+SCEWLL VK
Subjt:  SHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWLLVVK

Query:  KDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSGA-FQRPDKYIYVNDDE
        K+GSTRYTHKAE VMKP+ CNRFTHAILWS+DNGWKLEF NRRDW IFKDLYKECSDRNIPC TAKAIPVPRVSEVP YVDSS   F+RPD YI VN+DE
Subjt:  KDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSGA-FQRPDKYIYVNDDE

Query:  VCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRKTSLI
        VCR  AK+ ANYDMDS+DE+WLSKFNDELIATDK HE +S D+FELM+DAFEK  FCNPDAFSDEKAP D+ M LGSRS VESLFTYW +KR+QRK+ LI
Subjt:  VCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRKTSLI

Query:  RVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAMEEQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLATYKAMT
        RVFQAHQ+KRKPPV+PKPIMRR+RS KRQPSQSGSGR TQSS+L+AIVSRRDA+EEQNAVQKY+EAKAAA++C+ESAV KRQRAQLLLENADLA YKA+ 
Subjt:  RVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAMEEQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLATYKAMT

Query:  ALRIAEAIQASELLEAETDADPATAAATAASCFLE
        ALRIAEAIQASEL E       A AAA AA+CFLE
Subjt:  ALRIAEAIQASELLEAETDADPATAAATAASCFLE

A0A6J1H8K0 Enhancer of polycomb-like protein0.0e+0080.83Show/hide
Query:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE--DACVAEVPE
        MPTVGMRRTRV GLKGVDG RVLRSGRRLC E  EAKLKK+KDV+DWYPV++ RGNGGGSGQVRFH KWQG+RNVKPKRVVVVNIREEE  DACVAEVP+
Subjt:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE--DACVAEVPE

Query:  PVKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSIS----PPPR
        PVKV  RINGDGE G VDRM+G+VYRRKRKRGLS+NGDVFDEM        DRMFGLRFIRRQRSRK  V HWEPTAGG S KL FH+QSIS    PPPR
Subjt:  PVKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSIS----PPPR

Query:  DRVLTIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHS
        DRVLT+F+GS+ DG GCFSDFMLSVLRH KSPELG+AK SAFLLS PIH VFAS+GMRFLQSYP  GSSGMCVIFGA+QSIPMFHLDFSAVP+ FM+LHS
Subjt:  DRVLTIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHS

Query:  KMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRM
         M  R TWIQA LVYNN QLDVDMSSD+E+D+ EE  VSS PG   LE K+M  GVDHTK+RSISHPS+RASRLGSRT+QYRNGFS RG++KRRSS R M
Subjt:  KMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRM

Query:  RRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWL
        R+PRSHSLAAMQKA GSL A+DMK  VSFPSAASC RH NS  R+SAG IRE SS AL SAMDV SSCCNANILIVESDKCLR EG  IVLEFS+SCEWL
Subjt:  RRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWL

Query:  LVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIYV
        LVVKKDGSTRYT KA+ VMKP+SCNRFTHAILWSSDNGWKLEF NRRDWFIFKDLYKECSDRNIPCS AKAIPVP VSEVPGYVDSSG +F+RPD YI V
Subjt:  LVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIYV

Query:  NDDEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRK
         DDEVCRA+AK+ ANYDMDSDDE+WLSKFNDELIATD HHE +SVD+FELMVDAFEKGFFCNPDAFS+E+AP DIC HLGS+SIVESLF YWMKKRKQRK
Subjt:  NDDEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRK

Query:  TSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLAT
        +SLIRVFQAHQAKRKPPVIPK IMRRRRSFKRQPSQ G GR TQSS+LE I SR+DAME  QN VQKY+E KAAAD+CVE+AV KRQRAQLLL+NADLAT
Subjt:  TSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLAT

Query:  YKAMTALRIAEAIQASELLEAETDADPATAAATAASCFLE
        YKAMTALRIAEAIQASELLE       A AAA  ASCFLE
Subjt:  YKAMTALRIAEAIQASELLEAETDADPATAAATAASCFLE

A0A6J1JI48 Enhancer of polycomb-like protein0.0e+0081.83Show/hide
Query:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE-DACVAEVPEP
        MPTVGMRRTRV GLKGVDG RVLRSGRRLC E  EAKLKK+KDV+DW+PV+D RGNGGGSGQV FH KWQG+RNVKP RVVVVNIREEE DACVAEVP+P
Subjt:  MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEE-DACVAEVPEP

Query:  VKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSIS------PPP
        VKV  RINGDGE G VDRMFG+VYRRKRKRGLS+NGDVFDEM        DRMFGLRFIRRQRSRK  V HWEPTAGG S KL FH+QSIS      PPP
Subjt:  VKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSIS------PPP

Query:  RDRVLTIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLH
        RDRVLTIFSGS+LDG GCFSDFMLSVLR+LKSPELG+AK SAFLLS+PIH VFAS+GMRFLQSYPP GSSGMCVIFGA+QSIPMFHLDFSAVP+ FM+LH
Subjt:  RDRVLTIFSGSNLDG-GCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLH

Query:  SKMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRR
        S M  R TWIQA LVYNN QLDVDMSSD+E+D+ EE  VSS PG   LE K+M  GVDHTK+RSISHPS+RASRLGSRT+QYRNGFS RG++KRRSS R 
Subjt:  SKMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRR

Query:  MRRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEW
        MRRPRSHSLAAMQKA GSL A+DMK  VSFPSAASC RHKN   R+SAGRIRE SSTAL SAMDV SSCCNANILIVESDKCLR EGA IVLEFS+SCEW
Subjt:  MRRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEW

Query:  LLVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIY
        LLVVKKDGSTRYT KA+ VMKP+SCNRFTHAILWSSDNGWKLEF NRRDWFIFKDLYKECSDRNIPCS AKAIPVP VSEVPGYVDSSG +F+RPD YI 
Subjt:  LLVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSG-AFQRPDKYIY

Query:  VNDDEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQR
        VNDDEVCRA+AK+ ANYDMDSDDE+WLSKFNDEL+ATD HHE +SVDNFELMVDAFEKGFF NPDAFS+E+APADIC HLGS+SIVESLF YW KKRKQR
Subjt:  VNDDEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQR

Query:  KTSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSG-SGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADL
        K+SLIRVFQAHQAKRKPPVIPK IMRRRRSFKRQPSQSG  GR TQSS+LE I SRRDAME  QN VQKY+E KAAAD+CVE+AV KRQRAQLLL+NADL
Subjt:  KTSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSG-SGRTTQSSLLEAIVSRRDAME-EQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADL

Query:  ATYKAMTALRIAEAIQASELLEAETDADPATAAATAASCFLE
        ATYKAMTALRIAEAIQASELLEAE     A AAA  ASC+LE
Subjt:  ATYKAMTALRIAEAIQASELLEAETDADPATAAATAASCFLE

A0A6J1JXY7 Enhancer of polycomb-like protein0.0e+0078.08Show/hide
Query:  MPTVGMRRTRVFGL-KGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEEDACVAEVPEP
        MP+ GMRRTRVFGL KG+DGARVLRSGRRL PE GE KLKKSKD +DWYPVID+RGNGGGSGQVR H KW  VRNVKPKRVVVVNIREEEDACV +VPEP
Subjt:  MPTVGMRRTRVFGL-KGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEEDACVAEVPEP

Query:  VKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSISPP-PRDRVL
        +KVFPRI   GESG VDRMFGKVY RKRKRG ++NG VFDEM+GDNA+SGDRMFGLRFIRRQRSRKTD++ WEPTA GRSTKL  HR SISPP P DRVL
Subjt:  VKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSISPP-PRDRVL

Query:  TIFSGSNLDGGCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHSKMFLR
        TIF+GS+++ GCFSDF+ SVLRHL SP+L VAK+++FLLS+ I+GVFAS GMRFLQ YPPTGSSGMCVIFG+ Q IP+FHLDFSAVP  FMYLHSKMFLR
Subjt:  TIFSGSNLDGGCFSDFMLSVLRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHSKMFLR

Query:  ATWIQAHLVYNNKQLDVDMSSDSEDDN-VEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRMRRPR
         TWIQA LVYNN+QLDVDMSSDSE+D+ VEEQ VS+ P R  L+ K++AFGVDHT TRS SH S+RASRLGSR +QYRNGFSSRG++KRRSS  RMRRPR
Subjt:  ATWIQAHLVYNNKQLDVDMSSDSEDDN-VEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRMRRPR

Query:  SHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWLLVVK
        SHSLAAMQK  G    +DMK  VSFPS ASCNRHKNS LR+S+GR   VSSTALGSAMDVDSSCCNANILIVE+D+C+R EGA IVLEFS+SCEWLL VK
Subjt:  SHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWLLVVK

Query:  KDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSGA-FQRPDKYIYVNDDE
        K+GSTRYTHK E VMKP+ CNRFTHAILWS+DNGWKLEF NRRDW IFKDLYKECSDRNIPC TAKAIPVPRVSEVP YVDSS   F+RPD YI VNDDE
Subjt:  KDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPGYVDSSGA-FQRPDKYIYVNDDE

Query:  VCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRKTSLI
        VCRA AK+ ANYDMDS+DE+WLSKFNDELIATDK H+ +S D+FELM+DAFEK  FCNPDAFSDEKAP D+ M LGSR  VESLFTYW +KR+QRK+ LI
Subjt:  VCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRKTSLI

Query:  RVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAMEEQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLATYKAMT
        RVFQAHQ+KRKPPV+PKPIMRR+RS KRQ SQ  SGR TQSS+L+AIVSRRDA+EEQNA+QKY++AKAAA++C+ESAV KRQRAQLLLENADLA YKA+ 
Subjt:  RVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAMEEQNAVQKYKEAKAAADKCVESAVVKRQRAQLLLENADLATYKAMT

Query:  ALRIAEAIQASELLEAETDADPATAAATAASCFLE
        ALRIAEAIQASEL  A      A AAA AA+CFLE
Subjt:  ALRIAEAIQASELLEAETDADPATAAATAASCFLE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G32620.1 Enhancer of polycomb-like transcription factor protein2.7e-3834.04Show/hide
Query:  SFPSAASCNRHKNSTLRESAGR--IREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWLLVVKKDGSTRYTHKAENVMKPSSCN
        S PS  S +R+K S L+    +   R  +    G   D++SS C+AN+L+   D+  R  GA I LE   + EW L VK  G+T+Y+H+A   ++P S N
Subjt:  SFPSAASCNRHKNSTLRESAGR--IREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWLLVVKKDGSTRYTHKAENVMKPSSCN

Query:  RFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVP--RVSEVPGYVDSSGAFQR-PDKYIYVNDDEVCRALAKTAANYDMDSDDE
        RFTHA++W     W LEF +R  WF+FK++++EC +RN   +  + IP+P  R+ E   +  +   F R   KY    + +V  AL  +   YDMDSDDE
Subjt:  RFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVP--RVSEVPGYVDSSGAFQR-PDKYIYVNDDEVCRALAKTAANYDMDSDDE

Query:  DWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRKTSLIRVFQ
          L +  +   A +     I+ D FE  +D FEK  F             ++   +GS   +E+++  W  KR+++   LIR  Q
Subjt:  DWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRKTSLIRVFQ

AT4G32620.1 Enhancer of polycomb-like transcription factor protein4.5e-0146Show/hide
Query:  KYKEAKAAADKCVESAVVKRQRAQLLLENADLATYKAMTALRIAEAIQAS
        K ++A  AA +    A +KR+RA+ L   ADLA  KA  AL  AEA++AS
Subjt:  KYKEAKAAADKCVESAVVKRQRAQLLLENADLATYKAMTALRIAEAIQAS

AT4G32620.2 Enhancer of polycomb-like transcription factor protein2.7e-3834.04Show/hide
Query:  SFPSAASCNRHKNSTLRESAGR--IREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWLLVVKKDGSTRYTHKAENVMKPSSCN
        S PS  S +R+K S L+    +   R  +    G   D++SS C+AN+L+   D+  R  GA I LE   + EW L VK  G+T+Y+H+A   ++P S N
Subjt:  SFPSAASCNRHKNSTLRESAGR--IREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWLLVVKKDGSTRYTHKAENVMKPSSCN

Query:  RFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVP--RVSEVPGYVDSSGAFQR-PDKYIYVNDDEVCRALAKTAANYDMDSDDE
        RFTHA++W     W LEF +R  WF+FK++++EC +RN   +  + IP+P  R+ E   +  +   F R   KY    + +V  AL  +   YDMDSDDE
Subjt:  RFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVP--RVSEVPGYVDSSGAFQR-PDKYIYVNDDEVCRALAKTAANYDMDSDDE

Query:  DWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRKTSLIRVFQ
          L +  +   A +     I+ D FE  +D FEK  F             ++   +GS   +E+++  W  KR+++   LIR  Q
Subjt:  DWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSDEKAPADICMHLGSRSIVESLFTYWMKKRKQRKTSLIRVFQ

AT4G32620.2 Enhancer of polycomb-like transcription factor protein4.5e-0146Show/hide
Query:  KYKEAKAAADKCVESAVVKRQRAQLLLENADLATYKAMTALRIAEAIQAS
        K ++A  AA +    A +KR+RA+ L   ADLA  KA  AL  AEA++AS
Subjt:  KYKEAKAAADKCVESAVVKRQRAQLLLENADLATYKAMTALRIAEAIQAS

AT5G04670.1 Enhancer of polycomb-like transcription factor protein5.6e-12939.28Show/hide
Query:  MPTVGMRR-TRVFG-LKGVDGARVLRSGRRLCPEPGEAKLKKSKDVT--DWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEEDACVAEV
        MP+VGMRR TRVFG +K  DGARVLRSGRR+ P  GE K++++ DV   D   V+ N+    G+   +  S     +   PK+V       E++  V + 
Subjt:  MPTVGMRR-TRVFG-LKGVDGARVLRSGRRLCPEPGEAKLKKSKDVT--DWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEEDACVAEV

Query:  PEPVKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSISPPPRDR
        P   +   R  G G+   VD+MFG VY RKRKR       + +    D   S + +  L+F RR+R     VS                           
Subjt:  PEPVKVFPRINGDGESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSISPPPRDR

Query:  VLTIFSGSNLDGGCFSDFMLSV----LRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLH
        VLT+     +D  C   + L+V    +R+++  EL ++ +++F LS PI+ VFA  G+RFL    P  S G+C  FGAM  +P+F  DF+ +P  FM +H
Subjt:  VLTIFSGSNLDGGCFSDFMLSV----LRHLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLH

Query:  SKMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRR
          +F+R   +     +  K L +  +   E D+  E A+   P  C      +  G+         HPS+RAS+L     QYR    S   QKRRSS RR
Subjt:  SKMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQAVSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRR

Query:  MRRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAM-DVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCE
         RR R+ S  A +   G+   +      +  +A S  + ++S L  S+     +S   +     ++DS CC+ANIL++ SD+C R EG  ++LE SSS E
Subjt:  MRRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESAGRIREVSSTALGSAM-DVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCE

Query:  WLLVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPG---YVDSSGAFQRPD-
        W LV+KKDG+ RY+H A+  M+P S NR THA +W   + WKLEF +R+DW  FKD+YKEC +RN+   + K IP+P V EV G   Y+D+  +F RP  
Subjt:  WLLVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYKECSDRNIPCSTAKAIPVPRVSEVPG---YVDSSGAFQRPD-

Query:  KYIYVNDDEVCRALAKTAANYDMDSDDEDWLSKFNDELI-ATDKHHEYISVDNFELMVDAFEKGFFCNP-DAFSDEKAPA-DICMHLGSRSIVESLFTYW
         YI VN+DEV RA+A++ A YDMDS+DE+WL + N +++   D  +  +  + FELM+D FEK  F +P D   DEKA       +LG + +VE++  YW
Subjt:  KYIYVNDDEVCRALAKTAANYDMDSDDEDWLSKFNDELI-ATDKHHEYISVDNFELMVDAFEKGFFCNP-DAFSDEKAPA-DICMHLGSRSIVESLFTYW

Query:  MKKRKQRKTSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAMEEQNAVQKYKEAKAAADKCVESAVVKRQRAQLLL
        +KKRKQRK  L+R+FQ HQ K K  ++ KP+ R+RRSFKRQ SQ   G+  Q+S     V   +  EE++ + + +EAK  ADK +E+A+ KR+RAQ+L 
Subjt:  MKKRKQRKTSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAMEEQNAVQKYKEAKAAADKCVESAVVKRQRAQLLL

Query:  ENADLATYKAMTALRIAEAIQASELLEAET
        ENADLA YKAM ALRIAEAI+ +E  E +T
Subjt:  ENADLATYKAMTALRIAEAIQASELLEAET


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGACGGTGGGGATGAGAAGGACGAGAGTTTTTGGTTTGAAGGGTGTTGATGGAGCTAGAGTTTTAAGGTCTGGAAGACGGCTTTGCCCTGAACCGGGTGAGGCGAA
GCTGAAGAAGTCTAAAGATGTCACTGATTGGTACCCTGTCATCGATAACAGAGGCAATGGGGGCGGAAGTGGCCAGGTTAGGTTCCATAGTAAGTGGCAAGGAGTACGAA
ATGTCAAACCAAAGAGGGTTGTCGTTGTTAACATTCGTGAGGAGGAGGATGCTTGTGTAGCGGAAGTGCCTGAACCAGTGAAGGTTTTTCCTAGGATTAATGGCGATGGT
GAATCTGGTGGTGTGGATAGGATGTTTGGGAAAGTTTATAGAAGAAAGAGGAAGAGGGGTCTTTCAGATAATGGTGATGTTTTTGATGAAATGGATGGTGATAATGCTAT
GTCAGGAGATAGGATGTTTGGACTCCGCTTCATTCGAAGACAGAGATCGAGGAAGACCGATGTTTCACATTGGGAGCCTACTGCAGGTGGCCGTTCGACTAAACTGCTTT
TCCATAGGCAGAGCATTTCACCGCCACCCCGGGATCGGGTTCTCACTATTTTTTCCGGGAGTAATCTTGATGGTGGCTGTTTTTCAGATTTTATGCTTTCGGTTCTTAGA
CACTTGAAGAGTCCGGAGCTGGGTGTGGCTAAGGTTTCTGCATTTTTATTGTCTGACCCAATCCATGGAGTTTTTGCTTCTCAGGGAATGCGTTTCTTGCAGAGTTATCC
TCCTACCGGAAGCTCTGGCATGTGTGTGATTTTTGGGGCCATGCAGTCGATTCCAATGTTTCATTTGGATTTTTCCGCTGTTCCTATCTCTTTTATGTACTTGCACTCCA
AGATGTTTCTTAGAGCGACTTGGATTCAGGCTCATCTTGTATATAATAACAAGCAGTTAGATGTAGATATGAGTAGCGATAGTGAAGACGACAACGTTGAAGAGCAAGCT
GTTTCCAGTGCTCCAGGAAGATGTCCTTTGGAAAGCAAAGCCATGGCCTTTGGAGTTGATCATACTAAGACCCGATCCATTTCACATCCATCTCTTAGAGCTTCAAGGTT
AGGTAGTCGGACCATGCAATACAGAAATGGTTTCAGCTCTCGTGGTCTACAGAAAAGGAGAAGTTCGCGTCGGCGGATGAGGAGACCTAGAAGCCATTCTCTTGCTGCTA
TGCAAAAAGCTTTTGGCTCTTTAGCAGCTGAAGATATGAAACACGGTGTATCTTTTCCTTCTGCTGCATCTTGTAACAGGCACAAGAACTCAACCCTGAGAGAGTCTGCT
GGGCGCATCAGAGAAGTGAGTTCTACTGCATTGGGATCAGCAATGGATGTTGACTCATCATGCTGCAATGCAAATATATTAATTGTAGAATCTGACAAATGTTTGAGAGT
AGAGGGAGCCATTATTGTGCTAGAGTTTTCTTCTTCCTGTGAATGGCTTCTAGTTGTCAAGAAAGATGGTTCAACTAGATACACCCACAAAGCAGAAAACGTCATGAAGC
CCTCTTCTTGCAATCGTTTTACACATGCAATATTGTGGTCATCAGATAATGGTTGGAAGCTAGAGTTTTCAAATCGAAGGGATTGGTTCATTTTCAAGGATCTATACAAG
GAGTGTTCTGATCGCAATATTCCATGTTCTACTGCTAAAGCTATTCCTGTGCCCAGAGTGTCTGAAGTTCCAGGTTATGTCGATAGTAGTGGTGCTTTTCAAAGGCCAGA
TAAGTACATCTACGTAAACGATGATGAGGTATGTAGAGCTTTGGCAAAGACTGCAGCAAACTACGACATGGACTCTGATGATGAGGATTGGTTAAGCAAGTTCAACGACG
AGCTTATTGCAACAGACAAGCACCATGAATATATTTCAGTGGATAATTTTGAGTTGATGGTCGATGCTTTTGAGAAGGGATTTTTCTGTAATCCCGATGCCTTCTCCGAT
GAGAAAGCACCTGCTGATATATGCATGCATCTCGGTAGCCGGTCGATAGTCGAGTCTTTATTTACTTATTGGATGAAGAAACGGAAACAGAGAAAAACATCTTTGATTAG
GGTTTTCCAGGCTCATCAGGCAAAGAGGAAACCTCCAGTGATTCCTAAACCTATCATGCGAAGAAGAAGATCATTCAAAAGGCAGCCTAGCCAATCTGGGAGTGGGAGAA
CTACCCAATCAAGTCTTTTAGAAGCCATAGTCTCGAGACGAGATGCTATGGAGGAACAGAATGCAGTTCAAAAGTACAAAGAAGCGAAGGCAGCAGCAGACAAATGCGTC
GAAAGCGCTGTTGTTAAGCGCCAAAGGGCTCAGTTGCTGTTGGAGAATGCAGATTTAGCAACTTACAAAGCCATGACAGCACTCAGAATTGCCGAAGCAATTCAAGCATC
AGAATTGCTGGAAGCTGAAACTGATGCTGATCCGGCGACTGCTGCTGCTACTGCTGCATCTTGTTTTCTCGAATGA
mRNA sequenceShow/hide mRNA sequence
GCAAATCAATAAAAAAAAAAAAGAAAAAAAGAAAAAAAAGAAGCCCTAGAAGCTGAATCGAAAAACGCTTTTCCGTTCCAAATTTTCTCTTCACCCCCTAGAACGAATCT
GACCGTCGATCTTTTCATCCAACGGCTCCCCACAGTCGGCGACCGCTATTTTCCCCTTCTTTCCGCCCCCACGTCCAGTGGTTTTTATCATAAACCCTAGCCGTCTTCTC
TCTCTCTATTCTGCTCTGTCAGTTTGCATGAGACTCTCTCACTCTCAGCGATTCTCGATCATTCTTTCGGATCTAAACTTTGGGGTTCTTTTTTGGGTTCTCCCTCTTTC
TTTTTCTCCAACTTCTTTTTCGGGTTTTTCATAGCCTCTGTAAATGGATCTTTGATTCTTTTTTTTTTCTTTTTTCTTTTTGTTTTCCCTTTTGTTTTTGGGTTTTTGGT
TTGAATCTAGATTATCGTAGAGACAGTGTCCGATTCAGTGTAGATTTTTGTAATTTTCTCTTATTTCTGCATGTTTCTCTGTTTGTTTCCGTCGTTTGATTTTTAATTTT
TCCCCCCTTTCGAATTCTTTGGCGCCAAGTCTCGATTTCGAGTTGGATTGCATTTGAAAATTGTGGCTCATCGTTGTATACAGACACTGGTTTGATTGAACGAGATGGAT
GCTTGTTAAGCACTGGCGACGGCCGCCGATTCCGAGGCCGGTTCCGAGTTTGGTGACGGAGTCAATTCGAGAAAGGTCGATTGTCGAGTGGTGTTGATTCTTATAACCGT
ACGGCAATGCCGACGGTGGGGATGAGAAGGACGAGAGTTTTTGGTTTGAAGGGTGTTGATGGAGCTAGAGTTTTAAGGTCTGGAAGACGGCTTTGCCCTGAACCGGGTGA
GGCGAAGCTGAAGAAGTCTAAAGATGTCACTGATTGGTACCCTGTCATCGATAACAGAGGCAATGGGGGCGGAAGTGGCCAGGTTAGGTTCCATAGTAAGTGGCAAGGAG
TACGAAATGTCAAACCAAAGAGGGTTGTCGTTGTTAACATTCGTGAGGAGGAGGATGCTTGTGTAGCGGAAGTGCCTGAACCAGTGAAGGTTTTTCCTAGGATTAATGGC
GATGGTGAATCTGGTGGTGTGGATAGGATGTTTGGGAAAGTTTATAGAAGAAAGAGGAAGAGGGGTCTTTCAGATAATGGTGATGTTTTTGATGAAATGGATGGTGATAA
TGCTATGTCAGGAGATAGGATGTTTGGACTCCGCTTCATTCGAAGACAGAGATCGAGGAAGACCGATGTTTCACATTGGGAGCCTACTGCAGGTGGCCGTTCGACTAAAC
TGCTTTTCCATAGGCAGAGCATTTCACCGCCACCCCGGGATCGGGTTCTCACTATTTTTTCCGGGAGTAATCTTGATGGTGGCTGTTTTTCAGATTTTATGCTTTCGGTT
CTTAGACACTTGAAGAGTCCGGAGCTGGGTGTGGCTAAGGTTTCTGCATTTTTATTGTCTGACCCAATCCATGGAGTTTTTGCTTCTCAGGGAATGCGTTTCTTGCAGAG
TTATCCTCCTACCGGAAGCTCTGGCATGTGTGTGATTTTTGGGGCCATGCAGTCGATTCCAATGTTTCATTTGGATTTTTCCGCTGTTCCTATCTCTTTTATGTACTTGC
ACTCCAAGATGTTTCTTAGAGCGACTTGGATTCAGGCTCATCTTGTATATAATAACAAGCAGTTAGATGTAGATATGAGTAGCGATAGTGAAGACGACAACGTTGAAGAG
CAAGCTGTTTCCAGTGCTCCAGGAAGATGTCCTTTGGAAAGCAAAGCCATGGCCTTTGGAGTTGATCATACTAAGACCCGATCCATTTCACATCCATCTCTTAGAGCTTC
AAGGTTAGGTAGTCGGACCATGCAATACAGAAATGGTTTCAGCTCTCGTGGTCTACAGAAAAGGAGAAGTTCGCGTCGGCGGATGAGGAGACCTAGAAGCCATTCTCTTG
CTGCTATGCAAAAAGCTTTTGGCTCTTTAGCAGCTGAAGATATGAAACACGGTGTATCTTTTCCTTCTGCTGCATCTTGTAACAGGCACAAGAACTCAACCCTGAGAGAG
TCTGCTGGGCGCATCAGAGAAGTGAGTTCTACTGCATTGGGATCAGCAATGGATGTTGACTCATCATGCTGCAATGCAAATATATTAATTGTAGAATCTGACAAATGTTT
GAGAGTAGAGGGAGCCATTATTGTGCTAGAGTTTTCTTCTTCCTGTGAATGGCTTCTAGTTGTCAAGAAAGATGGTTCAACTAGATACACCCACAAAGCAGAAAACGTCA
TGAAGCCCTCTTCTTGCAATCGTTTTACACATGCAATATTGTGGTCATCAGATAATGGTTGGAAGCTAGAGTTTTCAAATCGAAGGGATTGGTTCATTTTCAAGGATCTA
TACAAGGAGTGTTCTGATCGCAATATTCCATGTTCTACTGCTAAAGCTATTCCTGTGCCCAGAGTGTCTGAAGTTCCAGGTTATGTCGATAGTAGTGGTGCTTTTCAAAG
GCCAGATAAGTACATCTACGTAAACGATGATGAGGTATGTAGAGCTTTGGCAAAGACTGCAGCAAACTACGACATGGACTCTGATGATGAGGATTGGTTAAGCAAGTTCA
ACGACGAGCTTATTGCAACAGACAAGCACCATGAATATATTTCAGTGGATAATTTTGAGTTGATGGTCGATGCTTTTGAGAAGGGATTTTTCTGTAATCCCGATGCCTTC
TCCGATGAGAAAGCACCTGCTGATATATGCATGCATCTCGGTAGCCGGTCGATAGTCGAGTCTTTATTTACTTATTGGATGAAGAAACGGAAACAGAGAAAAACATCTTT
GATTAGGGTTTTCCAGGCTCATCAGGCAAAGAGGAAACCTCCAGTGATTCCTAAACCTATCATGCGAAGAAGAAGATCATTCAAAAGGCAGCCTAGCCAATCTGGGAGTG
GGAGAACTACCCAATCAAGTCTTTTAGAAGCCATAGTCTCGAGACGAGATGCTATGGAGGAACAGAATGCAGTTCAAAAGTACAAAGAAGCGAAGGCAGCAGCAGACAAA
TGCGTCGAAAGCGCTGTTGTTAAGCGCCAAAGGGCTCAGTTGCTGTTGGAGAATGCAGATTTAGCAACTTACAAAGCCATGACAGCACTCAGAATTGCCGAAGCAATTCA
AGCATCAGAATTGCTGGAAGCTGAAACTGATGCTGATCCGGCGACTGCTGCTGCTACTGCTGCATCTTGTTTTCTCGAATGAGATTTGATTTTCGATAACGGCGGGATTC
ATTCATAACTATGGATGTTTTCTTTATGAAGATCCCAATTAATTATTGTATAGTATTTCATTAATGTGTACATTCTATTTTGTATGATTAGAAGAATCTCTCACACCCCT
CATTATGTACAATGAATGAGACAGTCCTGATTTTTCCTAGGTCCAATAATATACGAGGTAAGTTTAGTAGTGTAATTGGTTAGTTTATGTATCTGAATATGATGCTCTAC
TGTACAATTTTAAAAACATCAAATAACACATAATAAAAGAGGACT
Protein sequenceShow/hide protein sequence
MPTVGMRRTRVFGLKGVDGARVLRSGRRLCPEPGEAKLKKSKDVTDWYPVIDNRGNGGGSGQVRFHSKWQGVRNVKPKRVVVVNIREEEDACVAEVPEPVKVFPRINGDG
ESGGVDRMFGKVYRRKRKRGLSDNGDVFDEMDGDNAMSGDRMFGLRFIRRQRSRKTDVSHWEPTAGGRSTKLLFHRQSISPPPRDRVLTIFSGSNLDGGCFSDFMLSVLR
HLKSPELGVAKVSAFLLSDPIHGVFASQGMRFLQSYPPTGSSGMCVIFGAMQSIPMFHLDFSAVPISFMYLHSKMFLRATWIQAHLVYNNKQLDVDMSSDSEDDNVEEQA
VSSAPGRCPLESKAMAFGVDHTKTRSISHPSLRASRLGSRTMQYRNGFSSRGLQKRRSSRRRMRRPRSHSLAAMQKAFGSLAAEDMKHGVSFPSAASCNRHKNSTLRESA
GRIREVSSTALGSAMDVDSSCCNANILIVESDKCLRVEGAIIVLEFSSSCEWLLVVKKDGSTRYTHKAENVMKPSSCNRFTHAILWSSDNGWKLEFSNRRDWFIFKDLYK
ECSDRNIPCSTAKAIPVPRVSEVPGYVDSSGAFQRPDKYIYVNDDEVCRALAKTAANYDMDSDDEDWLSKFNDELIATDKHHEYISVDNFELMVDAFEKGFFCNPDAFSD
EKAPADICMHLGSRSIVESLFTYWMKKRKQRKTSLIRVFQAHQAKRKPPVIPKPIMRRRRSFKRQPSQSGSGRTTQSSLLEAIVSRRDAMEEQNAVQKYKEAKAAADKCV
ESAVVKRQRAQLLLENADLATYKAMTALRIAEAIQASELLEAETDADPATAAATAASCFLE