CuGenDBv2

HG10004138 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10004138
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Enhancer of polycomb-like protein
Genome location: Chr08:14094881..14098250
RNA-Seq Expression: HG10004138
Synteny: HG10004138
Gene Ontology terms:
    GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
    GO:0016573 - histone acetylation (biological process)
    GO:0032777 - Piccolo NuA4 histone acetyltransferase complex (cellular component)
    GO:0004402 - histone acetyltransferase activity (molecular function)
InterPro domains:
    IPR019542 - Enhancer of polycomb-like, N-terminal
    IPR024943 - Enhancer of polycomb protein
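
The genome location listed for this gene (Chr08:14094881..14098250) follows a chromosome:start..end pattern. As a minimal sketch, the string can be split into its parts and the gene span computed. The function names `parse_location` and `span_length` are hypothetical, and the span assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("Chr08:14094881..14098250")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 3370 bp; if the coordinates were 0-based half-open, the span would be one base shorter.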


Homology
GenBank top hits (e value, %identity, alignment)
XP_004140897.1 uncharacterized protein LOC101207239 [Cucumis sativus] (e value: 0.0e+00; %identity: 85.85)
Query:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
        MPSGMRRTRVFGLVKG DGARVLRSGRRLWPESGEVK+KKSKDASDWYP+I+ RGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
Subjt:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV

Query:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF
        KV PRIG++ KS   DRMFGKVY+RKRKRGRLE+GE F EMESDNVLSGDRMFGLRF RRQRSRKTDVE WESTAGGR++ LHF RQRI  PRD  LTIF
Subjt:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF

Query:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW
        AGSS+DGGCFSDFIL+VLRH KSP LSVAKFSAFLLSNPIN VFALKGMRFLQGYPP G  GM  IFG RQSIPMFHLDFSA+PLPFMFL+S+MFLRVT 
Subjt:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW

Query:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA
        IQARLVYNNNQLDVDISSDSEEDSVEELHV S PVSSLERKPMAF  D PK RSVSHPSVRA+RLG+RTMQYRNGFSSRGIRKRRSSLR+RR    SLAA
Subjt:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA

Query:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQNSVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKDGSTR
        MQK+IG LAVDD+K  VSFPS ASCNRH++S +RDSAGRIRE +STALGS+MDVDSSCC ANILIVEADKC REEGAN++LEFS+SCEWLLVVKKDGSTR
Subjt:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQNSVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKDGSTR

Query:  YTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVSRAMT
        YTHKAERVMKPSSCNRFTHAILWS+DN WKLEFPNRRDWFIFKDLYKECSDRNIPC IAKAIPVPRVSEVPDYVDSSG SFQRPDTYISVNDDEV RAMT
Subjt:  YTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVSRAMT

Query:  KGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRVFQAH
        K TANYDMDS+DEEWLIEFNDGLIAT+KH EC SE+ FE MVD FEKGF+CNPDAFSDEKAPADIC  L+S  IVESLY YWTKKR+QRKSSLIRVFQA+
Subjt:  KGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRVFQAH

Query:  QSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLALRIAE
        QSKRKPP+VPKP+MRR+RSLKRQPSQSGSGRT Q SILEAI+ RRDA+EDQNA+QKYEE+KAA EKCIENAV+KRQRAQLLLENADLA YKAM ALRIAE
Subjt:  QSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLALRIAE

Query:  AIQASESPEA---AASCFLE
        AI+ S+SPEA   AA+CFLE
Subjt:  AIQASESPEA---AASCFLE

XP_008456589.1 PREDICTED: uncharacterized protein LOC103496500 isoform X1 [Cucumis melo] (e value: 0.0e+00; %identity: 85.25)
Query:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
        MPSGMRRTRVFGLVKG+DGARVLRSGRRLWPESGEVK+KKSKDASDWYP+I+ RGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
Subjt:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV

Query:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF
        KVLPRIG++ KS D DRMFGKVY+RKRKRGRLE+GE F EMESDNVLSGDRMFGLRF RRQRSRK+DVE WESTAGGRS  LHF RQ I  PRD  LTIF
Subjt:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF

Query:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW
        AGSS+DGGCFSDFIL+VLRHLKSP LSVAKFSAFLLSNPINGVFALKGMRFLQGYPP GS GMC IFG RQSIPMFHLDFSAVPLPFMFL+S+MFLRVTW
Subjt:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW

Query:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA
        IQARLVYNNNQLDVD+SSDSEE+SVEEL  SSPPVSSLERKPMAF  D PK RSVSHPSVR++RLG+RTMQYRNGFSSRGIRKRRSSLR+RR    SLAA
Subjt:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA

Query:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQN----SVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKD
        MQK I SLAVDD+K  VSFPS ASCNRH++      +RDSAGRI+E SST LGS+MDVDSSCCNANILIVEADKC REEGAN++LEFS+SCEWLLVVKKD
Subjt:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQN----SVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKD

Query:  GSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVS
        GSTRYTHKAERVMKPSS NRFTHAILWSVDN WKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYV SSG SFQR DTYISVNDDEV 
Subjt:  GSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVS

Query:  RAMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRV
        RAMTK TANYDMDS+DE WL+EFN+GLIAT+KH EC+SE+ FEL VD FEKGF+CNPDAFSDEKAPADIC  L S  IVESLY YWTKKR+QRKSSLIRV
Subjt:  RAMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRV

Query:  FQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILE----AIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKA
        FQA+QSKRKPP+VPKP+MRR+RSLKRQPSQSGS RT Q SILE    AI  RRDA+EDQNAVQKYEE+KAAAEKCIENAVNKRQRAQLLLENADLA YKA
Subjt:  FQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILE----AIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKA

Query:  MLALRIAEAIQASESPEAA--ASCFLE
        M ALRIAEAI+AS+S EAA  A+CFLE
Subjt:  MLALRIAEAIQASESPEAA--ASCFLE

XP_008456590.1 PREDICTED: uncharacterized protein LOC103496500 isoform X2 [Cucumis melo] (e value: 0.0e+00; %identity: 85.66)
Query:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
        MPSGMRRTRVFGLVKG+DGARVLRSGRRLWPESGEVK+KKSKDASDWYP+I+ RGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
Subjt:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV

Query:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF
        KVLPRIG++ KS D DRMFGKVY+RKRKRGRLE+GE F EMESDNVLSGDRMFGLRF RRQRSRK+DVE WESTAGGRS  LHF RQ I  PRD  LTIF
Subjt:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF

Query:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW
        AGSS+DGGCFSDFIL+VLRHLKSP LSVAKFSAFLLSNPINGVFALKGMRFLQGYPP GS GMC IFG RQSIPMFHLDFSAVPLPFMFL+S+MFLRVTW
Subjt:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW

Query:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA
        IQARLVYNNNQLDVD+SSDSEE+SVEEL  SSPPVSSLERKPMAF  D PK RSVSHPSVR++RLG+RTMQYRNGFSSRGIRKRRSSLR+RR    SLAA
Subjt:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA

Query:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQN----SVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKD
        MQK I SLAVDD+K  VSFPS ASCNRH++      +RDSAGRI+E SST LGS+MDVDSSCCNANILIVEADKC REEGAN++LEFS+SCEWLLVVKKD
Subjt:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQN----SVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKD

Query:  GSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVS
        GSTRYTHKAERVMKPSS NRFTHAILWSVDN WKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYV SSG SFQR DTYISVNDDEV 
Subjt:  GSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVS

Query:  RAMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRV
        RAMTK TANYDMDS+DE WL+EFN+GLIAT+KH EC+SE+ FEL VD FEKGF+CNPDAFSDEKAPADIC  L S  IVESLY YWTKKR+QRKSSLIRV
Subjt:  RAMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRV

Query:  FQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLAL
        FQA+QSKRKPP+VPKP+MRR+RSLKRQPSQSGS RT Q SILEAI  RRDA+EDQNAVQKYEE+KAAAEKCIENAVNKRQRAQLLLENADLA YKAM AL
Subjt:  FQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLAL

Query:  RIAEAIQASESPEAA--ASCFLE
        RIAEAI+AS+S EAA  A+CFLE
Subjt:  RIAEAIQASESPEAA--ASCFLE

XP_023550905.1 uncharacterized protein LOC111808903 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; %identity: 81.36)
Query:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
        MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVK+KKSKDASDWYPVI+SRGNGGGSG  RLHGKWTQVRNVKPKRVVVVNIRE++DACVVKVPEP+
Subjt:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV

Query:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISP--PRDRVLT
        KVLPRIGS+G+SGD DRMFGKVY+RKRKRGR ENG  F EME DN +SGDRMFGLRF RRQRSRKTD+  WE TA GRS KLHF R  +SP  PRDRVLT
Subjt:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISP--PRDRVLT

Query:  IFAGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRV
        IFAGSS++ GCFSDFI SVLRHL SP+L+VAK S+FLLSN INGVFA  GMRFLQGYPP GSSGMCVIFG RQ IPMFHLDFSAVP PFM+LHSKMFLR 
Subjt:  IFAGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRV

Query:  TWIQARLVYNNNQLDVDISSDSEEDS-VEELHVSSPPV-SSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----
        T IQARLVYNN QLDVD+SSDSEEDS VEE HVS+PPV SSL+ K +AFGVDH   RS S  SVRASRLGSR +QYRNGFSSRGIRKRRSSLRMRR    
Subjt:  TWIQARLVYNNNQLDVDISSDSEEDS-VEELHVSSPPV-SSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----

Query:  SLAAMQKTIGSLAVDDMKHSVSFPSAASCNRHQNSVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKD
        SLAAMQKT+G   +DDMK SVSFPS ASCNRH+NS LRDS+GR   VSSTALGS+MDVDSSCCNANILIVEAD+C REEGAN++LEFS+SCEWLL VKK+
Subjt:  SLAAMQKTIGSLAVDDMKHSVSFPSAASCNRHQNSVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKD

Query:  GSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVS
        GSTRYTHKAE VMKP+ CNRFTHAILWS DN WKLEFPNRRDW IFKDLYKECSDRNIPC  AKAIPVPRVSEVPDYVDSS   F+RPDTYISVNDDEV 
Subjt:  GSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVS

Query:  RAMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRV
        RA  K TANYDMDS+DEEWL +FND LIAT+K HEC+S + FELM+D FEK  FCNPDAFSDEKAP D+ M L SR  VESL+ YWT+KRRQRKS LIRV
Subjt:  RAMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRV

Query:  FQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLAL
        FQAHQSKRKPPVVPKPIMRR+RS+KRQPSQSGSGR TQSSIL+AI+SRRDA+E+QNAVQKYEEAKAAAE+C+E+AV+KRQRAQLLLENADLAAYKA++AL
Subjt:  FQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLAL

Query:  RIAEAIQASESPE-----AAASCFLE
        RIAEAIQASE PE     AAA+CFLE
Subjt:  RIAEAIQASESPE-----AAASCFLE

XP_038884677.1 uncharacterized protein LOC120075394 [Benincasa hispida] (e value: 0.0e+00; %identity: 89.47)
Query:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
        MPSGMRRTRVFGLVKG+DGARVLRSGRRLWPESGEVKVKKSKDA+DWYPVIESRGN    GHGRLHGKWT VRNVKPKR VVV+ R DDDACVVKVPEPV
Subjt:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV

Query:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF
        KV PRI ++GKSGDGDRMFGKVYTRKRKRGRLENGE F EMESDNVLSGDRMFGLRF RRQRSRKTDVE WESTAGGRS KLHFRRQRIS PRD+VLTIF
Subjt:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF

Query:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW
        AGSSLDGGCFSDFILSVLRHLKSPDLS+AKFSAFLLSNPINGVFALKGMRFLQGYPP GSSGM VIFG RQSIPMFHLDFSAVPLPFMFLHS+MFLRVTW
Subjt:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW

Query:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA
        IQARLVYNNNQLDVDISSDSEED VEELHVSS  VSSLE KPMAFG D PK RSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRM+R    SLAA
Subjt:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA

Query:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQNSVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKDGSTR
        +QKTIGSLAVDD+K SV+F S ASCNRH+NS  RDSAGRIREVSSTALGS+MDVDSSCCNANILI+E+DKC REEGA+++LE S+SCEWLL +KKDGSTR
Subjt:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQNSVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKDGSTR

Query:  YTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVSRAMT
        YTHKAERVMKPSSCNRFTHAILWSVDN WKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEV DYVDSSGV F R DTYISVNDDEV RAMT
Subjt:  YTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVSRAMT

Query:  KGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRVFQAH
        K TANYDMDSDDEEWLIEFND LIAT+KH ECISEE FELM+D FEK FFCNPDAFSDEKAPADICMHL SR IVESLYAYWTKKR+QRKSSLIRVFQAH
Subjt:  KGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRVFQAH

Query:  QSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLALRIAE
        QSKRKPPVVPKPIMRRRRSLKRQPSQSG+GRTTQ SILEAIISRRD MEDQNA+QKYEEAKAAAEKC ENAVNKRQRAQLLLENADLAAYKAMLALRIAE
Subjt:  QSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLALRIAE

Query:  AIQASESPEAAASCFLE
        AIQAS S  A A+CFLE
Subjt:  AIQASESPEAAASCFLE

TrEMBL top hits (e value, %identity, alignment)
A0A0A0K9C9 Enhancer of polycomb-like protein (e value: 0.0e+00; %identity: 85.85)
Query:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
        MPSGMRRTRVFGLVKG DGARVLRSGRRLWPESGEVK+KKSKDASDWYP+I+ RGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
Subjt:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV

Query:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF
        KV PRIG++ KS   DRMFGKVY+RKRKRGRLE+GE F EMESDNVLSGDRMFGLRF RRQRSRKTDVE WESTAGGR++ LHF RQRI  PRD  LTIF
Subjt:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF

Query:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW
        AGSS+DGGCFSDFIL+VLRH KSP LSVAKFSAFLLSNPIN VFALKGMRFLQGYPP G  GM  IFG RQSIPMFHLDFSA+PLPFMFL+S+MFLRVT 
Subjt:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW

Query:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA
        IQARLVYNNNQLDVDISSDSEEDSVEELHV S PVSSLERKPMAF  D PK RSVSHPSVRA+RLG+RTMQYRNGFSSRGIRKRRSSLR+RR    SLAA
Subjt:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA

Query:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQNSVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKDGSTR
        MQK+IG LAVDD+K  VSFPS ASCNRH++S +RDSAGRIRE +STALGS+MDVDSSCC ANILIVEADKC REEGAN++LEFS+SCEWLLVVKKDGSTR
Subjt:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQNSVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKDGSTR

Query:  YTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVSRAMT
        YTHKAERVMKPSSCNRFTHAILWS+DN WKLEFPNRRDWFIFKDLYKECSDRNIPC IAKAIPVPRVSEVPDYVDSSG SFQRPDTYISVNDDEV RAMT
Subjt:  YTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVSRAMT

Query:  KGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRVFQAH
        K TANYDMDS+DEEWLIEFNDGLIAT+KH EC SE+ FE MVD FEKGF+CNPDAFSDEKAPADIC  L+S  IVESLY YWTKKR+QRKSSLIRVFQA+
Subjt:  KGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRVFQAH

Query:  QSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLALRIAE
        QSKRKPP+VPKP+MRR+RSLKRQPSQSGSGRT Q SILEAI+ RRDA+EDQNA+QKYEE+KAA EKCIENAV+KRQRAQLLLENADLA YKAM ALRIAE
Subjt:  QSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLALRIAE

Query:  AIQASESPEA---AASCFLE
        AI+ S+SPEA   AA+CFLE
Subjt:  AIQASESPEA---AASCFLE

A0A1S3C3K7 Enhancer of polycomb-like protein (e value: 0.0e+00; %identity: 85.66)
Query:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
        MPSGMRRTRVFGLVKG+DGARVLRSGRRLWPESGEVK+KKSKDASDWYP+I+ RGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
Subjt:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV

Query:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF
        KVLPRIG++ KS D DRMFGKVY+RKRKRGRLE+GE F EMESDNVLSGDRMFGLRF RRQRSRK+DVE WESTAGGRS  LHF RQ I  PRD  LTIF
Subjt:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF

Query:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW
        AGSS+DGGCFSDFIL+VLRHLKSP LSVAKFSAFLLSNPINGVFALKGMRFLQGYPP GS GMC IFG RQSIPMFHLDFSAVPLPFMFL+S+MFLRVTW
Subjt:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW

Query:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA
        IQARLVYNNNQLDVD+SSDSEE+SVEEL  SSPPVSSLERKPMAF  D PK RSVSHPSVR++RLG+RTMQYRNGFSSRGIRKRRSSLR+RR    SLAA
Subjt:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA

Query:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQN----SVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKD
        MQK I SLAVDD+K  VSFPS ASCNRH++      +RDSAGRI+E SST LGS+MDVDSSCCNANILIVEADKC REEGAN++LEFS+SCEWLLVVKKD
Subjt:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQN----SVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKD

Query:  GSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVS
        GSTRYTHKAERVMKPSS NRFTHAILWSVDN WKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYV SSG SFQR DTYISVNDDEV 
Subjt:  GSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVS

Query:  RAMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRV
        RAMTK TANYDMDS+DE WL+EFN+GLIAT+KH EC+SE+ FEL VD FEKGF+CNPDAFSDEKAPADIC  L S  IVESLY YWTKKR+QRKSSLIRV
Subjt:  RAMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRV

Query:  FQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLAL
        FQA+QSKRKPP+VPKP+MRR+RSLKRQPSQSGS RT Q SILEAI  RRDA+EDQNAVQKYEE+KAAAEKCIENAVNKRQRAQLLLENADLA YKAM AL
Subjt:  FQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLAL

Query:  RIAEAIQASESPEAA--ASCFLE
        RIAEAI+AS+S EAA  A+CFLE
Subjt:  RIAEAIQASESPEAA--ASCFLE

A0A1S3C3P3 Enhancer of polycomb-like protein (e value: 0.0e+00; %identity: 85.25)
Query:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
        MPSGMRRTRVFGLVKG+DGARVLRSGRRLWPESGEVK+KKSKDASDWYP+I+ RGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
Subjt:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV

Query:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF
        KVLPRIG++ KS D DRMFGKVY+RKRKRGRLE+GE F EMESDNVLSGDRMFGLRF RRQRSRK+DVE WESTAGGRS  LHF RQ I  PRD  LTIF
Subjt:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF

Query:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW
        AGSS+DGGCFSDFIL+VLRHLKSP LSVAKFSAFLLSNPINGVFALKGMRFLQGYPP GS GMC IFG RQSIPMFHLDFSAVPLPFMFL+S+MFLRVTW
Subjt:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW

Query:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA
        IQARLVYNNNQLDVD+SSDSEE+SVEEL  SSPPVSSLERKPMAF  D PK RSVSHPSVR++RLG+RTMQYRNGFSSRGIRKRRSSLR+RR    SLAA
Subjt:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA

Query:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQN----SVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKD
        MQK I SLAVDD+K  VSFPS ASCNRH++      +RDSAGRI+E SST LGS+MDVDSSCCNANILIVEADKC REEGAN++LEFS+SCEWLLVVKKD
Subjt:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQN----SVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKD

Query:  GSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVS
        GSTRYTHKAERVMKPSS NRFTHAILWSVDN WKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYV SSG SFQR DTYISVNDDEV 
Subjt:  GSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVS

Query:  RAMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRV
        RAMTK TANYDMDS+DE WL+EFN+GLIAT+KH EC+SE+ FEL VD FEKGF+CNPDAFSDEKAPADIC  L S  IVESLY YWTKKR+QRKSSLIRV
Subjt:  RAMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRV

Query:  FQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILE----AIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKA
        FQA+QSKRKPP+VPKP+MRR+RSLKRQPSQSGS RT Q SILE    AI  RRDA+EDQNAVQKYEE+KAAAEKCIENAVNKRQRAQLLLENADLA YKA
Subjt:  FQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILE----AIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKA

Query:  MLALRIAEAIQASESPEAA--ASCFLE
        M ALRIAEAI+AS+S EAA  A+CFLE
Subjt:  MLALRIAEAIQASESPEAA--ASCFLE

A0A5D3CKR0 Enhancer of polycomb-like protein (e value: 0.0e+00; %identity: 85.25)
Query:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
        MPSGMRRTRVFGLVKG+DGARVLRSGRRLWPESGEVK+KKSKDASDWYP+I+ RGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
Subjt:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV

Query:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF
        KVLPRIG++ KS D DRMFGKVY+RKRKRGRLE+GE F EMESDNVLSGDRMFGLRF RRQRSRK+DVE WESTAGGRS  LHF RQ I  PRD  LTIF
Subjt:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIF

Query:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW
        AGSS+DGGCFSDFIL+VLRHLKSP LSVAKFSAFLLSNPINGVFALKGMRFLQGYPP GS GMC IFG RQSIPMFHLDFSAVPLPFMFL+S+MFLRVTW
Subjt:  AGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTW

Query:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA
        IQARLVYNNNQLDVD+SSDSEE+SVEEL  SSPPVSSLERKPMAF  D PK RSVSHPSVR++RLG+RTMQYRNGFSSRGIRKRRSSLR+RR    SLAA
Subjt:  IQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----SLAA

Query:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQN----SVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKD
        MQK I SLAVDD+K  VSFPS ASCNRH++      +RDSAGRI+E SST LGS+MDVDSSCCNANILIVEADKC REEGAN++LEFS+SCEWLLVVKKD
Subjt:  MQKTIGSLAVDDMKHSVSFPSAASCNRHQN----SVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKD

Query:  GSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVS
        GSTRYTHKAERVMKPSS NRFTHAILWSVDN WKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYV SSG SFQR DTYISVNDDEV 
Subjt:  GSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVS

Query:  RAMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRV
        RAMTK TANYDMDS+DE WL+EFN+GLIAT+KH EC+SE+ FEL VD FEKGF+CNPDAFSDEKAPADIC  L S  IVESLY YWTKKR+QRKSSLIRV
Subjt:  RAMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRV

Query:  FQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILE----AIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKA
        FQA+QSKRKPP+VPKP+MRR+RSLKRQPSQSGS RT Q SILE    AI  RRDA+EDQNAVQKYEE+KAAAEKCIENAVNKRQRAQLLLENADLA YKA
Subjt:  FQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILE----AIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKA

Query:  MLALRIAEAIQASESPEAA--ASCFLE
        M ALRIAEAI+AS+S EAA  A+CFLE
Subjt:  MLALRIAEAIQASESPEAA--ASCFLE

A0A6J1FH41 Enhancer of polycomb-like protein (e value: 0.0e+00; %identity: 80.48)
Query:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV
        MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVK+KKSKDASDWYPVI+SRGNGGGSG  RLHGKWTQVRNVKPKRVVVVNIRE++DACV KVPEP+
Subjt:  MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPV

Query:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISP--PRDRVLT
        K+LPRIGS+G+SGD DRMFGKVY+RKRKRGR ENG  F EME DN +SGDRMFGLRF RRQRSRKTD+  WE TA GRS KLHF R  +SP  P DRVLT
Subjt:  KVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRF-RRQRSRKTDVEDWESTAGGRSAKLHFRRQRISP--PRDRVLT

Query:  IFAGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRV
        IFAGSS++ GCFSDFI SVLRHL SP+L+VAK ++FLLSN INGVFA  GM FLQGYPP GSSGMCVIFG RQ IPMFHLDFSAVP PFM+LHS+MFLR 
Subjt:  IFAGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRV

Query:  TWIQARLVYNNNQLDVDISSDSEEDS-VEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----S
        TWIQARLVYNN QLDVD+SSDSEEDS VEE HVS+PPVSSL+ K +AFGVDH   RS S  SVRASRLGSR +QYRNGFSSRGIRKRRSSLRMRR    S
Subjt:  TWIQARLVYNNNQLDVDISSDSEEDS-VEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR----S

Query:  LAAMQKTIGSLAVDDMKHSVSFPSAASCNRHQNSVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKDG
        LAAMQKT+G   +DDMK SVSFPS ASCNRH+NS LRDS+G    VSSTALGS+MDVDSSCCNANILIVEAD+C REEGAN++LEFS+SCEWLL VKK+G
Subjt:  LAAMQKTIGSLAVDDMKHSVSFPSAASCNRHQNSVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKDG

Query:  STRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVSR
        STRYTHKAE VMKP+ CNRFTHAILWS DN WKLEFPNRRDW IFKDLYKECSDRNIPC  AKAIPVPRVSEVPDYVDSS   F+RPDTYISVN+DEV R
Subjt:  STRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVSR

Query:  AMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRVF
           K TANYDMDS+DEEWL +FND LIAT+K HEC+S + FELM+D FEK  FCNPDAFSDEKAP D+ M L SR  VESL+ YWT+KRRQRKS LIRVF
Subjt:  AMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRVF

Query:  QAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLALR
        QAHQSKRKPPVVPKPIMRR+RS+KRQPSQSGSGR TQSSIL+AI+SRRDA+E+QNAVQKYEEAKAAAE+C+E+AV+KRQRAQLLLENADLAAYKA++ALR
Subjt:  QAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLALR

Query:  IAEAIQASESPE-----AAASCFLE
        IAEAIQASE PE     AAA+CFLE
Subjt:  IAEAIQASESPE-----AAASCFLE

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
AT4G32620.1 Enhancer of polycomb-like transcription factor protein (e value: 1.0e-42; %identity: 33.45)
Query:  SVSFPSAASCNRHQNSVLRDSAGR--IREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKDGSTRYTHKAERVMKPSS
        S S PS  S +R++ S+L+    +   R  +    G   D++SS C+AN+L+   D+ +RE GA + LE   + EW L VK  G+T+Y+H+A + ++P S
Subjt:  SVSFPSAASCNRHQNSVLRDSAGR--IREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKDGSTRYTHKAERVMKPSS

Query:  CNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVP-DYVDSSGVSFQRPDT-YISVNDDEVSRAMTKGTANYDMDSD
         NRFTHA++W     W LEFP+R  WF+FK++++EC +RN   ++ + IP+P +  +  D  D +   F R  + Y    + +V  A+      YDMDSD
Subjt:  CNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVP-DYVDSSGVSFQRPDT-YISVNDDEVSRAMTKGTANYDMDSD

Query:  DEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRVFQ
        DE+ L+   +   A       I+E+ FE  +D FEK  F             ++   + S   +E++Y  W  KR+++   LIR  Q
Subjt:  DEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRVFQ

AT4G32620.2 Enhancer of polycomb-like transcription factor protein (e value: 1.0e-42; %identity: 33.45)
Query:  SVSFPSAASCNRHQNSVLRDSAGR--IREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKDGSTRYTHKAERVMKPSS
        S S PS  S +R++ S+L+    +   R  +    G   D++SS C+AN+L+   D+ +RE GA + LE   + EW L VK  G+T+Y+H+A + ++P S
Subjt:  SVSFPSAASCNRHQNSVLRDSAGR--IREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKDGSTRYTHKAERVMKPSS

Query:  CNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVP-DYVDSSGVSFQRPDT-YISVNDDEVSRAMTKGTANYDMDSD
         NRFTHA++W     W LEFP+R  WF+FK++++EC +RN   ++ + IP+P +  +  D  D +   F R  + Y    + +V  A+      YDMDSD
Subjt:  CNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVP-DYVDSSGVSFQRPDT-YISVNDDEVSRAMTKGTANYDMDSD

Query:  DEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRVFQ
        DE+ L+   +   A       I+E+ FE  +D FEK  F             ++   + S   +E++Y  W  KR+++   LIR  Q
Subjt:  DEEWLIEFNDGLIATEKHHECISEEKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRVFQ

AT5G04670.1 Enhancer of polycomb-like transcription factor protein (e value: 5.5e-129; %identity: 39.54)
Query:  MPS-GMRR-TRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPE
        MPS GMRR TRVFG+VK  DGARVLRSGRR+WP  GE KV+++ D  D       +      G+    GK +  +   PK+V      + DD  V K   
Subjt:  MPS-GMRR-TRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPE

Query:  PVKVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRFRRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTI
          +   R    G     D+MFG VY+RKRKR          E  S +                          S    RS K + RR+++S     VLT+
Subjt:  PVKVLPRIGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRFRRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTI

Query:  FAGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVT
            S +   F       +R+++  +L ++  ++F LS PIN VFA  G+RFL    P  S G+C  FG    +P+F  DF+ +P  FM +H  +F+RV 
Subjt:  FAGSSLDGGCFSDFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVT

Query:  -----WIQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR--
             +++  L   NN ++       E DS  EL +  P      R  +  G+         HPSVRAS+L     QYR    S   +KRRSSLR RR  
Subjt:  -----WIQARLVYNNNQLDVDISSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRR--

Query:  --SLAAMQKTIGSLAVDDMKHSVSFPSAASCNRHQNSVLRDSAGRIREVSSTALGSSM-DVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVV
          S  A +   G+   D      +  +A S  + ++SVL +S+     +S   +  +  ++DS CC+ANIL++ +D+C REEG +V+LE SSS EW LV+
Subjt:  --SLAAMQKTIGSLAVDDMKHSVSFPSAASCNRHQNSVLRDSAGRIREVSSTALGSSM-DVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVV

Query:  KKDGSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEV---PDYVDSSGVSFQRPDTYISV
        KKDG+ RY+H A+R M+P S NR THA +W   ++WKLEF +R+DW  FKD+YKEC +RN+     K IP+P V EV    +Y+D+     + P +YISV
Subjt:  KKDGSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWKLEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEV---PDYVDSSGVSFQRPDTYISV

Query:  NDDEVSRAMTKGTANYDMDSDDEEWLIEFNDGLIATE-KHHECISEEKFELMVDGFEKGFFCNP-DAFSDEKAPA-DICMHLSSRPIVESLYAYWTKKRR
        N+DEVSRAM +  A YDMDS+DEEWL   N  ++  E   +  +  E FELM+DGFEK  F +P D   DEKA       +L  + +VE+++ YW KKR+
Subjt:  NDDEVSRAMTKGTANYDMDSDDEEWLIEFNDGLIATE-KHHECISEEKFELMVDGFEKGFFCNP-DAFSDEKAPA-DICMHLSSRPIVESLYAYWTKKRR

Query:  QRKSSLIRVFQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADL
        QRK+ L+R+FQ HQ K K  ++ KP+ R+RRS KRQ SQ   G+  Q+S     +   +  E+++ + + EEAK  A+K +E A+ KR+RAQ+L ENADL
Subjt:  QRKSSLIRVFQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEAIISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADL

Query:  AAYKAMLALRIAEAIQASESPE
        A YKAM ALRIAEAI+ +ES E
Subjt:  AAYKAMLALRIAEAIQASESPE


Sequences
CDS sequence
ATGCCGTCTGGGATGAGAAGAACTAGAGTTTTTGGTTTGGTGAAGGGTCTTGATGGAGCTAGAGTTCTAAGGTCTGGAAGACGGCTTTGGCCTGAATCTGGAGAG
GTGAAGGTTAAGAAGTCTAAAGACGCTAGTGATTGGTACCCTGTTATTGAGAGCAGAGGAAATGGGGGCGGAAGTGGCCATGGTAGGCTCCATGGTAAGTGGACA
CAAGTTCGAAATGTCAAGCCAAAGAGGGTTGTAGTTGTAAACATTCGTGAGGATGATGATGCTTGTGTTGTGAAAGTGCCTGAACCAGTGAAGGTTTTGCCTAGG
ATTGGTAGCAATGGCAAGTCTGGTGATGGGGATAGAATGTTTGGGAAAGTTTATACAAGGAAGAGGAAGAGGGGTCGTTTGGAAAATGGGGAGTTTTTTTATGAA
ATGGAGAGTGATAATGTTCTTTCAGGGGATAGGATGTTTGGACTCCGGTTTCGAAGACAGAGATCGAGGAAGACTGATGTTGAAGATTGGGAGTCTACTGCAGGT
GGCCGTTCTGCTAAACTGCATTTCCGTAGGCAGAGGATTTCGCCACCCCGGGATCGGGTTCTTACTATTTTTGCGGGGAGTAGTCTTGATGGTGGCTGTTTTTCA
GATTTTATACTCTCGGTTCTTAGACATTTGAAGAGTCCTGACCTGAGTGTGGCTAAGTTCTCTGCATTTTTGTTATCTAATCCTATCAATGGAGTTTTTGCTTTG
AAGGGAATGCGTTTCTTGCAGGGTTATCCTCCTCCTGGAAGTTCTGGCATGTGTGTGATTTTTGGGGTCAGGCAGTCAATTCCAATGTTTCATTTGGATTTTTCC
GCTGTTCCTCTCCCTTTTATGTTTTTGCATTCCAAGATGTTTCTTAGAGTGACTTGGATTCAGGCTCGTCTTGTATATAATAACAACCAGTTAGATGTAGATATA
AGTAGTGATAGTGAAGAAGATAGTGTTGAAGAGCTACATGTTTCCAGTCCTCCTGTAAGTTCTTTGGAACGCAAGCCCATGGCCTTTGGAGTTGATCATCCTAAG
ATTCGATCTGTTTCACATCCATCTGTTAGAGCTTCAAGGTTAGGTAGTCGGACCATGCAATACAGAAACGGTTTCAGCTCTCGTGGTATACGGAAAAGGAGAAGT
TCACTGAGAATGAGGAGGTCTCTTGCTGCTATGCAAAAAACTATCGGCTCTTTGGCGGTTGATGATATGAAACACAGTGTATCTTTTCCTTCTGCAGCATCTTGC
AACCGGCACCAGAACTCAGTCCTGAGAGATTCTGCTGGGCGCATCAGAGAAGTGAGTTCTACTGCATTGGGATCATCAATGGATGTTGACTCATCATGCTGCAAT
GCAAATATATTAATAGTAGAAGCTGATAAATGTTTTAGAGAAGAGGGAGCCAATGTCCTGTTAGAGTTCTCTTCATCGTGCGAATGGCTTCTAGTGGTCAAGAAA
GATGGTTCGACTAGATACACCCACAAAGCCGAAAGAGTAATGAAGCCCTCTTCTTGCAATCGTTTTACACATGCAATATTGTGGTCAGTAGATAACAGTTGGAAG
CTAGAGTTTCCTAATCGAAGGGATTGGTTTATTTTCAAGGACTTGTACAAGGAGTGTTCTGATCGCAATATACCATGTTCTATTGCTAAAGCCATTCCTGTGCCA
AGAGTGTCTGAAGTTCCAGATTATGTCGATAGTAGTGGTGTTTCTTTTCAAAGGCCAGATACGTACATCTCTGTAAACGATGACGAGGTATCTAGAGCGATGACA
AAAGGTACTGCAAACTACGACATGGATTCTGATGACGAGGAATGGCTAATCGAGTTTAATGATGGACTTATTGCAACAGAGAAGCACCATGAATGTATCTCAGAG
GAGAAATTTGAGTTGATGGTCGATGGTTTTGAGAAGGGATTTTTCTGTAATCCCGATGCCTTCTCTGACGAGAAAGCACCTGCTGATATATGCATGCATCTCAGT
AGCCGGCCGATAGTAGAGTCTTTATATGCTTATTGGACGAAGAAACGAAGACAAAGAAAATCATCTTTGATAAGGGTTTTCCAGGCTCATCAATCAAAGAGGAAA
CCTCCCGTGGTCCCTAAACCAATCATGCGAAGAAGAAGATCACTCAAAAGGCAGCCTAGCCAATCTGGGAGTGGTAGAACTACCCAATCAAGTATTTTAGAAGCC
ATAATTTCGAGACGAGATGCGATGGAGGACCAGAATGCTGTACAAAAGTATGAAGAAGCAAAGGCAGCAGCAGAGAAATGCATTGAAAATGCTGTTAATAAGCGG
CAAAGGGCACAATTGCTGTTGGAGAATGCAGATTTGGCTGCTTACAAAGCCATGTTAGCACTCAGAATTGCCGAAGCAATTCAAGCATCAGAATCTCCGGAAGCT
GCTGCTTCTTGTTTTCTCGAATGA
mRNA sequence
ATGCCGTCTGGGATGAGAAGAACTAGAGTTTTTGGTTTGGTGAAGGGTCTTGATGGAGCTAGAGTTCTAAGGTCTGGAAGACGGCTTTGGCCTGAATCTGGAGAG
GTGAAGGTTAAGAAGTCTAAAGACGCTAGTGATTGGTACCCTGTTATTGAGAGCAGAGGAAATGGGGGCGGAAGTGGCCATGGTAGGCTCCATGGTAAGTGGACA
CAAGTTCGAAATGTCAAGCCAAAGAGGGTTGTAGTTGTAAACATTCGTGAGGATGATGATGCTTGTGTTGTGAAAGTGCCTGAACCAGTGAAGGTTTTGCCTAGG
ATTGGTAGCAATGGCAAGTCTGGTGATGGGGATAGAATGTTTGGGAAAGTTTATACAAGGAAGAGGAAGAGGGGTCGTTTGGAAAATGGGGAGTTTTTTTATGAA
ATGGAGAGTGATAATGTTCTTTCAGGGGATAGGATGTTTGGACTCCGGTTTCGAAGACAGAGATCGAGGAAGACTGATGTTGAAGATTGGGAGTCTACTGCAGGT
GGCCGTTCTGCTAAACTGCATTTCCGTAGGCAGAGGATTTCGCCACCCCGGGATCGGGTTCTTACTATTTTTGCGGGGAGTAGTCTTGATGGTGGCTGTTTTTCA
GATTTTATACTCTCGGTTCTTAGACATTTGAAGAGTCCTGACCTGAGTGTGGCTAAGTTCTCTGCATTTTTGTTATCTAATCCTATCAATGGAGTTTTTGCTTTG
AAGGGAATGCGTTTCTTGCAGGGTTATCCTCCTCCTGGAAGTTCTGGCATGTGTGTGATTTTTGGGGTCAGGCAGTCAATTCCAATGTTTCATTTGGATTTTTCC
GCTGTTCCTCTCCCTTTTATGTTTTTGCATTCCAAGATGTTTCTTAGAGTGACTTGGATTCAGGCTCGTCTTGTATATAATAACAACCAGTTAGATGTAGATATA
AGTAGTGATAGTGAAGAAGATAGTGTTGAAGAGCTACATGTTTCCAGTCCTCCTGTAAGTTCTTTGGAACGCAAGCCCATGGCCTTTGGAGTTGATCATCCTAAG
ATTCGATCTGTTTCACATCCATCTGTTAGAGCTTCAAGGTTAGGTAGTCGGACCATGCAATACAGAAACGGTTTCAGCTCTCGTGGTATACGGAAAAGGAGAAGT
TCACTGAGAATGAGGAGGTCTCTTGCTGCTATGCAAAAAACTATCGGCTCTTTGGCGGTTGATGATATGAAACACAGTGTATCTTTTCCTTCTGCAGCATCTTGC
AACCGGCACCAGAACTCAGTCCTGAGAGATTCTGCTGGGCGCATCAGAGAAGTGAGTTCTACTGCATTGGGATCATCAATGGATGTTGACTCATCATGCTGCAAT
GCAAATATATTAATAGTAGAAGCTGATAAATGTTTTAGAGAAGAGGGAGCCAATGTCCTGTTAGAGTTCTCTTCATCGTGCGAATGGCTTCTAGTGGTCAAGAAA
GATGGTTCGACTAGATACACCCACAAAGCCGAAAGAGTAATGAAGCCCTCTTCTTGCAATCGTTTTACACATGCAATATTGTGGTCAGTAGATAACAGTTGGAAG
CTAGAGTTTCCTAATCGAAGGGATTGGTTTATTTTCAAGGACTTGTACAAGGAGTGTTCTGATCGCAATATACCATGTTCTATTGCTAAAGCCATTCCTGTGCCA
AGAGTGTCTGAAGTTCCAGATTATGTCGATAGTAGTGGTGTTTCTTTTCAAAGGCCAGATACGTACATCTCTGTAAACGATGACGAGGTATCTAGAGCGATGACA
AAAGGTACTGCAAACTACGACATGGATTCTGATGACGAGGAATGGCTAATCGAGTTTAATGATGGACTTATTGCAACAGAGAAGCACCATGAATGTATCTCAGAG
GAGAAATTTGAGTTGATGGTCGATGGTTTTGAGAAGGGATTTTTCTGTAATCCCGATGCCTTCTCTGACGAGAAAGCACCTGCTGATATATGCATGCATCTCAGT
AGCCGGCCGATAGTAGAGTCTTTATATGCTTATTGGACGAAGAAACGAAGACAAAGAAAATCATCTTTGATAAGGGTTTTCCAGGCTCATCAATCAAAGAGGAAA
CCTCCCGTGGTCCCTAAACCAATCATGCGAAGAAGAAGATCACTCAAAAGGCAGCCTAGCCAATCTGGGAGTGGTAGAACTACCCAATCAAGTATTTTAGAAGCC
ATAATTTCGAGACGAGATGCGATGGAGGACCAGAATGCTGTACAAAAGTATGAAGAAGCAAAGGCAGCAGCAGAGAAATGCATTGAAAATGCTGTTAATAAGCGG
CAAAGGGCACAATTGCTGTTGGAGAATGCAGATTTGGCTGCTTACAAAGCCATGTTAGCACTCAGAATTGCCGAAGCAATTCAAGCATCAGAATCTCCGGAAGCT
GCTGCTTCTTGTTTTCTCGAATGA
Protein sequence
MPSGMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKVKKSKDASDWYPVIESRGNGGGSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPVKVLPR
IGSNGKSGDGDRMFGKVYTRKRKRGRLENGEFFYEMESDNVLSGDRMFGLRFRRQRSRKTDVEDWESTAGGRSAKLHFRRQRISPPRDRVLTIFAGSSLDGGCFS
DFILSVLRHLKSPDLSVAKFSAFLLSNPINGVFALKGMRFLQGYPPPGSSGMCVIFGVRQSIPMFHLDFSAVPLPFMFLHSKMFLRVTWIQARLVYNNNQLDVDI
SSDSEEDSVEELHVSSPPVSSLERKPMAFGVDHPKIRSVSHPSVRASRLGSRTMQYRNGFSSRGIRKRRSSLRMRRSLAAMQKTIGSLAVDDMKHSVSFPSAASC
NRHQNSVLRDSAGRIREVSSTALGSSMDVDSSCCNANILIVEADKCFREEGANVLLEFSSSCEWLLVVKKDGSTRYTHKAERVMKPSSCNRFTHAILWSVDNSWK
LEFPNRRDWFIFKDLYKECSDRNIPCSIAKAIPVPRVSEVPDYVDSSGVSFQRPDTYISVNDDEVSRAMTKGTANYDMDSDDEEWLIEFNDGLIATEKHHECISE
EKFELMVDGFEKGFFCNPDAFSDEKAPADICMHLSSRPIVESLYAYWTKKRRQRKSSLIRVFQAHQSKRKPPVVPKPIMRRRRSLKRQPSQSGSGRTTQSSILEA
IISRRDAMEDQNAVQKYEEAKAAAEKCIENAVNKRQRAQLLLENADLAAYKAMLALRIAEAIQASESPEAAASCFLE