CuGenDBv2

Moc08g08210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g08210
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: WD_REPEATS_REGION domain-containing protein
Genome location: chr8:6059958..6071916
RNA-Seq Expression: Moc08g08210
Synteny: Moc08g08210
Gene Ontology terms:
GO:0006384 - transcription initiation from RNA polymerase III promoter (biological process)
GO:0016573 - histone acetylation (biological process)
GO:0000127 - transcription factor TFIIIC complex (cellular component)
GO:0004402 - histone acetyltransferase activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR019775 - WD40 repeat, conserved site
IPR024761 - Transcription factor IIIC, 90kDa subunit, N-terminal
IPR036322 - WD40-repeat-containing domain superfamily
IPR044230 - General transcription factor 3C polypeptide 4


Homology
GenBank top hits (e value, % identity, alignment)
KAG7029436.1 hypothetical protein SDJN02_07775 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 0.0e+00; % identity: 75.87
Query:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
        MVESYFQ VTL  AP YPNAIAWSDENLIAVASGPLVTILNP SPFGARGTITIP S+P  IGVIERKDLF+ CLLPTCL+RD +P  +SIAWSPLG+AP
Subjt:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP

Query:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP
        +AGCLLAVCT+EG VKLYR PFCDF+AEWIEIMDISN LYDY  S+KFGELDVP S+ SD P +  GS +DVQEHFT ED KRR   A NLNN S LNQ 
Subjt:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP

Query:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA
        L+KS  KRP+RT +SSV   I+AQQYASRSAMLLS+V+AWSPVM+PSH VH H NSSVSVLA+G KSG+V+FWKVNVP+CYSLAEC VPTRALLVGLLQA
Subjt:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA

Query:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN
        HNSW+NCI+W +FDSDSSNPK+LLATGSTDGSV+IWQ  C+ELLASSD+NFASFSLLKEVISG  VP TLLSL++PN  VHKLFLAIGRGSGSLEIRIFN
Subjt:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN

Query:  IPSSEFDNV-LYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDR
        + SSEFD+V  Y AHDHVVTG AW FD RYLFTCSEDNIL GW+LD SSLR VPISSHIP  G+SIDLPD+FRSCFG+AVSPGNLVAAVVRNFD+ESLDR
Subjt:  IPSSEFDNV-LYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDR

Query:  MYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQC
        MYQAR+Q+AA+QFFWI GEE++ +PNS SYF  E   D+SKKE V WESSM WSLN+FK+LNKPMV+WDVVAAL+AFRQSIPE+VD+ILLKW ++SYL+ 
Subjt:  MYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQC

Query:  NEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKY
        NEELSA KIL  VSRNVSTFSTRQLHLLNVICRRVVLSEL+QDQVNN+LQ+LE+LND E++K ILWKELLLSSER+LRQRLIGL   +CAKL +LS ++Y
Subjt:  NEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKY

Query:  RPGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV-----------------
        RPG+WYPIGLVEMQQW+RYNHE+L+ES+  IAS+E    H SEHSA EQCT+CSASVPFESPE GFCQG K +   GQ+HKLV                 
Subjt:  RPGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV-----------------

Query:  ---------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
                 RL PDILFQMSE PDFSSLTLSDS+IPSKPLCPFCGILLQRRQPDFLLS C V
Subjt:  ---------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV

XP_022144265.1 uncharacterized protein LOC111013991 isoform X1 [Momordica charantia]; e value: 0.0e+00; % identity: 96.86
Query:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
        MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
Subjt:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP

Query:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP
        DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP
Subjt:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP

Query:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA
        LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA
Subjt:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA

Query:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN
        HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN
Subjt:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN

Query:  IPSSEFDNVLYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDRM
        IPSSEFDNVLYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDRM
Subjt:  IPSSEFDNVLYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDRM

Query:  YQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQCN
        YQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQCN
Subjt:  YQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQCN

Query:  EELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKYR
        EELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKYR
Subjt:  EELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKYR

Query:  PGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV------------------
        PGYWYP GLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV                  
Subjt:  PGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV------------------

Query:  --------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
                RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
Subjt:  --------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV

XP_022144266.1 uncharacterized protein LOC111013991 isoform X2 [Momordica charantia]; e value: 0.0e+00; % identity: 96.4
Query:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
        MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
Subjt:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP

Query:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP
        DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLN    LNQP
Subjt:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP

Query:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA
        LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA
Subjt:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA

Query:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN
        HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN
Subjt:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN

Query:  IPSSEFDNVLYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDRM
        IPSSEFDNVLYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDRM
Subjt:  IPSSEFDNVLYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDRM

Query:  YQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQCN
        YQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQCN
Subjt:  YQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQCN

Query:  EELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKYR
        EELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKYR
Subjt:  EELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKYR

Query:  PGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV------------------
        PGYWYP GLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV                  
Subjt:  PGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV------------------

Query:  --------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
                RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
Subjt:  --------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV

XP_022961658.1 uncharacterized protein LOC111462361 [Cucurbita moschata]; e value: 0.0e+00; % identity: 75.75
Query:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
        MVESYFQ VTL  AP YPNAIAWSDENLIAVASGPLVTILNP SPFGARGTITIP S+P  IGVI RKDLF+ CLLPTCL+RD +P  +SIAWSPLG+AP
Subjt:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP

Query:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP
        +AGCLLAVCT+EG VKLYR PFCDF+AEWIEIMDISN LYDY  S+KFGELDVP S+ SD P +  GS +DVQEHFT ED KRR   A NLNN S LNQ 
Subjt:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP

Query:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA
        L+KS  KRP+RT +SSV   I+AQQYASRSAMLLS+V+AWSPVM+PSH VH H NSSVSVLA+G KSG+V+FWKVNVP+CYSLAEC VPTRALLVGLLQA
Subjt:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA

Query:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN
        HNSW+NCI+W +FDSDSSNPK+LLATGSTDGSV+IWQ  C+ELLASSD+NFASFSLLKEVISG  VP TLLSL++PN  VHKLFLAIGRGSGSLEIRIFN
Subjt:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN

Query:  IPSSEFDNV-LYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDR
        + SSEFD+V  Y AHDHVVTG AW FD RYLFTCSEDNIL GW+LD SSLR VPISSHIP  G+SIDLPD+FRSCFG+AVSPGNLVAAVVRNFD+ESLDR
Subjt:  IPSSEFDNV-LYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDR

Query:  MYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQC
        MYQAR+Q+AA+QFFWI GEE++ +PNS SYF  E   D+SKKE V WESSM WSLN+FK+LNKPMV+WDVVAAL+AFRQSIPE+VD+ILLKW ++SYL+ 
Subjt:  MYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQC

Query:  NEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKY
        NEELSA KIL  VSRNVSTFSTRQLHLLNVICRRVVLSEL+QDQVNN+LQ+LE+LND E++K ILWKELLLSSER+LRQRLIGL   +CAKL +LS ++Y
Subjt:  NEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKY

Query:  RPGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV-----------------
        RPG+WYPIGLVEMQQW+RYNHE+L+ES+  IAS+E    H SEHSA EQCT+CSASVPFESPE GFCQG K +   GQ+HKLV                 
Subjt:  RPGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV-----------------

Query:  ---------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
                 RL PDILFQMSE PDFSSLTLSDS+IPSKPLCPFCGILLQRRQPDFLLS C V
Subjt:  ---------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV

XP_022996975.1 uncharacterized protein LOC111492045 [Cucurbita maxima]; e value: 0.0e+00; % identity: 76.01
Query:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
        MVESYFQ VTL  AP YPNAIAWSDENLIAVASGPLVTILNP SPFGARGTITIP S+P  IGVIERKDLF+ CLLPTCL+RD +P  +SIAWSPLG+AP
Subjt:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP

Query:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP
        +AGCLLAVCT+EG VKLYR PFCDF+AEWIEIMDISN LYDY  S+KFGELDVP SK SD   +  GS VDVQEHFT ED KRR   A NLNN SSLNQ 
Subjt:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP

Query:  LDKSKEKRP-RRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQ
        L+KSKEKRP RRT +SSV   I+AQQYASRSAMLLS+V+AWSPVM+PSH VH H NSSVSVLA+G KSG V+FWKVNVP+CYSLAEC VPTRALLVGLLQ
Subjt:  LDKSKEKRP-RRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQ

Query:  AHNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIF
        AHNSW+NCISW +FDSDSSN K+LLATGSTDGSV+IWQC C+ELLASSD+NFASFSLLKEVISG  VP TLLSL++PN  VHKLFLAIGRGSGSLEIRIF
Subjt:  AHNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIF

Query:  NIPSSEFDNV-LYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLD
        N+ SSEFD+V  Y AHDHVVTG AW FD RYLFTCSEDNIL GW+LD SSLR VPISS IP  G+SIDLPD+FRSCFG+AVSPGNLVAAVVRNFD+ESLD
Subjt:  NIPSSEFDNV-LYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLD

Query:  RMYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQ
        RMYQAR+Q+AA+QFFWI GEE++ +PNS SYF  E   D+SKKE V WESSM WSLN+FK+LNKPMV+WDVVAAL+AFRQSIPE+VD+ILLKWL++SYL+
Subjt:  RMYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQ

Query:  CNEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITK
         NEE SA KIL  +SRNVST+STRQLHLLNVICRRVVLSEL+QDQVNN+LQ+LE+LND E++K ILWKELLLSSER+LRQRLIGL   +CAKL +LS T+
Subjt:  CNEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITK

Query:  YRPGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV----------------
        YRPG+WYPIGLVEMQQW+RYNHE+++ES+  IAS+E G  H SEHSA EQCT+CSASVPFESPE GFCQG K +   GQ+HKLV                
Subjt:  YRPGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV----------------

Query:  ----------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
                  RL PDILFQMS+ PDFSSLTL DS+IPSKPLCPFCGILLQRRQPDFLLS C V
Subjt:  ----------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV

TrEMBL top hits (e value, % identity, alignment)
A0A5A7VH44 WD_REPEATS_REGION domain-containing protein; e value: 0.0e+00; % identity: 73.62
Query:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
        MVE++FQ V+L  AP YPNAIAWSDENLIA+ASGPLVTI+NP SPFGARGTITIP ++P +IG++ERKDLFS CLL TCL+RD QP  +S+AWSP+G+AP
Subjt:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP

Query:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP
        +AGCLLAVCT+EG VKLYR PFCDFSAEWIEI+DISN LYDYL SIK+GELDV  SK SDIP + +GS VDVQE+FT ++ KRR  D L  +NESSLNQ 
Subjt:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP

Query:  LDKSKEKRPRRTGE-SSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQ
        L+KSKEKR RR  E SSV   ISAQQYASRSAMLLSLV+AWSPV++PS   H H NSS  VLA+G KSG+V+FWKVNVP+CYSLAECMVPT ALLVG+LQ
Subjt:  LDKSKEKRPRRTGE-SSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQ

Query:  AHNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIF
        AHNSWINCISW LFDSDSS+ K+L+ATGSTDGSV+IWQC C+ELLASSDSNFASFSLLKEVISGE VP T+LSL++PNL  HKLFLAIGRGSGSLEIRIF
Subjt:  AHNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIF

Query:  NIPSSEFDNV-LYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLD
        N+ +SEFDNV LY AH HVVTG+AW  D RYLFTCSEDN L GW+LD SSLR VPISSHIP  G SIDLPDTFRSCFGIA+SPGNLV AVVRNFD+ESLD
Subjt:  NIPSSEFDNV-LYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLD

Query:  RMYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQ
        +MYQAR Q+AAVQFFWIGGEE+EVMPNS SYF  E F ++SKKEFV WESSM WSLN+ K+LNKPMVVW+VVAAL+AFR SIPEYVD+ILLKWLA+SYL 
Subjt:  RMYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQ

Query:  CNEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNL-----EKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPS
         + ELSA KIL  +S+NVSTFSTRQLHLLN+ICRRVVLSE +QDQVN+ELQNL     E+L+D E+EK ILWK+LLLSSER+LRQRLIGL  FACAKL S
Subjt:  CNEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNL-----EKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPS

Query:  LSITKYRPGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV-----------
        LSIT+YRPG+WYPIGL EMQQWV  N E+L+ES+K +AS +AG    S+HS++EQCT+CSA VP ESPEFG CQG K + G  Q+HKL+           
Subjt:  LSITKYRPGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV-----------

Query:  ---------------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
                       RL PDILFQMSE P+F SL LSDSEIPSKPLCPFCGILLQRRQPDFLLS CPV
Subjt:  ---------------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV

A0A6J1CR60 uncharacterized protein LOC111013991 isoform X1; e value: 0.0e+00; % identity: 96.86
Query:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
        MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
Subjt:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP

Query:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP
        DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP
Subjt:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP

Query:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA
        LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA
Subjt:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA

Query:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN
        HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN
Subjt:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN

Query:  IPSSEFDNVLYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDRM
        IPSSEFDNVLYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDRM
Subjt:  IPSSEFDNVLYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDRM

Query:  YQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQCN
        YQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQCN
Subjt:  YQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQCN

Query:  EELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKYR
        EELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKYR
Subjt:  EELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKYR

Query:  PGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV------------------
        PGYWYP GLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV                  
Subjt:  PGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV------------------

Query:  --------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
                RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
Subjt:  --------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV

A0A6J1CST2 uncharacterized protein LOC111013991 isoform X2; e value: 0.0e+00; % identity: 96.4
Query:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
        MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
Subjt:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP

Query:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP
        DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLN    LNQP
Subjt:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP

Query:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA
        LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA
Subjt:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA

Query:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN
        HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN
Subjt:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN

Query:  IPSSEFDNVLYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDRM
        IPSSEFDNVLYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDRM
Subjt:  IPSSEFDNVLYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDRM

Query:  YQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQCN
        YQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQCN
Subjt:  YQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQCN

Query:  EELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKYR
        EELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKYR
Subjt:  EELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKYR

Query:  PGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV------------------
        PGYWYP GLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV                  
Subjt:  PGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV------------------

Query:  --------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
                RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
Subjt:  --------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV

A0A6J1HCF6 uncharacterized protein LOC111462361; e value: 0.0e+00; % identity: 75.75
Query:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
        MVESYFQ VTL  AP YPNAIAWSDENLIAVASGPLVTILNP SPFGARGTITIP S+P  IGVI RKDLF+ CLLPTCL+RD +P  +SIAWSPLG+AP
Subjt:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP

Query:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP
        +AGCLLAVCT+EG VKLYR PFCDF+AEWIEIMDISN LYDY  S+KFGELDVP S+ SD P +  GS +DVQEHFT ED KRR   A NLNN S LNQ 
Subjt:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP

Query:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA
        L+KS  KRP+RT +SSV   I+AQQYASRSAMLLS+V+AWSPVM+PSH VH H NSSVSVLA+G KSG+V+FWKVNVP+CYSLAEC VPTRALLVGLLQA
Subjt:  LDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQA

Query:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN
        HNSW+NCI+W +FDSDSSNPK+LLATGSTDGSV+IWQ  C+ELLASSD+NFASFSLLKEVISG  VP TLLSL++PN  VHKLFLAIGRGSGSLEIRIFN
Subjt:  HNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFN

Query:  IPSSEFDNV-LYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDR
        + SSEFD+V  Y AHDHVVTG AW FD RYLFTCSEDNIL GW+LD SSLR VPISSHIP  G+SIDLPD+FRSCFG+AVSPGNLVAAVVRNFD+ESLDR
Subjt:  IPSSEFDNV-LYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDR

Query:  MYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQC
        MYQAR+Q+AA+QFFWI GEE++ +PNS SYF  E   D+SKKE V WESSM WSLN+FK+LNKPMV+WDVVAAL+AFRQSIPE+VD+ILLKW ++SYL+ 
Subjt:  MYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQC

Query:  NEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKY
        NEELSA KIL  VSRNVSTFSTRQLHLLNVICRRVVLSEL+QDQVNN+LQ+LE+LND E++K ILWKELLLSSER+LRQRLIGL   +CAKL +LS ++Y
Subjt:  NEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKY

Query:  RPGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV-----------------
        RPG+WYPIGLVEMQQW+RYNHE+L+ES+  IAS+E    H SEHSA EQCT+CSASVPFESPE GFCQG K +   GQ+HKLV                 
Subjt:  RPGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV-----------------

Query:  ---------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
                 RL PDILFQMSE PDFSSLTLSDS+IPSKPLCPFCGILLQRRQPDFLLS C V
Subjt:  ---------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV

A0A6J1K896 uncharacterized protein LOC111492045; e value: 0.0e+00; % identity: 76.01
Query:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP
        MVESYFQ VTL  AP YPNAIAWSDENLIAVASGPLVTILNP SPFGARGTITIP S+P  IGVIERKDLF+ CLLPTCL+RD +P  +SIAWSPLG+AP
Subjt:  MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAP

Query:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP
        +AGCLLAVCT+EG VKLYR PFCDF+AEWIEIMDISN LYDY  S+KFGELDVP SK SD   +  GS VDVQEHFT ED KRR   A NLNN SSLNQ 
Subjt:  DAGCLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQP

Query:  LDKSKEKRP-RRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQ
        L+KSKEKRP RRT +SSV   I+AQQYASRSAMLLS+V+AWSPVM+PSH VH H NSSVSVLA+G KSG V+FWKVNVP+CYSLAEC VPTRALLVGLLQ
Subjt:  LDKSKEKRP-RRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQ

Query:  AHNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIF
        AHNSW+NCISW +FDSDSSN K+LLATGSTDGSV+IWQC C+ELLASSD+NFASFSLLKEVISG  VP TLLSL++PN  VHKLFLAIGRGSGSLEIRIF
Subjt:  AHNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIF

Query:  NIPSSEFDNV-LYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLD
        N+ SSEFD+V  Y AHDHVVTG AW FD RYLFTCSEDNIL GW+LD SSLR VPISS IP  G+SIDLPD+FRSCFG+AVSPGNLVAAVVRNFD+ESLD
Subjt:  NIPSSEFDNV-LYGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLD

Query:  RMYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQ
        RMYQAR+Q+AA+QFFWI GEE++ +PNS SYF  E   D+SKKE V WESSM WSLN+FK+LNKPMV+WDVVAAL+AFRQSIPE+VD+ILLKWL++SYL+
Subjt:  RMYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQ

Query:  CNEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITK
         NEE SA KIL  +SRNVST+STRQLHLLNVICRRVVLSEL+QDQVNN+LQ+LE+LND E++K ILWKELLLSSER+LRQRLIGL   +CAKL +LS T+
Subjt:  CNEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITK

Query:  YRPGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV----------------
        YRPG+WYPIGLVEMQQW+RYNHE+++ES+  IAS+E G  H SEHSA EQCT+CSASVPFESPE GFCQG K +   GQ+HKLV                
Subjt:  YRPGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLV----------------

Query:  ----------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV
                  RL PDILFQMS+ PDFSSLTL DS+IPSKPLCPFCGILLQRRQPDFLLS C V
Subjt:  ----------RLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G49400.1 Transducin/WD40 repeat-like superfamily protein; e value: 3.5e-182; % identity: 40.93
Query:  SYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAPDAG
        S FQ  +L T+P YPNA+AWS ENLIAVA+G LV I+NP  P G RG ITI ++  +QIG +  +DL +  LLP+ L R+  P VRS++WS +G++P+ G
Subjt:  SYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAPDAG

Query:  CLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSED--------HKRRNNDALNLN---
        CLLAVCT EGRVKLYR P+ DF AEWIEI+DIS  LY+ L+S+ FGE   P +  S            V EH   ED         KRR   A N+N   
Subjt:  CLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSED--------HKRRNNDALNLN---

Query:  --------------------NESSLNQPLDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTF
                             E  + +     +++R          Q IS Q Y SR A+L S  +AWS +++ S           S+LAIG+KSG V+ 
Subjt:  --------------------NESSLNQPLDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTF

Query:  WKVNVPDCYSLAECMVPTRALLVGLLQAHNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLS
        WKV+ P+CY +    V     L  ++Q H+SW++ +SW +F  DSSNP+++L TGS DGSV+IW    ++L  S +   +SF LLKEV++   V ++ LS
Subjt:  WKVNVPDCYSLAECMVPTRALLVGLLQAHNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLS

Query:  LSVPNLHVHKLFLAIGRGSGSLEIRIFNIPSSEFDNVL-YGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTF
          V N H + + LAIG+GSGS E+    I + +F+ ++   AH+ VVTG+AW +D R L++CS+DN +  W L  +++  VPI ++ P   S+ DLPD F
Subjt:  LSVPNLHVHKLFLAIGRGSGSLEIRIFNIPSSEFDNVL-YGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTF

Query:  RSCFGIAVSPGNLVAAVVRNFDIESLDRMYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVA
         SC G+A+SPGNL  A+VRNF++E L+ MYQAR+Q+AAV+F W G ++     +S      E     SK EF +WES++ WSL +F  LNKP+V+WD+VA
Subjt:  RSCFGIAVSPGNLVAAVVRNFDIESLDRMYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVA

Query:  ALVAFRQSIPEYVDYILLKWLASSYLQCNEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLS
        A++AF+QS+PE+V+ +L KWL+ SYL  ++++S   ++ ++++  S   +R LH+LNVI RRV+LSEL  +++N +LQ   +  +DE E D LW +LL  
Subjt:  ALVAFRQSIPEYVDYILLKWLASSYLQCNEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLS

Query:  SERDLRQRLIGLSLFACAKLPSLSITKYRPGY-WYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSA-----VEQCTFCSASVPFESPEFGF
        SER+LR+RL+GLS  A     S   T   P + W P GL  +QQWV  N + +   ++ ++ E      RS +S       E+C +C+A V F S E  F
Subjt:  SERDLRQRLIGLSLFACAKLPSLSITKYRPGY-WYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSA-----VEQCTFCSASVPFESPEFGF

Query:  CQG-------VKCSSGAGQTHKLVR-------------------------LPPDILFQMSEIP-DFSSLTLSD-SEIPSKPLCPFCGILLQRRQPDFLLS
        C+         K      ++HKL R                         L P+ LF +   P D  SL  S  S++ SKP C FCG+LLQR+QP+FLLS
Subjt:  CQG-------VKCSSGAGQTHKLVR-------------------------LPPDILFQMSEIP-DFSSLTLSD-SEIPSKPLCPFCGILLQRRQPDFLLS

Query:  PCPV
          PV
Subjt:  PCPV

AT3G49400.2 Transducin/WD40 repeat-like superfamily protein | 8.2e-171 | 39.82
Query:  SYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAPDAG
        S FQ  +L T+P YPNA+AWS ENLIAVA+G LV I+NP  P G RG ITI ++  +QIG +  +DL +  LLP+ L R+  P VRS++WS +G++P+ G
Subjt:  SYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAPDAG

Query:  CLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSED--------HKRRNNDALNLN---
        CLLAVCT EGRVKLYR P+ DF AEWIEI+DIS  LY+ L+S+ FGE   P +  S            V EH   ED         KRR   A N+N   
Subjt:  CLLAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSED--------HKRRNNDALNLN---

Query:  --------------------NESSLNQPLDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTF
                             E  + +     +++R          Q IS Q Y SR A+L S  +AWS +++ S           S+LAIG+KSG V+ 
Subjt:  --------------------NESSLNQPLDKSKEKRPRRTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTF

Query:  WKVNVPDCYSLAECMVPTRALLVGLLQAHNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLS
        WKV+ P+CY +    V     L  ++Q H+SW++ +SW +F  DSSNP+++L TGS DGSV+IW    ++L  S +   +SF LLKEV++   V ++ LS
Subjt:  WKVNVPDCYSLAECMVPTRALLVGLLQAHNSWINCISWALFDSDSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLS

Query:  LSVPNLHVHKLFLAIGRGSGSLEIRIFNIPSSEFDNVL-YGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTF
          V N H + + LAIG+GSGS E+    I + +F+ ++   AH+ V                  DN +  W L  +++  VPI ++ P   S+ DLPD F
Subjt:  LSVPNLHVHKLFLAIGRGSGSLEIRIFNIPSSEFDNVL-YGAHDHVVTGIAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTF

Query:  RSCFGIAVSPGNLVAAVVRNFDIESLDRMYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVA
         SC G+A+SPGNL  A+VRNF++E L+ MYQAR+Q+AAV+F W G ++     +S      E     SK EF +WES++ WSL +F  LNKP+V+WD+VA
Subjt:  RSCFGIAVSPGNLVAAVVRNFDIESLDRMYQARAQRAAVQFFWIGGEEMEVMPNSCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVA

Query:  ALVAFRQSIPEYVDYILLKWLASSYLQCNEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLS
        A++AF+QS+PE+V+ +L KWL+ SYL  ++++S   ++ ++++  S   +R LH+LNVI RRV+LSEL  +++N +LQ   +  +DE E D LW +LL  
Subjt:  ALVAFRQSIPEYVDYILLKWLASSYLQCNEELSAPKILLRVSRNVSTFSTRQLHLLNVICRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLS

Query:  SERDLRQRLIGLSLFACAKLPSLSITKYRPGY-WYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSA-----VEQCTFCSASVPFESPEFGF
        SER+LR+RL+GLS  A     S   T   P + W P GL  +QQWV  N + +   ++ ++ E      RS +S       E+C +C+A V F S E  F
Subjt:  SERDLRQRLIGLSLFACAKLPSLSITKYRPGY-WYPIGLVEMQQWVRYNHENLRESVKGIASEEAGIHHRSEHSA-----VEQCTFCSASVPFESPEFGF

Query:  CQG-------VKCSSGAGQTHKLVR-------------------------LPPDILFQMSEIP-DFSSLTLSD-SEIPSKPLCPFCGILLQRRQPDFLLS
        C+         K      ++HKL R                         L P+ LF +   P D  SL  S  S++ SKP C FCG+LLQR+QP+FLLS
Subjt:  CQG-------VKCSSGAGQTHKLVR-------------------------LPPDILFQMSEIP-DFSSLTLSD-SEIPSKPLCPFCGILLQRRQPDFLLS

Query:  PCPV
          PV
Subjt:  PCPV


Sequences
CDS sequence
ATGGTGGAATCATATTTTCAGGGTGTCACGTTGAGCACTGCCCCAATATACCCAAATGCCATTGCATGGTCTGATGAGAATTTAATCGCCGTTGCCTCCGGCCCC
CTTGTCACTATACTGAATCCGACATCACCTTTTGGAGCACGAGGCACTATTACAATCCCTGAAAGTAATCCATTTCAAATAGGGGTGATAGAAAGGAAAGATTTA
TTTTCTAGTTGCTTGTTGCCAACTTGCTTAACTCGGGACATCCAACCTTCTGTGCGGTCCATCGCATGGTCTCCTCTTGGAATCGCTCCTGATGCAGGGTGTTTG
TTGGCTGTTTGCACAACAGAAGGCCGTGTGAAGCTTTACCGTTCTCCTTTCTGTGATTTTAGTGCTGAATGGATTGAGATTATGGACATATCAAATAATCTTTAC
GATTATCTTGCAAGTATTAAATTTGGGGAGTTGGATGTTCCTTGCTCAAAATTTTCTGATATACCAAGAGAGGCAAATGGGAGTGTTGTCGATGTCCAAGAGCAT
TTCACAAGCGAGGACCATAAGCGAAGAAACAATGATGCATTGAACTTAAACAATGAAAGCAGTTTGAATCAACCATTGGATAAATCGAAAGAGAAACGTCCCAGG
AGGACAGGAGAAAGCTCCGTGATCCAATTTATTAGTGCACAACAATATGCTTCTCGCAGCGCAATGCTGTTGTCTCTTGTTCTTGCTTGGTCCCCAGTGATGCAG
CCATCTCATAGTGTTCATCCGCATCTGAATTCATCTGTCAGTGTTCTTGCCATAGGAGCAAAGTCTGGTGAAGTTACATTTTGGAAGGTTAATGTACCAGATTGC
TACTCCCTTGCTGAGTGCATGGTTCCAACAAGAGCTCTGCTTGTTGGGCTTCTTCAGGCACACAATTCATGGATCAACTGTATCAGCTGGGCGTTGTTTGATTCT
GATTCATCAAACCCAAAGATTTTATTGGCTACTGGGAGCACGGATGGGAGTGTGAGGATCTGGCAATGTTATTGTAAAGAACTATTAGCATCTTCAGACTCTAAT
TTTGCTTCCTTCTCCCTATTGAAGGAGGTAATCAGTGGTGAAGCAGTGCCAATTACTCTACTTTCACTCAGTGTTCCCAACTTACATGTGCATAAACTATTTTTA
GCTATTGGCAGAGGATCTGGATCACTTGAAATAAGGATATTTAACATACCTAGCAGTGAATTTGATAATGTACTGTATGGTGCACATGATCATGTTGTTACAGGT
ATAGCTTGGGGTTTTGATGCACGTTATTTGTTCACCTGCAGTGAGGATAATATTCTGTGTGGTTGGAATTTAGATGGGAGTTCTCTCCGTGGAGTGCCCATTTCA
TCACACATCCCTCATTTTGGAAGCTCAATTGATCTTCCAGATACATTTCGGTCATGCTTTGGCATTGCAGTGTCCCCAGGAAATCTTGTGGCTGCTGTGGTTCGC
AACTTTGATATTGAATCACTTGACCGAATGTATCAAGCAAGGGCTCAGAGAGCTGCTGTTCAATTCTTCTGGATTGGAGGAGAAGAAATGGAAGTCATGCCAAAC
AGTTGCTCATACTTTGATGCTGAAAAATTTCCGGATCTTTCTAAGAAGGAATTTGTTCATTGGGAATCCAGTATGTCGTGGTCTTTAAATAAATTTAAAGATCTG
AATAAGCCTATGGTTGTTTGGGATGTTGTAGCAGCCTTGGTGGCTTTTAGGCAGTCTATACCAGAATATGTCGACTACATTCTACTTAAGTGGCTTGCATCATCG
TATCTCCAATGCAACGAGGAGCTATCCGCTCCAAAGATTTTGTTGCGTGTATCAAGAAATGTGTCGACATTTTCCACTCGCCAGCTTCACCTCCTTAATGTTATT
TGTAGACGTGTAGTTCTTTCAGAATTGATGCAGGATCAAGTGAATAACGAACTGCAGAACTTGGAAAAACTTAATGACGATGAAGATGAAAAGGATATTTTGTGG
AAAGAGTTGCTTTTAAGCAGTGAAAGAGACCTCCGTCAGAGGCTAATCGGTCTTAGTCTTTTTGCCTGTGCAAAGCTTCCTTCACTGTCCATTACCAAATATCGA
CCTGGATACTGGTATCCCATTGGATTAGTTGAAATGCAGCAGTGGGTTAGATATAATCATGAAAATTTACGTGAATCGGTAAAAGGCATTGCATCAGAAGAAGCT
GGAATACACCATAGGAGCGAGCATTCAGCAGTGGAGCAGTGTACCTTCTGTTCAGCATCGGTTCCATTCGAGTCTCCCGAGTTCGGGTTTTGCCAGGGCGTTAAG
TGCAGTAGTGGAGCTGGTCAGACTCACAAACTAGTAAGATTGCCTCCAGATATACTTTTCCAGATGTCTGAGATCCCTGACTTCAGTTCGTTAACACTATCTGAT
TCTGAGATACCCTCGAAACCATTGTGTCCCTTTTGCGGTATACTGCTACAACGTCGACAACCAGATTTTCTACTGTCACCTTGCCCGGTATTCCAGAAAATGGGA
TTAGCAATTCCACCACGCTTATGGTTCCGTCGCAACCTGATCATTACAACAAATTCCGAATTTTTTAATGCTCAGTGGCTCAGTTCAGTTCCGTGGCAATGGAGT
GGAGCTCGCTAA
mRNA sequence
ATGGTGGAATCATATTTTCAGGGTGTCACGTTGAGCACTGCCCCAATATACCCAAATGCCATTGCATGGTCTGATGAGAATTTAATCGCCGTTGCCTCCGGCCCC
CTTGTCACTATACTGAATCCGACATCACCTTTTGGAGCACGAGGCACTATTACAATCCCTGAAAGTAATCCATTTCAAATAGGGGTGATAGAAAGGAAAGATTTA
TTTTCTAGTTGCTTGTTGCCAACTTGCTTAACTCGGGACATCCAACCTTCTGTGCGGTCCATCGCATGGTCTCCTCTTGGAATCGCTCCTGATGCAGGGTGTTTG
TTGGCTGTTTGCACAACAGAAGGCCGTGTGAAGCTTTACCGTTCTCCTTTCTGTGATTTTAGTGCTGAATGGATTGAGATTATGGACATATCAAATAATCTTTAC
GATTATCTTGCAAGTATTAAATTTGGGGAGTTGGATGTTCCTTGCTCAAAATTTTCTGATATACCAAGAGAGGCAAATGGGAGTGTTGTCGATGTCCAAGAGCAT
TTCACAAGCGAGGACCATAAGCGAAGAAACAATGATGCATTGAACTTAAACAATGAAAGCAGTTTGAATCAACCATTGGATAAATCGAAAGAGAAACGTCCCAGG
AGGACAGGAGAAAGCTCCGTGATCCAATTTATTAGTGCACAACAATATGCTTCTCGCAGCGCAATGCTGTTGTCTCTTGTTCTTGCTTGGTCCCCAGTGATGCAG
CCATCTCATAGTGTTCATCCGCATCTGAATTCATCTGTCAGTGTTCTTGCCATAGGAGCAAAGTCTGGTGAAGTTACATTTTGGAAGGTTAATGTACCAGATTGC
TACTCCCTTGCTGAGTGCATGGTTCCAACAAGAGCTCTGCTTGTTGGGCTTCTTCAGGCACACAATTCATGGATCAACTGTATCAGCTGGGCGTTGTTTGATTCT
GATTCATCAAACCCAAAGATTTTATTGGCTACTGGGAGCACGGATGGGAGTGTGAGGATCTGGCAATGTTATTGTAAAGAACTATTAGCATCTTCAGACTCTAAT
TTTGCTTCCTTCTCCCTATTGAAGGAGGTAATCAGTGGTGAAGCAGTGCCAATTACTCTACTTTCACTCAGTGTTCCCAACTTACATGTGCATAAACTATTTTTA
GCTATTGGCAGAGGATCTGGATCACTTGAAATAAGGATATTTAACATACCTAGCAGTGAATTTGATAATGTACTGTATGGTGCACATGATCATGTTGTTACAGGT
ATAGCTTGGGGTTTTGATGCACGTTATTTGTTCACCTGCAGTGAGGATAATATTCTGTGTGGTTGGAATTTAGATGGGAGTTCTCTCCGTGGAGTGCCCATTTCA
TCACACATCCCTCATTTTGGAAGCTCAATTGATCTTCCAGATACATTTCGGTCATGCTTTGGCATTGCAGTGTCCCCAGGAAATCTTGTGGCTGCTGTGGTTCGC
AACTTTGATATTGAATCACTTGACCGAATGTATCAAGCAAGGGCTCAGAGAGCTGCTGTTCAATTCTTCTGGATTGGAGGAGAAGAAATGGAAGTCATGCCAAAC
AGTTGCTCATACTTTGATGCTGAAAAATTTCCGGATCTTTCTAAGAAGGAATTTGTTCATTGGGAATCCAGTATGTCGTGGTCTTTAAATAAATTTAAAGATCTG
AATAAGCCTATGGTTGTTTGGGATGTTGTAGCAGCCTTGGTGGCTTTTAGGCAGTCTATACCAGAATATGTCGACTACATTCTACTTAAGTGGCTTGCATCATCG
TATCTCCAATGCAACGAGGAGCTATCCGCTCCAAAGATTTTGTTGCGTGTATCAAGAAATGTGTCGACATTTTCCACTCGCCAGCTTCACCTCCTTAATGTTATT
TGTAGACGTGTAGTTCTTTCAGAATTGATGCAGGATCAAGTGAATAACGAACTGCAGAACTTGGAAAAACTTAATGACGATGAAGATGAAAAGGATATTTTGTGG
AAAGAGTTGCTTTTAAGCAGTGAAAGAGACCTCCGTCAGAGGCTAATCGGTCTTAGTCTTTTTGCCTGTGCAAAGCTTCCTTCACTGTCCATTACCAAATATCGA
CCTGGATACTGGTATCCCATTGGATTAGTTGAAATGCAGCAGTGGGTTAGATATAATCATGAAAATTTACGTGAATCGGTAAAAGGCATTGCATCAGAAGAAGCT
GGAATACACCATAGGAGCGAGCATTCAGCAGTGGAGCAGTGTACCTTCTGTTCAGCATCGGTTCCATTCGAGTCTCCCGAGTTCGGGTTTTGCCAGGGCGTTAAG
TGCAGTAGTGGAGCTGGTCAGACTCACAAACTAGTAAGATTGCCTCCAGATATACTTTTCCAGATGTCTGAGATCCCTGACTTCAGTTCGTTAACACTATCTGAT
TCTGAGATACCCTCGAAACCATTGTGTCCCTTTTGCGGTATACTGCTACAACGTCGACAACCAGATTTTCTACTGTCACCTTGCCCGGTATTCCAGAAAATGGGA
TTAGCAATTCCACCACGCTTATGGTTCCGTCGCAACCTGATCATTACAACAAATTCCGAATTTTTTAATGCTCAGTGGCTCAGTTCAGTTCCGTGGCAATGGAGT
GGAGCTCGCTAA
Protein sequence
MVESYFQGVTLSTAPIYPNAIAWSDENLIAVASGPLVTILNPTSPFGARGTITIPESNPFQIGVIERKDLFSSCLLPTCLTRDIQPSVRSIAWSPLGIAPDAGCL
LAVCTTEGRVKLYRSPFCDFSAEWIEIMDISNNLYDYLASIKFGELDVPCSKFSDIPREANGSVVDVQEHFTSEDHKRRNNDALNLNNESSLNQPLDKSKEKRPR
RTGESSVIQFISAQQYASRSAMLLSLVLAWSPVMQPSHSVHPHLNSSVSVLAIGAKSGEVTFWKVNVPDCYSLAECMVPTRALLVGLLQAHNSWINCISWALFDS
DSSNPKILLATGSTDGSVRIWQCYCKELLASSDSNFASFSLLKEVISGEAVPITLLSLSVPNLHVHKLFLAIGRGSGSLEIRIFNIPSSEFDNVLYGAHDHVVTG
IAWGFDARYLFTCSEDNILCGWNLDGSSLRGVPISSHIPHFGSSIDLPDTFRSCFGIAVSPGNLVAAVVRNFDIESLDRMYQARAQRAAVQFFWIGGEEMEVMPN
SCSYFDAEKFPDLSKKEFVHWESSMSWSLNKFKDLNKPMVVWDVVAALVAFRQSIPEYVDYILLKWLASSYLQCNEELSAPKILLRVSRNVSTFSTRQLHLLNVI
CRRVVLSELMQDQVNNELQNLEKLNDDEDEKDILWKELLLSSERDLRQRLIGLSLFACAKLPSLSITKYRPGYWYPIGLVEMQQWVRYNHENLRESVKGIASEEA
GIHHRSEHSAVEQCTFCSASVPFESPEFGFCQGVKCSSGAGQTHKLVRLPPDILFQMSEIPDFSSLTLSDSEIPSKPLCPFCGILLQRRQPDFLLSPCPVFQKMG
LAIPPRLWFRRNLIITTNSEFFNAQWLSSVPWQWSGAR