; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS010571 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS010571
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUbiquitin-conjugating enzyme E2C-binding protein
Genome locationscaffold35:637934..650444
RNA-Seq ExpressionMS010571
SyntenyMS010571
Gene Ontology termsGO:0000209 - protein polyubiquitination (biological process)
GO:0006513 - protein monoubiquitination (biological process)
GO:0043161 - proteasome-mediated ubiquitin-dependent protein catabolic process (biological process)
GO:0051865 - protein autoubiquitination (biological process)
GO:0000151 - ubiquitin ligase complex (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0005829 - cytosol (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030332 - cyclin binding (molecular function)
GO:0031624 - ubiquitin conjugating enzyme binding (molecular function)
GO:0061630 - ubiquitin protein ligase activity (molecular function)
InterPro domainsIPR019193 - Ubiquitin-conjugating enzyme E2-binding protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149986.1 uncharacterized protein LOC101204887 [Cucumis sativus]1.7e-25073.67Show/hide
Query:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL
        MSSE  TV++PRKWRFTWEAQSHIP LRLLLFDS TNPSLQC+NLKV LNL QSVVC  WLQDL++SIRVP+PPVLVD++SPLSFRAFEDHIEVKL LLL
Subjt:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL

Query:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPI+LNFDNVL+ S+E+G  +SKASKPL MD        SG + F              +    MPSVNWREVADNWFGSCCCSFGGISEKLV RYT
Subjt:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP
        NSYRC KGVCLLTLTTITLSKDD+IGHVFPD +GT+Q KDESDFADG+ LTEAK+E  CN TS +KVK KQ N+K+L ANMEG   EK  +EVDSP +TP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP

Query:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCCHH ESNVL+HLD+DCMHHTC T K DPKP+N +D+SDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCP+CSTLIGAYP  NGCGPTDGGVR F
Subjt:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV
        KCYVSTC + ESGNLL                                            REYTLERMFANQLLESA++ESSFRT+VKELKTKSPMLHIV
Subjt:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV

Query:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR
        LINS SWSCSGYCLGMEDTAE V K+DL+P+IKVLFSDC+KSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEIL S NDTLPSSCSSLDGLTLTSILR
Subjt:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR

XP_008440769.1 PREDICTED: uncharacterized protein LOC103485086 [Cucumis melo]1.5e-25173.67Show/hide
Query:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL
        MSSEF+TV++P KWRFTWEAQSHIP LRLLLFDS+TNPSL+C+NL VHLNL QSVVC  W QDL +SIRVP+PPVLVD+ESPLSFRAF+DHIEVKL LLL
Subjt:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL

Query:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPI+LNFDNVL+ S+E+GN +SKASKPL MD        SG + F              +    MPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
Subjt:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP
        NSYRC KGVCLLTLTTITLSKDD+IGHVFPD +GT++FKDESDFADG+ LTEAK+E  CN TS +KVK KQ N+K L ANMEG A +K  +EVDSP +TP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP

Query:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCC H ESNVL+HLD DCMHHTC T KLDPKPIN +D+SDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCP+CSTLIGAYP  NGCGPTDGGVR F
Subjt:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV
        KCYVSTC + ESGNLL                                            REYTLERMFANQLLESA +ESSFRTVVKELKTKSPMLHIV
Subjt:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV

Query:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR
        LINS SWSCSGYCLGMEDTAE V K+DL+P+IKVLFSDC+KSAESHLRKLEEWVTKDIADEVFMLAHQ+E+LVEIL S NDTLPSSCSSLDGLTLTSILR
Subjt:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR

XP_022133273.1 uncharacterized protein LOC111005900 [Momordica charantia]7.4e-29987.5Show/hide
Query:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL
        MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL
Subjt:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL

Query:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMDS--------GALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMDS        G + F              ++   MPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
Subjt:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMDS--------GALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP
        NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP

Query:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSN CGPTDGGVRLF
Subjt:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV
        KCYVSTCSSVESGNLL                                            REYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV
Subjt:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV

Query:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR
        LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR
Subjt:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR

XP_023543348.1 uncharacterized protein LOC111803252 [Cucurbita pepo subsp. pepo]1.7e-25074.17Show/hide
Query:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL
        M SE  +V++PRKWRFTWEAQSHIPTLRLLLFDS+TNPSLQCQNLKVHLNL QSVVC  WLQDLE+SIRVP+PPVLVD+ESPLSFRAFEDHIEVKL LLL
Subjt:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL

Query:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPI+LNFDNVL+ SE RG+  SKA KPL MD        SG + F              ++   MPSVNWREVADNWFG+CCCSFGGISEKLVTRYT
Subjt:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP
        NSYRCAKGVCLLTLTTITLSKDD+IGHVFPDYDGTR+ KDESDF DGNWLTEAKQE QCN TS ++VK KQ N K L A  EG+A  K  +EVDSP +T 
Subjt:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP

Query:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF
        IPD   HGESNVL+ LDRDCMHHTC TY+LDPKPINT+D+SDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCP+CSTLIGAYPC NGCGPTDGGVRLF
Subjt:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV
        KCYVSTC S ES NL                                             R+YTLE+MFA+QLLESAN+ESSFRTVVKELKTKS MLHIV
Subjt:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV

Query:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR
        LINS SWSCSGYCLGMEDTAE V K+DL+P+IKVLFSDC+KSAESHLRKLEEWVTKDIA+EVFMLAHQIEEL EIL S NDTLPSSCSSLDGLTLTSILR
Subjt:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR

XP_038883816.1 uncharacterized protein LOC120074678 isoform X1 [Benincasa hispida]1.2e-25675.33Show/hide
Query:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL
        MS E DTV+SPRKWRFTWEAQSHIP LRLLLFDS+TNPSLQCQNLKVHLNL QSVVC  WLQDL++SIRVP+PPVLVD+ESPLSFRAFEDHIEVKL LLL
Subjt:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL

Query:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPI+LNFDNVL+  +ERGN +SKA+KPL MD        SG + F              +    MPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
Subjt:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP
        NSYRCAKGVCLLTLTTITLSKDD+ GHVFPDYDGTR+FKDESD  DGN LTEAKQE  CN TS +KVK KQ N K   A+MEG+A EK  EEVDSP +TP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP

Query:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF
         PDCCHH ES+VL+HLDRDCMHHTC TY LDPKPIN++D+SDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCP+CSTLIGAYPC  GCGPTD GVRLF
Subjt:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV
        KCYVSTC S ESGNLL                                            REYTLERMFANQLLESAN+ESSFRTVVKELKTK PMLHIV
Subjt:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV

Query:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR
        LINS SWSCSGYCLGMED AE V K+DL+P+IKVLFSDC+KSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEIL S NDTLPSSCSSLDGLTLTSILR
Subjt:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR

TrEMBL top hitse value%identityAlignment
A0A0A0KKI8 Uncharacterized protein8.1e-25173.67Show/hide
Query:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL
        MSSE  TV++PRKWRFTWEAQSHIP LRLLLFDS TNPSLQC+NLKV LNL QSVVC  WLQDL++SIRVP+PPVLVD++SPLSFRAFEDHIEVKL LLL
Subjt:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL

Query:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPI+LNFDNVL+ S+E+G  +SKASKPL MD        SG + F              +    MPSVNWREVADNWFGSCCCSFGGISEKLV RYT
Subjt:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP
        NSYRC KGVCLLTLTTITLSKDD+IGHVFPD +GT+Q KDESDFADG+ LTEAK+E  CN TS +KVK KQ N+K+L ANMEG   EK  +EVDSP +TP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP

Query:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCCHH ESNVL+HLD+DCMHHTC T K DPKP+N +D+SDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCP+CSTLIGAYP  NGCGPTDGGVR F
Subjt:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV
        KCYVSTC + ESGNLL                                            REYTLERMFANQLLESA++ESSFRT+VKELKTKSPMLHIV
Subjt:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV

Query:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR
        LINS SWSCSGYCLGMEDTAE V K+DL+P+IKVLFSDC+KSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEIL S NDTLPSSCSSLDGLTLTSILR
Subjt:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR

A0A1S3B1W7 uncharacterized protein LOC1034850867.3e-25273.67Show/hide
Query:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL
        MSSEF+TV++P KWRFTWEAQSHIP LRLLLFDS+TNPSL+C+NL VHLNL QSVVC  W QDL +SIRVP+PPVLVD+ESPLSFRAF+DHIEVKL LLL
Subjt:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL

Query:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPI+LNFDNVL+ S+E+GN +SKASKPL MD        SG + F              +    MPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
Subjt:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP
        NSYRC KGVCLLTLTTITLSKDD+IGHVFPD +GT++FKDESDFADG+ LTEAK+E  CN TS +KVK KQ N+K L ANMEG A +K  +EVDSP +TP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP

Query:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCC H ESNVL+HLD DCMHHTC T KLDPKPIN +D+SDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCP+CSTLIGAYP  NGCGPTDGGVR F
Subjt:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV
        KCYVSTC + ESGNLL                                            REYTLERMFANQLLESA +ESSFRTVVKELKTKSPMLHIV
Subjt:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV

Query:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR
        LINS SWSCSGYCLGMEDTAE V K+DL+P+IKVLFSDC+KSAESHLRKLEEWVTKDIADEVFMLAHQ+E+LVEIL S NDTLPSSCSSLDGLTLTSILR
Subjt:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR

A0A5A7SM17 Ubiquitin-conjugating enzyme E2C-binding protein7.3e-25273.67Show/hide
Query:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL
        MSSEF+TV++P KWRFTWEAQSHIP LRLLLFDS+TNPSL+C+NL VHLNL QSVVC  W QDL +SIRVP+PPVLVD+ESPLSFRAF+DHIEVKL LLL
Subjt:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL

Query:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPI+LNFDNVL+ S+E+GN +SKASKPL MD        SG + F              +    MPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
Subjt:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP
        NSYRC KGVCLLTLTTITLSKDD+IGHVFPD +GT++FKDESDFADG+ LTEAK+E  CN TS +KVK KQ N+K L ANMEG A +K  +EVDSP +TP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP

Query:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCC H ESNVL+HLD DCMHHTC T KLDPKPIN +D+SDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCP+CSTLIGAYP  NGCGPTDGGVR F
Subjt:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV
        KCYVSTC + ESGNLL                                            REYTLERMFANQLLESA +ESSFRTVVKELKTKSPMLHIV
Subjt:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV

Query:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR
        LINS SWSCSGYCLGMEDTAE V K+DL+P+IKVLFSDC+KSAESHLRKLEEWVTKDIADEVFMLAHQ+E+LVEIL S NDTLPSSCSSLDGLTLTSILR
Subjt:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR

A0A6J1BYQ5 uncharacterized protein LOC1110059003.6e-29987.5Show/hide
Query:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL
        MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL
Subjt:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL

Query:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMDS--------GALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMDS        G + F              ++   MPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
Subjt:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMDS--------GALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP
        NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP

Query:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSN CGPTDGGVRLF
Subjt:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV
        KCYVSTCSSVESGNLL                                            REYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV
Subjt:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV

Query:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR
        LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR
Subjt:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR

A0A6J1IL55 uncharacterized protein LOC1114784313.4e-24973.5Show/hide
Query:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL
        M SE D+V+SPRKWRFTWEAQSHIPTLRLLLFDS+TNPSLQCQNLKVHLNL QSVVC  WLQD+E+SIRVP+PPVLVD+ESPLSFRAFE+HIEVKL LLL
Subjt:  MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLL

Query:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPI+LNFDNVL+ SE+RG+  SKA KPL MD        SG + F              ++   MPSVNWREVADNWFG+CCCSFGG+SEKLVTRYT
Subjt:  PVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMD--------SGALQF--------------QSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP
        NSYRCAKGVCLLTLTTITLSKDD+IGH FPDYDGTR+ K+ESDF DGNWLTEAKQE QCN TS  +VK KQ N K L A  EG+A+ K  +EVDSP +T 
Subjt:  NSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTP

Query:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF
        IPD   HGESNVL+ LDRDCMHHTC TY+LDPKP+NT+D+SDDQ SFLNGFLGNIFMARLSNLSADFEW EFFCP+CSTLIGAYPC NGCGPTDGGVRLF
Subjt:  IPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV
        KCYVSTC S ES NL                                             REYTLE+MFA+QLLESAN+ESSFRTVVKELKTKS MLHIV
Subjt:  KCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIV

Query:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR
        LINS SWSCSGYCLGMEDTAE V K+DL+P+IKVLFSDC+KSAESHLRKLEEWVTKDIA+EVFMLAHQIEEL EIL S NDTLPSSCSSLDGLTLTSILR
Subjt:  LINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSILR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G26750.1 CONTAINS InterPro DOMAIN/s: Ubiquitin-conjugating enzyme E2C-binding protein (InterPro:IPR019193); Has 26 Blast hits to 25 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 26; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.4e-10137.37Show/hide
Query:  KSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLE---------VSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLL
        K+ R WR+TWEAQSH P LRL LFDS TNP + C++L V   + +S +  TW+ + +         VS+ VPIP VL+D+ESP++F+A +DHIEV+L LL
Subjt:  KSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLE---------VSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLL

Query:  LPVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMDSGALQFQ--------------SICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAK
        LPVDHP+V +F+ V +S E+            L   G + F                   MPS+NWRE ADNWFG+CCCSFGGISEK+V +YTNSY C+ 
Subjt:  LPVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMDSGALQFQ--------------SICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAK

Query:  GVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKK-VKPKQSNDKTLAANMEGDATEKEREEVDSPNMTPIPDCCH
        G+CLL+ TT+ LSKDD++  +  +  GT     E +F       E+   L C++  ++   +  + N ++  +  E    + +  +    +   +P CC 
Subjt:  GVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKK-VKPKQSNDKTLAANMEGDATEKEREEVDSPNMTPIPDCCH

Query:  HGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCG--PTDGGVRLFKCYV
        H   +            +  + +L+ K      L+ D++  L+GFL ++FMA+ SN+S + EW+EF CP+CS+ +GAYP   G    P DGGVRLFKCY+
Subjt:  HGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCG--PTDGGVRLFKCYV

Query:  STCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIVLINS
        ST S                              T  +   +F             R+YTLERMF NQL+E + +E SF  +VK+L TKSP+ +IV++N 
Subjt:  STCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHREYTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIVLINS

Query:  YSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSI
         ++S +G C   +   E  S ++LS ++KVLFSDC+ S           V K I +EV++L  Q EEL+E++ + +  LPSSCS L G  ++S+
Subjt:  YSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILASGNDTLPSSCSSLDGLTLTSI

AT4G36440.1 unknown protein3.4e-10060.82Show/hide
Query:  GLGCICNITYESNCRVIIDLAIPCEIQGPRVFKGFTVGFHPRSWEIVYNGLTQLGFEKPHHAFGFSTEQTRVVLYMTAISSLSSLVHRPIIQVFPEIGLD
        GLGCIC++T +S CRV +DLAIPCE  GPRVFKGFTVG HPRSWEI+YNG+TQ GF+KP   F F TEQT + LYMTAI+SLS+LV +PII+V PE GLD
Subjt:  GLGCICNITYESNCRVIIDLAIPCEIQGPRVFKGFTVGFHPRSWEIVYNGLTQLGFEKPHHAFGFSTEQTRVVLYMTAISSLSSLVHRPIIQVFPEIGLD

Query:  VKVSGSGATGSYPTTLSPSMLMIDWRCDIARDIPYEVNITVPVADYEPISFFLTKMCENRQDRPGESMKGWATFGILSCIFMVVASLLCCGGFVYKAKVQ
        VK++GS  TG++PTTLSPS L++DW C+ +R  PYEVN+T+PV  Y+P+ FFLTK+CE  Q   G S KGWA FG+ SC+F+V ++L CCGGF+YK +V+
Subjt:  VKVSGSGATGSYPTTLSPSMLMIDWRCDIARDIPYEVNITVPVADYEPISFFLTKMCENRQDRPGESMKGWATFGILSCIFMVVASLLCCGGFVYKAKVQ

Query:  GQHGIDALPGMTLLSACLETVSGGGQSYPRAEGVNDAFVSDPSWEHPPSSSRRTWTA---SEKNYGSI
           G DALPGM+LLS  LETVSG GQSY R E +N+AF ++ SW+   +SS +  T    SE+ YG+I
Subjt:  GQHGIDALPGMTLLSACLETVSGGGQSYPRAEGVNDAFVSDPSWEHPPSSSRRTWTA---SEKNYGSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCTGAATTCGATACAGTGAAAAGCCCTAGAAAATGGCGTTTTACATGGGAAGCGCAATCCCACATACCAACCCTACGTCTGTTGCTATTCGATTCCCATACCAA
TCCTTCTCTCCAATGTCAGAATCTCAAGGTTCATCTCAATCTCCCGCAGTCCGTCGTTTGCGCCACTTGGCTCCAAGACCTCGAAGTGTCGATTCGAGTTCCTATTCCTC
CGGTTTTGGTTGACTCTGAGTCGCCCTTGAGTTTTAGAGCTTTCGAAGATCATATCGAAGTCAAGCTCTTCTTGCTTCTTCCGGTCGATCACCCAATTGTTCTCAACTTC
GACAATGTGCTGAACTCCTCCGAAGAGCGAGGAAATAAGTACTCCAAGGCGTCGAAGCCGCTTTTGATGGACTCTGGTGCGCTTCAATTTCAATCGATTTGTTCTATGCC
ATCAGTCAATTGGCGAGAGGTTGCTGATAACTGGTTTGGGTCTTGCTGCTGCTCCTTTGGGGGGATAAGCGAGAAGCTGGTAACTAGGTATACAAATTCCTATAGATGTG
CAAAGGGTGTCTGCCTACTCACTTTAACAACTATTACTCTTTCCAAGGATGACATTATTGGACATGTGTTCCCAGACTATGATGGGACCCGGCAATTCAAGGACGAATCA
GATTTTGCTGATGGCAATTGGTTAACGGAAGCTAAGCAGGAATTACAATGTAATCTTACATCTATGAAGAAGGTAAAACCTAAGCAGTCTAATGATAAAACCCTTGCTGC
AAACATGGAGGGTGATGCTACTGAGAAAGAAAGGGAAGAAGTTGATTCACCTAATATGACTCCAATTCCTGATTGTTGTCATCATGGTGAAAGTAATGTATTAAATCATC
TTGACAGAGACTGCATGCATCACACATGTAGCACGTATAAGTTAGACCCAAAGCCTATTAATACTATTGATCTTTCAGACGATCAGAGATCCTTTCTTAATGGTTTTCTT
GGAAATATCTTTATGGCTAGACTGTCAAATCTTTCAGCAGATTTTGAGTGGGTTGAGTTTTTTTGCCCCAAGTGCTCTACTCTGATTGGGGCTTACCCTTGCAGTAATGG
CTGCGGACCTACAGATGGTGGAGTTCGACTCTTTAAATGTTATGTCTCAACATGTTCATCAGTTGAATCTGGAAATTTGTTGAGGATATTATATTACTATGTCATGCCTT
TCTTTTGGTTAAGTCTCTGGAAATCATGTCTGCATAATACAATAAGTGACTTTGAGATAATTTTTATCTCTTGGAGTGCTCAAGACTATCTGACTTATGCGCACAGGGAG
TACACCTTGGAAAGAATGTTTGCAAATCAGCTACTGGAAAGTGCAAATGACGAATCATCATTTCGCACTGTGGTTAAGGAGCTGAAAACCAAGTCTCCCATGCTACACAT
TGTTCTCATCAATTCATATTCTTGGTCGTGTAGTGGTTATTGTTTGGGCATGGAGGATACAGCTGAATCAGTTTCAAAGATTGATTTAAGTCCTGTCATCAAGGTGCTAT
TCTCTGATTGCAGCAAAAGTGCGGAGTCCCATTTGAGGAAACTTGAAGAGTGGGTAACAAAAGATATAGCGGATGAAGTTTTTATGTTAGCCCATCAAATAGAGGAATTA
GTTGAAATCCTAGCTTCAGGAAATGATACACTTCCATCTTCATGTTCTTCCCTTGATGGTTTAACTTTGACATCTATCCTGAGAAAAAGAACTTCTTATAACTTTTTATG
TGTTATGGACCTTGACATCACAATCGTTCCAATTTCCACCCGTCAGGATTATTACAATGGCGACCTGACTTCTTGTGGTCTGGGATGCATCTGCAATATCACTTATGAGT
CCAATTGCAGAGTTATTATTGATCTTGCCATCCCTTGTGAGATACAAGGTCCACGTGTTTTCAAAGGATTTACTGTTGGTTTCCACCCTCGATCCTGGGAAATTGTTTAC
AATGGTTTGACTCAATTAGGCTTCGAGAAGCCACACCATGCATTCGGCTTTAGCACAGAGCAGACTCGTGTGGTTCTTTATATGACTGCAATTTCATCACTTTCCTCTTT
GGTACATAGACCAATCATTCAGGTTTTTCCAGAAATTGGACTAGATGTGAAAGTATCAGGCTCAGGGGCAACTGGGAGCTACCCTACAACTTTGTCACCCTCCATGTTGA
TGATTGACTGGAGATGTGATATTGCCAGGGACATTCCATATGAAGTTAACATCACGGTCCCTGTGGCTGATTATGAACCAATTAGTTTTTTTCTTACCAAAATGTGTGAA
AATAGGCAGGACCGACCAGGAGAATCTATGAAAGGATGGGCGACATTTGGGATACTCTCTTGCATATTCATGGTCGTAGCATCACTACTTTGTTGTGGAGGGTTTGTTTA
TAAGGCCAAAGTGCAAGGCCAGCATGGAATCGATGCATTACCCGGCATGACACTGTTATCCGCTTGCTTGGAAACTGTAAGTGGAGGAGGACAAAGCTACCCGAGAGCGG
AAGGCGTCAACGACGCGTTCGTCAGTGATCCCTCCTGGGAACACCCACCATCTTCTTCTCGACGGACATGGACAGCATCTGAGAAAAATTATGGTTCAATA
mRNA sequenceShow/hide mRNA sequence
ATGTCATCTGAATTCGATACAGTGAAAAGCCCTAGAAAATGGCGTTTTACATGGGAAGCGCAATCCCACATACCAACCCTACGTCTGTTGCTATTCGATTCCCATACCAA
TCCTTCTCTCCAATGTCAGAATCTCAAGGTTCATCTCAATCTCCCGCAGTCCGTCGTTTGCGCCACTTGGCTCCAAGACCTCGAAGTGTCGATTCGAGTTCCTATTCCTC
CGGTTTTGGTTGACTCTGAGTCGCCCTTGAGTTTTAGAGCTTTCGAAGATCATATCGAAGTCAAGCTCTTCTTGCTTCTTCCGGTCGATCACCCAATTGTTCTCAACTTC
GACAATGTGCTGAACTCCTCCGAAGAGCGAGGAAATAAGTACTCCAAGGCGTCGAAGCCGCTTTTGATGGACTCTGGTGCGCTTCAATTTCAATCGATTTGTTCTATGCC
ATCAGTCAATTGGCGAGAGGTTGCTGATAACTGGTTTGGGTCTTGCTGCTGCTCCTTTGGGGGGATAAGCGAGAAGCTGGTAACTAGGTATACAAATTCCTATAGATGTG
CAAAGGGTGTCTGCCTACTCACTTTAACAACTATTACTCTTTCCAAGGATGACATTATTGGACATGTGTTCCCAGACTATGATGGGACCCGGCAATTCAAGGACGAATCA
GATTTTGCTGATGGCAATTGGTTAACGGAAGCTAAGCAGGAATTACAATGTAATCTTACATCTATGAAGAAGGTAAAACCTAAGCAGTCTAATGATAAAACCCTTGCTGC
AAACATGGAGGGTGATGCTACTGAGAAAGAAAGGGAAGAAGTTGATTCACCTAATATGACTCCAATTCCTGATTGTTGTCATCATGGTGAAAGTAATGTATTAAATCATC
TTGACAGAGACTGCATGCATCACACATGTAGCACGTATAAGTTAGACCCAAAGCCTATTAATACTATTGATCTTTCAGACGATCAGAGATCCTTTCTTAATGGTTTTCTT
GGAAATATCTTTATGGCTAGACTGTCAAATCTTTCAGCAGATTTTGAGTGGGTTGAGTTTTTTTGCCCCAAGTGCTCTACTCTGATTGGGGCTTACCCTTGCAGTAATGG
CTGCGGACCTACAGATGGTGGAGTTCGACTCTTTAAATGTTATGTCTCAACATGTTCATCAGTTGAATCTGGAAATTTGTTGAGGATATTATATTACTATGTCATGCCTT
TCTTTTGGTTAAGTCTCTGGAAATCATGTCTGCATAATACAATAAGTGACTTTGAGATAATTTTTATCTCTTGGAGTGCTCAAGACTATCTGACTTATGCGCACAGGGAG
TACACCTTGGAAAGAATGTTTGCAAATCAGCTACTGGAAAGTGCAAATGACGAATCATCATTTCGCACTGTGGTTAAGGAGCTGAAAACCAAGTCTCCCATGCTACACAT
TGTTCTCATCAATTCATATTCTTGGTCGTGTAGTGGTTATTGTTTGGGCATGGAGGATACAGCTGAATCAGTTTCAAAGATTGATTTAAGTCCTGTCATCAAGGTGCTAT
TCTCTGATTGCAGCAAAAGTGCGGAGTCCCATTTGAGGAAACTTGAAGAGTGGGTAACAAAAGATATAGCGGATGAAGTTTTTATGTTAGCCCATCAAATAGAGGAATTA
GTTGAAATCCTAGCTTCAGGAAATGATACACTTCCATCTTCATGTTCTTCCCTTGATGGTTTAACTTTGACATCTATCCTGAGAAAAAGAACTTCTTATAACTTTTTATG
TGTTATGGACCTTGACATCACAATCGTTCCAATTTCCACCCGTCAGGATTATTACAATGGCGACCTGACTTCTTGTGGTCTGGGATGCATCTGCAATATCACTTATGAGT
CCAATTGCAGAGTTATTATTGATCTTGCCATCCCTTGTGAGATACAAGGTCCACGTGTTTTCAAAGGATTTACTGTTGGTTTCCACCCTCGATCCTGGGAAATTGTTTAC
AATGGTTTGACTCAATTAGGCTTCGAGAAGCCACACCATGCATTCGGCTTTAGCACAGAGCAGACTCGTGTGGTTCTTTATATGACTGCAATTTCATCACTTTCCTCTTT
GGTACATAGACCAATCATTCAGGTTTTTCCAGAAATTGGACTAGATGTGAAAGTATCAGGCTCAGGGGCAACTGGGAGCTACCCTACAACTTTGTCACCCTCCATGTTGA
TGATTGACTGGAGATGTGATATTGCCAGGGACATTCCATATGAAGTTAACATCACGGTCCCTGTGGCTGATTATGAACCAATTAGTTTTTTTCTTACCAAAATGTGTGAA
AATAGGCAGGACCGACCAGGAGAATCTATGAAAGGATGGGCGACATTTGGGATACTCTCTTGCATATTCATGGTCGTAGCATCACTACTTTGTTGTGGAGGGTTTGTTTA
TAAGGCCAAAGTGCAAGGCCAGCATGGAATCGATGCATTACCCGGCATGACACTGTTATCCGCTTGCTTGGAAACTGTAAGTGGAGGAGGACAAAGCTACCCGAGAGCGG
AAGGCGTCAACGACGCGTTCGTCAGTGATCCCTCCTGGGAACACCCACCATCTTCTTCTCGACGGACATGGACAGCATCTGAGAAAAATTATGGTTCAATA
Protein sequenceShow/hide protein sequence
MSSEFDTVKSPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFEDHIEVKLFLLLPVDHPIVLNF
DNVLNSSEERGNKYSKASKPLLMDSGALQFQSICSMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGVCLLTLTTITLSKDDIIGHVFPDYDGTRQFKDES
DFADGNWLTEAKQELQCNLTSMKKVKPKQSNDKTLAANMEGDATEKEREEVDSPNMTPIPDCCHHGESNVLNHLDRDCMHHTCSTYKLDPKPINTIDLSDDQRSFLNGFL
GNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNGCGPTDGGVRLFKCYVSTCSSVESGNLLRILYYYVMPFFWLSLWKSCLHNTISDFEIIFISWSAQDYLTYAHRE
YTLERMFANQLLESANDESSFRTVVKELKTKSPMLHIVLINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKLEEWVTKDIADEVFMLAHQIEEL
VEILASGNDTLPSSCSSLDGLTLTSILRKRTSYNFLCVMDLDITIVPISTRQDYYNGDLTSCGLGCICNITYESNCRVIIDLAIPCEIQGPRVFKGFTVGFHPRSWEIVY
NGLTQLGFEKPHHAFGFSTEQTRVVLYMTAISSLSSLVHRPIIQVFPEIGLDVKVSGSGATGSYPTTLSPSMLMIDWRCDIARDIPYEVNITVPVADYEPISFFLTKMCE
NRQDRPGESMKGWATFGILSCIFMVVASLLCCGGFVYKAKVQGQHGIDALPGMTLLSACLETVSGGGQSYPRAEGVNDAFVSDPSWEHPPSSSRRTWTASEKNYGSI