; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0013450 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0013450
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUbiquitin-conjugating enzyme E2-binding protein
Genome locationchr1:50384998..50397183
RNA-Seq ExpressionLag0013450
SyntenyLag0013450
Gene Ontology termsGO:0000209 - protein polyubiquitination (biological process)
GO:0006513 - protein monoubiquitination (biological process)
GO:0043161 - proteasome-mediated ubiquitin-dependent protein catabolic process (biological process)
GO:0051865 - protein autoubiquitination (biological process)
GO:0000151 - ubiquitin ligase complex (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0005829 - cytosol (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030332 - cyclin binding (molecular function)
GO:0031624 - ubiquitin conjugating enzyme binding (molecular function)
GO:0061630 - ubiquitin protein ligase activity (molecular function)
InterPro domainsIPR019193 - Ubiquitin-conjugating enzyme E2-binding protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149986.1 uncharacterized protein LOC101204887 [Cucumis sativus]3.2e-28687.75Show/hide
Query:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL
        MSSEL TVENPRKWRFTWEAQSHIP LRLLLFDS TNPSLQC+NLKV LNLQQSVVCV W QDL+MSI VP+PPVLVDA+SPLSFRAFEDHIEVKLVLLL
Subjt:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL

Query:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPII+NFDNVLDFS+E+  S+SKASKPL MDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLV RYT
Subjt:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP
        NSYRC KGVCLLTLTTITLSKDD+IGH+FPD++GT++ KDESD  D + LTEAK+ES CNHTSTEKVKSKQ N+K+L AN E   AEK S EVDSP+VTP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP

Query:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCCHH ESNVL HLD+DCMHHTC   K DPKP N +DISDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCPQCSTLIGAYP  NGCGPTDGGVR F
Subjt:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE
        KCYVS+C + ESGNLL EYTLERMFANQLLESA+EESSFRT+VKELKTKSPMLHIVLINSNSWSCSGYCLGMEDT E VPKVDLNPIIKVLFSDCNKSAE
Subjt:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE

Query:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL
        SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL
Subjt:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL

XP_008440769.1 PREDICTED: uncharacterized protein LOC103485086 [Cucumis melo]3.4e-28888.11Show/hide
Query:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL
        MSSE +TVENP KWRFTWEAQSHIP LRLLLFDS+TNPSL+C+NL VHLNLQQSVVCV W QDL MSI VP+PPVLVDAESPLSFRAF+DHIEVKLVLLL
Subjt:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL

Query:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPII+NFDNVLDFS+E+ NS+SKASKPL MDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
Subjt:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP
        NSYRC KGVCLLTLTTITLSKDD+IGH+FPD++GT+EFKDESD  D + LTEAK+ES CNHTSTEKVKSKQ N+KNL AN E  AA+K S EVDSPLVTP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP

Query:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCC H ESNVL HLD DCMHHTC   KLDPKP N +DISDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCPQCSTLIGAYP  NGCGPTDGGVR F
Subjt:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE
        KCYVS+C + ESGNLL EYTLERMFANQLLESA EESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDT E VPKVDLNPIIKVLFSDCNKSAE
Subjt:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE

Query:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL
        SHLRKLEEWVTKDIADEVFMLAHQ+E+LVEILVSRNDTLPSSCSSLDGLTLTSIL
Subjt:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL

XP_022133273.1 uncharacterized protein LOC111005900 [Momordica charantia]1.5e-29188.65Show/hide
Query:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL
        MSSE DTV++PRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNL QSVVC TW QDLE+SI VPIPPVLVD+ESPLSFRAFEDHIEVKL LLL
Subjt:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL

Query:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPI++NFDNVL+ SEER N YSKASKPLLMDSDQ SLSR+GGVHFYCRNCSFRLS+SPLR+FVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
Subjt:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKD-SEVDSPLVTP
        NSYRCAKGVCLLTLTTITLSKDD+IGH+FPD DGTR+FKDESD  D NWLTEAKQE QCN TS +KVK KQ NDK LAAN E DA EK+  EVDSP +TP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKD-SEVDSPLVTP

Query:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCCHHGESNVL+HLDRDCMHHTC  YKLDPKP NTID+SDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCP+CSTLIGAYPCSN CGPTDGGVRLF
Subjt:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE
        KCYVS+C+SVESGNLL EYTLERMFANQLLESAN+ESSFRTVVKELKTKSPMLHIVLINS SWSCSGYCLGMEDT E V K+DL+P+IKVLFSDC+KSAE
Subjt:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE

Query:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL
        SHLRKLEEWVTKDIADEVFMLAHQIEELVEIL S NDTLPSSCSSLDGLTLTSIL
Subjt:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL

XP_023543348.1 uncharacterized protein LOC111803252 [Cucurbita pepo subsp. pepo]8.4e-28788.47Show/hide
Query:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL
        M SEL +VENPRKWRFTWEAQSHIPTLRLLLFDS+TNPSLQCQNLKVHLNLQQSVVCV W QDLEMSI VP+PPVLVDAESPLSFRAFEDHIEVKLVLLL
Subjt:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL

Query:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPII+NFDNVLDFSE R +S SKA KPL MD DQ SLSRSGGVHFYCRNCSFRLS+SPLR+FVEMPSVNWREVADNWFG+CCCSFGGISEKLVTRYT
Subjt:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP
        NSYRCAKGVCLLTLTTITLSKDD+IGH+FPD DGTRE KDESD  D NWLTEAKQESQCNHTSTE+VKSKQFN KNL A TE +AA K S EVDSPLVT 
Subjt:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP

Query:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF
        IPD   HGESNVL  LDRDCMHHTC  Y+LDPKP NT+D+SDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCPQCSTLIGAYPC NGCGPTDGGVRLF
Subjt:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE
        KCYVS+C S ES NL  +YTLE+MFA+QLLESANEESSFRTVVKELKTKS MLHIVLINSNSWSCSGYCLGMEDT E+VPKVDLNPIIKVLFSDCNKSAE
Subjt:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE

Query:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL
        SHLRKLEEWVTKDIA+EVFMLAHQIEEL EILVSRNDTLPSSCSSLDGLTLTSIL
Subjt:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL

XP_038883816.1 uncharacterized protein LOC120074678 isoform X1 [Benincasa hispida]5.6e-29188.83Show/hide
Query:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL
        MS ELDTVE+PRKWRFTWEAQSHIP LRLLLFDS+TNPSLQCQNLKVHLNLQQSVVCV W QDL+MSI VP+PPVLVDAESPLSFRAFEDHIEVKLVLLL
Subjt:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL

Query:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPII+NFDNVLDF +ER NS+SKA+KPL MD DQISLSRSGGVHFYCRNCSFRLSK+PLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
Subjt:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEK-DSEVDSPLVTP
        NSYRCAKGVCLLTLTTITLSKDD+ GH+FPD DGTREFKDESD+ D N LTEAKQES CNHTS EKVKSKQFN KN  A+ E +AAEK + EVDSP++TP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEK-DSEVDSPLVTP

Query:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF
         PDCCHH ES+VL HLDRDCMHHTC  Y LDPKP N++DISDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCPQCSTLIGAYPC  GCGPTD GVRLF
Subjt:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE
        KCYVS+C S ESGNLL EYTLERMFANQLLESANEESSFRTVVKELKTK PMLHIVLINSNSWSCSGYCLGMED  E VPKVDLNPIIKVLFSDCNKSAE
Subjt:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE

Query:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL
        SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL
Subjt:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL

TrEMBL top hitse value%identityAlignment
A0A0A0KKI8 Uncharacterized protein1.5e-28687.75Show/hide
Query:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL
        MSSEL TVENPRKWRFTWEAQSHIP LRLLLFDS TNPSLQC+NLKV LNLQQSVVCV W QDL+MSI VP+PPVLVDA+SPLSFRAFEDHIEVKLVLLL
Subjt:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL

Query:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPII+NFDNVLDFS+E+  S+SKASKPL MDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLV RYT
Subjt:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP
        NSYRC KGVCLLTLTTITLSKDD+IGH+FPD++GT++ KDESD  D + LTEAK+ES CNHTSTEKVKSKQ N+K+L AN E   AEK S EVDSP+VTP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP

Query:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCCHH ESNVL HLD+DCMHHTC   K DPKP N +DISDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCPQCSTLIGAYP  NGCGPTDGGVR F
Subjt:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE
        KCYVS+C + ESGNLL EYTLERMFANQLLESA+EESSFRT+VKELKTKSPMLHIVLINSNSWSCSGYCLGMEDT E VPKVDLNPIIKVLFSDCNKSAE
Subjt:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE

Query:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL
        SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL
Subjt:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL

A0A1S3B1W7 uncharacterized protein LOC1034850861.7e-28888.11Show/hide
Query:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL
        MSSE +TVENP KWRFTWEAQSHIP LRLLLFDS+TNPSL+C+NL VHLNLQQSVVCV W QDL MSI VP+PPVLVDAESPLSFRAF+DHIEVKLVLLL
Subjt:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL

Query:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPII+NFDNVLDFS+E+ NS+SKASKPL MDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
Subjt:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP
        NSYRC KGVCLLTLTTITLSKDD+IGH+FPD++GT+EFKDESD  D + LTEAK+ES CNHTSTEKVKSKQ N+KNL AN E  AA+K S EVDSPLVTP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP

Query:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCC H ESNVL HLD DCMHHTC   KLDPKP N +DISDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCPQCSTLIGAYP  NGCGPTDGGVR F
Subjt:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE
        KCYVS+C + ESGNLL EYTLERMFANQLLESA EESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDT E VPKVDLNPIIKVLFSDCNKSAE
Subjt:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE

Query:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL
        SHLRKLEEWVTKDIADEVFMLAHQ+E+LVEILVSRNDTLPSSCSSLDGLTLTSIL
Subjt:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL

A0A5A7SM17 Ubiquitin-conjugating enzyme E2C-binding protein1.7e-28888.11Show/hide
Query:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL
        MSSE +TVENP KWRFTWEAQSHIP LRLLLFDS+TNPSL+C+NL VHLNLQQSVVCV W QDL MSI VP+PPVLVDAESPLSFRAF+DHIEVKLVLLL
Subjt:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL

Query:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPII+NFDNVLDFS+E+ NS+SKASKPL MDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
Subjt:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP
        NSYRC KGVCLLTLTTITLSKDD+IGH+FPD++GT+EFKDESD  D + LTEAK+ES CNHTSTEKVKSKQ N+KNL AN E  AA+K S EVDSPLVTP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP

Query:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCC H ESNVL HLD DCMHHTC   KLDPKP N +DISDDQRSFLNGFLGNIFMARLSNLSADFEW EFFCPQCSTLIGAYP  NGCGPTDGGVR F
Subjt:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE
        KCYVS+C + ESGNLL EYTLERMFANQLLESA EESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDT E VPKVDLNPIIKVLFSDCNKSAE
Subjt:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE

Query:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL
        SHLRKLEEWVTKDIADEVFMLAHQ+E+LVEILVSRNDTLPSSCSSLDGLTLTSIL
Subjt:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL

A0A6J1BYQ5 uncharacterized protein LOC1110059007.2e-29288.65Show/hide
Query:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL
        MSSE DTV++PRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNL QSVVC TW QDLE+SI VPIPPVLVD+ESPLSFRAFEDHIEVKL LLL
Subjt:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL

Query:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPI++NFDNVL+ SEER N YSKASKPLLMDSDQ SLSR+GGVHFYCRNCSFRLS+SPLR+FVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
Subjt:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKD-SEVDSPLVTP
        NSYRCAKGVCLLTLTTITLSKDD+IGH+FPD DGTR+FKDESD  D NWLTEAKQE QCN TS +KVK KQ NDK LAAN E DA EK+  EVDSP +TP
Subjt:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKD-SEVDSPLVTP

Query:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF
        IPDCCHHGESNVL+HLDRDCMHHTC  YKLDPKP NTID+SDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCP+CSTLIGAYPCSN CGPTDGGVRLF
Subjt:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE
        KCYVS+C+SVESGNLL EYTLERMFANQLLESAN+ESSFRTVVKELKTKSPMLHIVLINS SWSCSGYCLGMEDT E V K+DL+P+IKVLFSDC+KSAE
Subjt:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE

Query:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL
        SHLRKLEEWVTKDIADEVFMLAHQIEELVEIL S NDTLPSSCSSLDGLTLTSIL
Subjt:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL

A0A6J1IL55 uncharacterized protein LOC1114784319.4e-28487.21Show/hide
Query:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL
        M SELD+VE+PRKWRFTWEAQSHIPTLRLLLFDS+TNPSLQCQNLKVHLNLQQSVVCV W QD+EMSI VP+PPVLVDAESPLSFRAFE+HIEVKLVLLL
Subjt:  MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLL

Query:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT
        PVDHPII+NFDNVLDFSE+R ++ SKA KPL MD DQ SLSRSGGVHFYCRNCSFRLS+SPLR+FVEMPSVNWREVADNWFG+CCCSFGG+SEKLVTRYT
Subjt:  PVDHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYT

Query:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP
        NSYRCAKGVCLLTLTTITLSKDD+IGH FPD DGTRE K+ESD  D NWLTEAKQESQCNHTST +VKSKQFN KNL A TE +A+ K S EVDSPLVT 
Subjt:  NSYRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDS-EVDSPLVTP

Query:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF
        IPD   HGESNVL  LDRDCMHHTC  Y+LDPKP NT+D+SDDQ SFLNGFLGNIFMARLSNLSADFEW EFFCPQCSTLIGAYPC NGCGPTDGGVRLF
Subjt:  IPDCCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLF

Query:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE
        KCYVS+C S ES NL  EYTLE+MFA+QLLESANEESSFRTVVKELKTKS MLHIVLINSNSWSCSGYCLGMEDT E+VPKVDLNPIIKVLFSDCNKSAE
Subjt:  KCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE

Query:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL
        SHLRKLEEWVTKDIA+EVFMLAHQIEEL EILVSRNDTLPSSCSSLDGLTLTSIL
Subjt:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSIL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G26750.1 CONTAINS InterPro DOMAIN/s: Ubiquitin-conjugating enzyme E2C-binding protein (InterPro:IPR019193); Has 26 Blast hits to 25 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 26; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.6e-12144.04Show/hide
Query:  RKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTW---------SQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLLPV
        R WR+TWEAQSH P LRL LFDS TNP + C++L V   + +S + VTW         S++  +S+ VPIP VL+D ESP++F+A +DHIEV+LVLLLPV
Subjt:  RKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTW---------SQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLLPV

Query:  DHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNS
        DHP++ +F+ V D  E+        S PL+M  D  +LS  GGVHFYCR+CS RL+K  L DF EMPS+NWRE ADNWFG+CCCSFGGISEK+V +YTNS
Subjt:  DHPIIINFDNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNS

Query:  YRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDSEVDSPLVTPIPD
        Y C+ G+CLL+ TT+ LSKDD++  +  +  GT E + ES +  +  +   +  S+ +  + E  +S   N       ++    +K S         +P 
Subjt:  YRCAKGVCLLTLTTITLSKDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDSEVDSPLVTPIPD

Query:  CCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCG--PTDGGVRLFK
        CC H   +            + +  +L+ K      ++ D++  L+GFL ++FMA+ SN+S + EW+EF CP+CS+ +GAYP   G    P DGGVRLFK
Subjt:  CCHHGESNVLDHLDRDCMHHTCDIYKLDPKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCG--PTDGGVRLFK

Query:  CYVS-SCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE
        CY+S S T+ ES ++  +YTLERMF NQL+E + EE SF  +VK+L TKSP+ +IV++N N++S +G C   ++       ++L+ I+KVLFSDCN S  
Subjt:  CYVS-SCTSVESGNLLSEYTLERMFANQLLESANEESSFRTVVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAE

Query:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSI
                 V K I +EV++L  Q EEL+E++ + +  LPSSCS L G  ++S+
Subjt:  SHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTLTSI

AT4G36440.1 unknown protein1.6e-14763.57Show/hide
Query:  VVEINAIAVPSSSCYVFDNSSHIIDFSSWIGQLFEYDGKDSDLVVRFCKDVESRSQMGYVDFGRFDKFNYFVSGSGHANFVQGYYNGDLTSCEQSYDKLG
        V+   ++ VP S+CY  DNSS ++DFSSWIG  FEYDGK+ DLVVRFCKDVE+R Q GYVDFGRFD  +YFVS S + +FV            QSYDKLG
Subjt:  VVEINAIAVPSSSCYVFDNSSHIIDFSSWIGQLFEYDGKDSDLVVRFCKDVESRSQMGYVDFGRFDKFNYFVSGSGHANFVQGYYNGDLTSCEQSYDKLG

Query:  RTSQVNVLCGGCLNGQCKGGLGCICNITYESNCRVIVDLAIPCEIQGPRVFKGFTVGFHPRSWEIVYNGLTQLGYEKPHRAFSFSTEQTGVVLYMTAIAS
        RT+QVN++CG C +G+CKGGLGCIC++T +S CRV VDLAIPCE  GPRVFKGFTVG HPRSWEI+YNG+TQ G++KP R FSF TEQT + LYMTAIAS
Subjt:  RTSQVNVLCGGCLNGQCKGGLGCICNITYESNCRVIVDLAIPCEIQGPRVFKGFTVGFHPRSWEIVYNGLTQLGYEKPHRAFSFSTEQTGVVLYMTAIAS

Query:  LSSLVQKPIIQVFPENGLEVKVSGSGATGSYPTTLSPSMLMIDWRCDVARDIPYEVNVTIPVADYEPISFLLTKMCEKRQDVQKDSMKGWATFGILSCIF
        LS+LV KPII+V PENGL+VK++GS  TG++PTTLSPS L++DW C+ +R  PYEVNVTIPV  Y+P+ F LTK+CE  Q  +  S KGWA FG+ SC+F
Subjt:  LSSLVQKPIIQVFPENGLEVKVSGSGATGSYPTTLSPSMLMIDWRCDVARDIPYEVNVTIPVADYEPISFLLTKMCEKRQDVQKDSMKGWATFGILSCIF

Query:  IVVASLFCCGGFVYKAQVQGQRGIDALPGMTLLSACLETVSGAGQSYPRAEGIDNAFVSEASWDRPSSSSS-SRRTWTPSEKNYGSI
        +V ++LFCCGGF+YK +V+  RG DALPGM+LLS  LETVSG+GQSY R E I+NAF +E SWDR S+SS+ +  T  PSE+ YG+I
Subjt:  IVVASLFCCGGFVYKAQVQGQRGIDALPGMTLLSACLETVSGAGQSYPRAEGIDNAFVSEASWDRPSSSSS-SRRTWTPSEKNYGSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCTGAACTCGATACAGTCGAAAACCCTAGAAAATGGCGCTTCACATGGGAAGCGCAATCCCATATACCAACCTTACGTTTGTTGCTCTTCGATTCCCATACCAA
CCCTTCTCTTCAATGTCAGAATCTCAAGGTTCATCTCAACCTCCAGCAGTCCGTCGTTTGTGTGACTTGGTCACAAGACCTCGAAATGTCGATTGGAGTCCCTATTCCTC
CGGTTTTGGTTGACGCTGAGTCGCCACTGAGTTTTCGAGCTTTCGAAGACCATATTGAGGTCAAACTCGTCTTGCTTCTTCCGGTCGATCACCCAATTATTATCAACTTC
GACAACGTGCTGGACTTCTCCGAAGAGCGAGAAAATAGCTACTCCAAGGCGTCGAAGCCGCTCTTAATGGACTCTGATCAAATCAGTTTATCACGCAGTGGTGGCGTCCA
CTTTTATTGCAGAAATTGTTCTTTCAGGCTGAGTAAATCCCCTCTCAGAGATTTTGTTGAAATGCCATCTGTCAACTGGCGAGAGGTGGCTGATAACTGGTTTGGGTCTT
GCTGCTGCTCTTTTGGGGGGATAAGCGAGAAGCTGGTAACTAGGTATACGAATTCCTATAGATGTGCAAAGGGTGTCTGCCTACTCACTTTAACAACTATTACTCTTTCC
AAGGATGACGTTATTGGACATATGTTCCCAGACTCTGATGGGACCCGAGAATTCAAGGATGAATCAGATATCATTGATGCCAATTGGTTAACTGAAGCTAAACAGGAATC
ACAATGTAATCATACATCTACAGAGAAGGTAAAATCTAAGCAGTTCAATGATAAAAACCTTGCTGCAAACACGGAGGCTGATGCTGCTGAGAAAGATAGTGAAGTTGATT
CACCTCTTGTGACTCCAATTCCTGACTGCTGTCATCATGGAGAAAGTAATGTACTTGATCATCTTGATAGAGATTGTATGCATCACACATGTGACATTTATAAGTTAGAC
CCCAAGCCTTTTAATACTATAGATATCTCAGATGATCAGAGATCCTTTCTTAATGGTTTTCTTGGAAATATCTTTATGGCTAGACTGTCAAATCTTTCAGCGGATTTTGA
GTGGGTTGAGTTTTTTTGCCCCCAGTGCTCAACTTTGATTGGGGCTTATCCTTGCAGTAATGGCTGCGGACCTACAGATGGTGGAGTTAGACTCTTTAAATGCTATGTCT
CATCATGCACGTCAGTTGAATCTGGAAATTTGTTGAGCGAGTACACCTTGGAAAGAATGTTTGCAAATCAGCTACTGGAAAGTGCAAATGAAGAATCATCATTTCGTACT
GTGGTTAAGGAATTGAAAACCAAGTCTCCCATGCTGCACATTGTTCTCATTAATTCAAATTCTTGGTCGTGTAGTGGTTATTGTTTGGGCATGGAGGATACAACAGAATT
AGTTCCAAAGGTGGATTTAAATCCTATCATCAAGGTGCTATTCTCCGATTGCAACAAAAGTGCAGAATCTCACTTGAGGAAACTTGAAGAGTGGGTGACTAAAGATATAG
CAGATGAAGTTTTTATGTTAGCACATCAAATAGAGGAACTAGTTGAAATCCTAGTGTCAAGAAATGATACACTTCCGTCTTCATGTTCTTCTCTTGATGGTTTAACTTTG
ACATCTATCCTGAGCATTGAAGAACTCGTCGTTGTATACTGTGTTTCTGCTTTTAGAAGCCATCAGCCGCCTTGTAACTATTTATCGGCAATGGCCTACCAGATTTTGGA
TGCAATTCAAGGATGCGGAAAGCTTCCCCACAGAGCATATCTATTTGGGGATATCAACGAATTATGTATGGACATTGAAACATTTTCTTTTGATGTCGATAGCTTTGACT
GGTGCAGTAGAGCAATTGCTGCTCATATGAATGGCGCAGAGAGAGCTTGTATGATGATTTTATGTCAGAAGATGGAAAAAGCAAACAGATTCAATTGCCATATCATGCAG
GTCGAAGGTATATGGTGGAGGGAAAAACAGATATGGAACAGAGTTTTGAGAATCACTAGTGAATTAATCAAAGCATCGAAATTATTGTCACCATCAACCCTGATTAAGAC
ATTAGTTACTGTTATGTTTTCGCGTATTCATCCCGTTTCATTCAGCGCTGAGCAACAAAACGGAGCTATTCACTTGAAGGTTTCAGCGTTATTCGATGCTACAATCTCCC
TGCTCTACACTGACTTGACGAATGCTTTAACTCAAGCATTTGATTCAACTGCAGCAATACGCATAGTTGTTGAAATCAATGCAATTGCAGTGCCAAGTTCCAGTTGCTAT
GTTTTTGACAACTCTAGTCACATTATTGACTTTAGTAGCTGGATTGGACAACTATTTGAATATGATGGGAAGGATTCCGACTTGGTGGTTCGGTTTTGCAAAGATGTGGA
AAGTAGATCGCAAATGGGATATGTAGATTTTGGTCGATTTGACAAATTCAACTATTTTGTCTCTGGTTCAGGACATGCCAACTTTGTTCAAGGTTATTACAACGGCGACC
TGACTTCTTGTGAGCAGAGTTATGACAAATTGGGGAGGACTTCGCAGGTAAATGTTTTGTGTGGAGGATGTTTAAATGGACAATGTAAAGGTGGTCTGGGATGCATTTGC
AATATCACTTATGAATCCAATTGCAGAGTTATTGTTGATCTTGCCATCCCTTGTGAGATACAAGGCCCACGTGTTTTCAAAGGATTTACTGTTGGTTTTCACCCCCGATC
CTGGGAAATTGTTTACAATGGTTTGACTCAATTAGGCTATGAGAAGCCACACCGTGCATTCAGCTTCAGCACAGAGCAGACTGGTGTGGTTCTTTATATGACTGCAATTG
CGTCACTTTCCTCTTTGGTACAGAAACCAATCATTCAGGTTTTTCCAGAAAATGGACTGGAGGTGAAAGTATCGGGATCAGGGGCAACTGGGAGCTACCCTACAACTTTG
TCACCCTCAATGTTGATGATTGATTGGAGATGTGATGTTGCCAGAGACATTCCATATGAAGTTAATGTCACGATCCCTGTGGCTGATTATGAACCAATTAGTTTTCTTCT
TACCAAAATGTGTGAAAAAAGGCAGGACGTACAAAAAGATTCTATGAAAGGATGGGCCACATTTGGAATACTATCTTGCATATTCATAGTTGTAGCATCACTATTTTGCT
GTGGAGGATTTGTTTATAAGGCCCAAGTGCAAGGCCAGCGTGGAATTGATGCATTGCCGGGCATGACACTACTATCCGCTTGCTTGGAAACCGTGAGTGGTGCAGGACAA
AGCTACCCAAGAGCGGAAGGCATCGACAATGCATTCGTCAGTGAAGCCTCCTGGGATCGGCCATCATCTTCTTCTTCTTCTCGACGGACATGGACACCATCTGAGAAAAA
TTATGGTTCAATATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATCTGAACTCGATACAGTCGAAAACCCTAGAAAATGGCGCTTCACATGGGAAGCGCAATCCCATATACCAACCTTACGTTTGTTGCTCTTCGATTCCCATACCAA
CCCTTCTCTTCAATGTCAGAATCTCAAGGTTCATCTCAACCTCCAGCAGTCCGTCGTTTGTGTGACTTGGTCACAAGACCTCGAAATGTCGATTGGAGTCCCTATTCCTC
CGGTTTTGGTTGACGCTGAGTCGCCACTGAGTTTTCGAGCTTTCGAAGACCATATTGAGGTCAAACTCGTCTTGCTTCTTCCGGTCGATCACCCAATTATTATCAACTTC
GACAACGTGCTGGACTTCTCCGAAGAGCGAGAAAATAGCTACTCCAAGGCGTCGAAGCCGCTCTTAATGGACTCTGATCAAATCAGTTTATCACGCAGTGGTGGCGTCCA
CTTTTATTGCAGAAATTGTTCTTTCAGGCTGAGTAAATCCCCTCTCAGAGATTTTGTTGAAATGCCATCTGTCAACTGGCGAGAGGTGGCTGATAACTGGTTTGGGTCTT
GCTGCTGCTCTTTTGGGGGGATAAGCGAGAAGCTGGTAACTAGGTATACGAATTCCTATAGATGTGCAAAGGGTGTCTGCCTACTCACTTTAACAACTATTACTCTTTCC
AAGGATGACGTTATTGGACATATGTTCCCAGACTCTGATGGGACCCGAGAATTCAAGGATGAATCAGATATCATTGATGCCAATTGGTTAACTGAAGCTAAACAGGAATC
ACAATGTAATCATACATCTACAGAGAAGGTAAAATCTAAGCAGTTCAATGATAAAAACCTTGCTGCAAACACGGAGGCTGATGCTGCTGAGAAAGATAGTGAAGTTGATT
CACCTCTTGTGACTCCAATTCCTGACTGCTGTCATCATGGAGAAAGTAATGTACTTGATCATCTTGATAGAGATTGTATGCATCACACATGTGACATTTATAAGTTAGAC
CCCAAGCCTTTTAATACTATAGATATCTCAGATGATCAGAGATCCTTTCTTAATGGTTTTCTTGGAAATATCTTTATGGCTAGACTGTCAAATCTTTCAGCGGATTTTGA
GTGGGTTGAGTTTTTTTGCCCCCAGTGCTCAACTTTGATTGGGGCTTATCCTTGCAGTAATGGCTGCGGACCTACAGATGGTGGAGTTAGACTCTTTAAATGCTATGTCT
CATCATGCACGTCAGTTGAATCTGGAAATTTGTTGAGCGAGTACACCTTGGAAAGAATGTTTGCAAATCAGCTACTGGAAAGTGCAAATGAAGAATCATCATTTCGTACT
GTGGTTAAGGAATTGAAAACCAAGTCTCCCATGCTGCACATTGTTCTCATTAATTCAAATTCTTGGTCGTGTAGTGGTTATTGTTTGGGCATGGAGGATACAACAGAATT
AGTTCCAAAGGTGGATTTAAATCCTATCATCAAGGTGCTATTCTCCGATTGCAACAAAAGTGCAGAATCTCACTTGAGGAAACTTGAAGAGTGGGTGACTAAAGATATAG
CAGATGAAGTTTTTATGTTAGCACATCAAATAGAGGAACTAGTTGAAATCCTAGTGTCAAGAAATGATACACTTCCGTCTTCATGTTCTTCTCTTGATGGTTTAACTTTG
ACATCTATCCTGAGCATTGAAGAACTCGTCGTTGTATACTGTGTTTCTGCTTTTAGAAGCCATCAGCCGCCTTGTAACTATTTATCGGCAATGGCCTACCAGATTTTGGA
TGCAATTCAAGGATGCGGAAAGCTTCCCCACAGAGCATATCTATTTGGGGATATCAACGAATTATGTATGGACATTGAAACATTTTCTTTTGATGTCGATAGCTTTGACT
GGTGCAGTAGAGCAATTGCTGCTCATATGAATGGCGCAGAGAGAGCTTGTATGATGATTTTATGTCAGAAGATGGAAAAAGCAAACAGATTCAATTGCCATATCATGCAG
GTCGAAGGTATATGGTGGAGGGAAAAACAGATATGGAACAGAGTTTTGAGAATCACTAGTGAATTAATCAAAGCATCGAAATTATTGTCACCATCAACCCTGATTAAGAC
ATTAGTTACTGTTATGTTTTCGCGTATTCATCCCGTTTCATTCAGCGCTGAGCAACAAAACGGAGCTATTCACTTGAAGGTTTCAGCGTTATTCGATGCTACAATCTCCC
TGCTCTACACTGACTTGACGAATGCTTTAACTCAAGCATTTGATTCAACTGCAGCAATACGCATAGTTGTTGAAATCAATGCAATTGCAGTGCCAAGTTCCAGTTGCTAT
GTTTTTGACAACTCTAGTCACATTATTGACTTTAGTAGCTGGATTGGACAACTATTTGAATATGATGGGAAGGATTCCGACTTGGTGGTTCGGTTTTGCAAAGATGTGGA
AAGTAGATCGCAAATGGGATATGTAGATTTTGGTCGATTTGACAAATTCAACTATTTTGTCTCTGGTTCAGGACATGCCAACTTTGTTCAAGGTTATTACAACGGCGACC
TGACTTCTTGTGAGCAGAGTTATGACAAATTGGGGAGGACTTCGCAGGTAAATGTTTTGTGTGGAGGATGTTTAAATGGACAATGTAAAGGTGGTCTGGGATGCATTTGC
AATATCACTTATGAATCCAATTGCAGAGTTATTGTTGATCTTGCCATCCCTTGTGAGATACAAGGCCCACGTGTTTTCAAAGGATTTACTGTTGGTTTTCACCCCCGATC
CTGGGAAATTGTTTACAATGGTTTGACTCAATTAGGCTATGAGAAGCCACACCGTGCATTCAGCTTCAGCACAGAGCAGACTGGTGTGGTTCTTTATATGACTGCAATTG
CGTCACTTTCCTCTTTGGTACAGAAACCAATCATTCAGGTTTTTCCAGAAAATGGACTGGAGGTGAAAGTATCGGGATCAGGGGCAACTGGGAGCTACCCTACAACTTTG
TCACCCTCAATGTTGATGATTGATTGGAGATGTGATGTTGCCAGAGACATTCCATATGAAGTTAATGTCACGATCCCTGTGGCTGATTATGAACCAATTAGTTTTCTTCT
TACCAAAATGTGTGAAAAAAGGCAGGACGTACAAAAAGATTCTATGAAAGGATGGGCCACATTTGGAATACTATCTTGCATATTCATAGTTGTAGCATCACTATTTTGCT
GTGGAGGATTTGTTTATAAGGCCCAAGTGCAAGGCCAGCGTGGAATTGATGCATTGCCGGGCATGACACTACTATCCGCTTGCTTGGAAACCGTGAGTGGTGCAGGACAA
AGCTACCCAAGAGCGGAAGGCATCGACAATGCATTCGTCAGTGAAGCCTCCTGGGATCGGCCATCATCTTCTTCTTCTTCTCGACGGACATGGACACCATCTGAGAAAAA
TTATGGTTCAATATGA
Protein sequenceShow/hide protein sequence
MSSELDTVENPRKWRFTWEAQSHIPTLRLLLFDSHTNPSLQCQNLKVHLNLQQSVVCVTWSQDLEMSIGVPIPPVLVDAESPLSFRAFEDHIEVKLVLLLPVDHPIIINF
DNVLDFSEERENSYSKASKPLLMDSDQISLSRSGGVHFYCRNCSFRLSKSPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGVCLLTLTTITLS
KDDVIGHMFPDSDGTREFKDESDIIDANWLTEAKQESQCNHTSTEKVKSKQFNDKNLAANTEADAAEKDSEVDSPLVTPIPDCCHHGESNVLDHLDRDCMHHTCDIYKLD
PKPFNTIDISDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLFKCYVSSCTSVESGNLLSEYTLERMFANQLLESANEESSFRT
VVKELKTKSPMLHIVLINSNSWSCSGYCLGMEDTTELVPKVDLNPIIKVLFSDCNKSAESHLRKLEEWVTKDIADEVFMLAHQIEELVEILVSRNDTLPSSCSSLDGLTL
TSILSIEELVVVYCVSAFRSHQPPCNYLSAMAYQILDAIQGCGKLPHRAYLFGDINELCMDIETFSFDVDSFDWCSRAIAAHMNGAERACMMILCQKMEKANRFNCHIMQ
VEGIWWREKQIWNRVLRITSELIKASKLLSPSTLIKTLVTVMFSRIHPVSFSAEQQNGAIHLKVSALFDATISLLYTDLTNALTQAFDSTAAIRIVVEINAIAVPSSSCY
VFDNSSHIIDFSSWIGQLFEYDGKDSDLVVRFCKDVESRSQMGYVDFGRFDKFNYFVSGSGHANFVQGYYNGDLTSCEQSYDKLGRTSQVNVLCGGCLNGQCKGGLGCIC
NITYESNCRVIVDLAIPCEIQGPRVFKGFTVGFHPRSWEIVYNGLTQLGYEKPHRAFSFSTEQTGVVLYMTAIASLSSLVQKPIIQVFPENGLEVKVSGSGATGSYPTTL
SPSMLMIDWRCDVARDIPYEVNVTIPVADYEPISFLLTKMCEKRQDVQKDSMKGWATFGILSCIFIVVASLFCCGGFVYKAQVQGQRGIDALPGMTLLSACLETVSGAGQ
SYPRAEGIDNAFVSEASWDRPSSSSSSRRTWTPSEKNYGSI