; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10010997 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10010997
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein IWS1-like protein A isoform X1
Genome locationChr01:1428962..1431332
RNA-Seq ExpressionHG10010997
SyntenyHG10010997
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050461.1 protein IWS1-like protein A isoform X1 [Cucumis melo var. makuwa]3.1e-24877.2Show/hide
Query:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWI---------------------EDGKKEVV
        MDVEEVRHKVE  DVCKCSELKLRCE+AE  CEELELKIQIKKIE+ELLQ QM+KLESEKSVIE ELK W+                     ED K EVV
Subjt:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWI---------------------EDGKKEVV

Query:  NLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAG
        NLVD NEI++E+ QLLIENNVL VEKK AE+EAELWK KFK LEIQ SKMDERKG GKPVL GKFEG IGLS S E  A DGVASNHYA SKEI      
Subjt:  NLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAG

Query:  TPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGS
        TPPN S PN T+SF +G++  TIES+FEH NRVRKQL FEEESP KKMAPLTPG SRPLHGVINIDDSDDEVII KLPSH ++KSEACVSA QDLS C S
Subjt:  TPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGS

Query:  ICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPA
        IC EEKIDNGNK+++SDC TNN ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTNCTQV S+LN+DPTS CDKVVVSA P 
Subjt:  ICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPA

Query:  RRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW
        RRRLVSLRN   KG  KENS SQI  IKNGM D EDELEDV S   +++DS EGFIV+ SD SD GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKW
Subjt:  RRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW

Query:  EYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLF
        EYEADMLA+FGKDPELCMKAVCALYRQQTSEEK+SKETLFSNKRGFSKFDAIRGT LAEFLT+GD QGNL KSVKELE YDPKAI+LCR LATHYSKQLF
Subjt:  EYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLF

Query:  EIYGNKEDPFFLPA
        EIYGNKEDPFFL A
Subjt:  EIYGNKEDPFFLPA

TYK03142.1 protein IWS1-like protein A isoform X1 [Cucumis melo var. makuwa]7.1e-24576.55Show/hide
Query:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWIE-------------DGKK--------EVV
        MDVEEVRHKVE  DVCKCSELKLRCE+AE  CEELELKIQIKKIE+ELLQ QM+KLESEKSVIE ELK W+E             DG+K        EVV
Subjt:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWIE-------------DGKK--------EVV

Query:  NLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAG
        NLVD NEI++E+ +LLIENNVL VEKK AE+EAELWK KFK LEIQ SKMDERKG GKPVL GKFEG IGLS S E  A DGVASNHYA           
Subjt:  NLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAG

Query:  TPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGS
        TPPN S  N TVSF +G++  TIES+FEH NRVRKQL FEEESP KKMAPLTPG SRPLHGVINIDDSDDEVII KLPSH ++KSEACVSA QDLS C S
Subjt:  TPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGS

Query:  ICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPA
        IC EEKIDNGNK+++SDC TNN ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTNCTQV S+LN+DPTS CDKVVVSA P 
Subjt:  ICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPA

Query:  RRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW
        RRRLVSLRN   KG  KENS SQI  IKNGM D EDELEDV S   +++DS EGFIV+ SD SD GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKW
Subjt:  RRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW

Query:  EYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLF
        EYEADMLA+FGKDPELCMKAVCALYRQQTSEEK+SKETLFSNKRGFSKFDAIRGT LAEFLT+GD QGNL KSVKELE YDPKAI+LCR LATHYSKQLF
Subjt:  EYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLF

Query:  EIYGNKEDPFFLPA
        EIYGNKEDPFFL A
Subjt:  EIYGNKEDPFFLPA

XP_008465383.1 PREDICTED: uncharacterized protein LOC103503017 isoform X1 [Cucumis melo]2.4e-25378.01Show/hide
Query:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWI---------------------EDGKKEVV
        MDVEEVRHKVE  DVCKCSELKLRCE+AE  CEELELKIQIKKIE+ELLQ QM+KLESEKSVIE ELK W+                     ED K EVV
Subjt:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWI---------------------EDGKKEVV

Query:  NLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAG
        NLVD NEI++E+ QLLIENNVL VEKK AE+EAELWK KFK LEIQ SKMDERKG GKPVL GKFEG IGLS S E  A DGVASNHYA SKEISN+EAG
Subjt:  NLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAG

Query:  TPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGS
        TPPN S PN T+SF +G++  TIES+FEH NRVRKQL FEEESP KKMAPLTPG SRPLHGVINIDDSDDEVII KLPSH ++KSEACVSA QDLS C S
Subjt:  TPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGS

Query:  ICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPA
        IC EEKIDNGNK+++SDC TNN ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTNCTQV S+LN+DPTS CDKVVVSA P 
Subjt:  ICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPA

Query:  RRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW
        RRRLVSLRN   KG  KENS SQI  IKNGM D EDELEDV S   +++DS EGFIV+ SD SD GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKW
Subjt:  RRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW

Query:  EYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLF
        EYEADMLA+FGKDPELCMKAVCALYRQQTSEEK+SKETLFSNKRGFSKFDAIRGT LAEFLT+GD QGNL KSVKELE YDPKAI+LCR LATHYSKQLF
Subjt:  EYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLF

Query:  EIYGNKEDPFFLPA
        EIYGNKEDPFFL A
Subjt:  EIYGNKEDPFFLPA

XP_008465387.1 PREDICTED: uncharacterized protein LOC103503017 isoform X2 [Cucumis melo]1.5e-23976.56Show/hide
Query:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWIEDGKKEVVNLVDKNEIEDEVVQLLIENNV
        MDVEEVRHKVE  DVCKCSELKLRCE+AE  CEELELKIQIKKIE+ELLQ QM+KLESEKSVIE ELK W+E   +                        
Subjt:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWIEDGKKEVVNLVDKNEIEDEVVQLLIENNV

Query:  LTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAGTPPNVSPPNHTVSFERGKRLR
           ++K AE+EAELWK KFK LEIQ SKMDERKG GKPVL GKFEG IGLS S E  A DGVASNHYA SKEISN+EAGTPPN S PN T+SF +G++  
Subjt:  LTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAGTPPNVSPPNHTVSFERGKRLR

Query:  TIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGSICSEEKIDNGNKLSKSDCCTN
        TIES+FEH NRVRKQL FEEESP KKMAPLTPG SRPLHGVINIDDSDDEVII KLPSH ++KSEACVSA QDLS C SIC EEKIDNGNK+++SDC TN
Subjt:  TIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGSICSEEKIDNGNKLSKSDCCTN

Query:  NVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPARRRLVSLRNNYGKGIGKENSL
        N ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTNCTQV S+LN+DPTS CDKVVVSA P RRRLVSLRN   KG  KENS 
Subjt:  NVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPARRRLVSLRNNYGKGIGKENSL

Query:  SQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPELCMKAV
        SQI  IKNGM D EDELEDV S   +++DS EGFIV+ SD SD GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKWEYEADMLA+FGKDPELCMKAV
Subjt:  SQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPELCMKAV

Query:  CALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFLPA
        CALYRQQTSEEK+SKETLFSNKRGFSKFDAIRGT LAEFLT+GD QGNL KSVKELE YDPKAI+LCR LATHYSKQLFEIYGNKEDPFFL A
Subjt:  CALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFLPA

XP_038898046.1 uncharacterized protein LOC120085872 [Benincasa hispida]8.0e-28987.56Show/hide
Query:  MLDMKMDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWIEDGKKEVVNLVDKNEIEDEVVQLL
        MLD+KMDVEE+R+KVE  DVCKC+ELKLRCE+AE+ CEELELKIQIKKIE+ELLQAQM+KLESEKSVIE+ELK+WIE+GKKE VNLVDKNE++DEVVQLL
Subjt:  MLDMKMDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWIEDGKKEVVNLVDKNEIEDEVVQLL

Query:  IENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAGTPPNVSPPNHTVSFER
        IENNVL VEKKKAESEAELWKDKFKQLEIQISKMDERKG  +PVLSGKFEGGIGLS SPEARA DGVA NHY LSKEISNLEAGTPPN S PN TVSF R
Subjt:  IENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAGTPPNVSPPNHTVSFER

Query:  GKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGSICSEEKIDNGNKLSKS
        GK++ T+ESYF+H NRVRK+L FEEESPCKKMAPLTPG SRPLHGVINIDDSDDEV I KLPS IRKKSEACVSAAQDLSGCGSICS+EKI + NKLS+S
Subjt:  GKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGSICSEEKIDNGNKLSKS

Query:  DCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPARRRLVSLRNNYGKGIG
        DCCT NVENVE FSD+FIF+STPKR RASNIITSDTETDDDDN+PICKLK+ PLQKTNCTQVGSE  DDPTSACDKVVVS TPARRRLV LRNNY KG G
Subjt:  DCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPARRRLVSLRNNYGKGIG

Query:  KENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQSEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPELCMK
        KENS SQI EIKNGMKDVEDELEDVGSELETESDSLEGFIV DS+VSD GSTSQSEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPELCMK
Subjt:  KENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQSEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPELCMK

Query:  AVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFLPA
        AVCALYRQQTSEEKVSK TLFSNK+GFSKFDAIRGTSLAEFLTDG+ QGNL KSVKE+E YDP AIDLCRKLATHYSKQLFEI+GNKEDPFFLPA
Subjt:  AVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFLPA

TrEMBL top hitse value%identityAlignment
A0A1S3CNP3 uncharacterized protein LOC103503017 isoform X27.4e-24076.56Show/hide
Query:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWIEDGKKEVVNLVDKNEIEDEVVQLLIENNV
        MDVEEVRHKVE  DVCKCSELKLRCE+AE  CEELELKIQIKKIE+ELLQ QM+KLESEKSVIE ELK W+E   +                        
Subjt:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWIEDGKKEVVNLVDKNEIEDEVVQLLIENNV

Query:  LTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAGTPPNVSPPNHTVSFERGKRLR
           ++K AE+EAELWK KFK LEIQ SKMDERKG GKPVL GKFEG IGLS S E  A DGVASNHYA SKEISN+EAGTPPN S PN T+SF +G++  
Subjt:  LTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAGTPPNVSPPNHTVSFERGKRLR

Query:  TIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGSICSEEKIDNGNKLSKSDCCTN
        TIES+FEH NRVRKQL FEEESP KKMAPLTPG SRPLHGVINIDDSDDEVII KLPSH ++KSEACVSA QDLS C SIC EEKIDNGNK+++SDC TN
Subjt:  TIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGSICSEEKIDNGNKLSKSDCCTN

Query:  NVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPARRRLVSLRNNYGKGIGKENSL
        N ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTNCTQV S+LN+DPTS CDKVVVSA P RRRLVSLRN   KG  KENS 
Subjt:  NVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPARRRLVSLRNNYGKGIGKENSL

Query:  SQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPELCMKAV
        SQI  IKNGM D EDELEDV S   +++DS EGFIV+ SD SD GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKWEYEADMLA+FGKDPELCMKAV
Subjt:  SQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPELCMKAV

Query:  CALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFLPA
        CALYRQQTSEEK+SKETLFSNKRGFSKFDAIRGT LAEFLT+GD QGNL KSVKELE YDPKAI+LCR LATHYSKQLFEIYGNKEDPFFL A
Subjt:  CALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFLPA

A0A1S3CNR9 uncharacterized protein LOC103503017 isoform X11.2e-25378.01Show/hide
Query:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWI---------------------EDGKKEVV
        MDVEEVRHKVE  DVCKCSELKLRCE+AE  CEELELKIQIKKIE+ELLQ QM+KLESEKSVIE ELK W+                     ED K EVV
Subjt:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWI---------------------EDGKKEVV

Query:  NLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAG
        NLVD NEI++E+ QLLIENNVL VEKK AE+EAELWK KFK LEIQ SKMDERKG GKPVL GKFEG IGLS S E  A DGVASNHYA SKEISN+EAG
Subjt:  NLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAG

Query:  TPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGS
        TPPN S PN T+SF +G++  TIES+FEH NRVRKQL FEEESP KKMAPLTPG SRPLHGVINIDDSDDEVII KLPSH ++KSEACVSA QDLS C S
Subjt:  TPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGS

Query:  ICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPA
        IC EEKIDNGNK+++SDC TNN ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTNCTQV S+LN+DPTS CDKVVVSA P 
Subjt:  ICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPA

Query:  RRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW
        RRRLVSLRN   KG  KENS SQI  IKNGM D EDELEDV S   +++DS EGFIV+ SD SD GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKW
Subjt:  RRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW

Query:  EYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLF
        EYEADMLA+FGKDPELCMKAVCALYRQQTSEEK+SKETLFSNKRGFSKFDAIRGT LAEFLT+GD QGNL KSVKELE YDPKAI+LCR LATHYSKQLF
Subjt:  EYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLF

Query:  EIYGNKEDPFFLPA
        EIYGNKEDPFFL A
Subjt:  EIYGNKEDPFFLPA

A0A5A7UA97 Protein IWS1-like protein A isoform X11.5e-24877.2Show/hide
Query:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWI---------------------EDGKKEVV
        MDVEEVRHKVE  DVCKCSELKLRCE+AE  CEELELKIQIKKIE+ELLQ QM+KLESEKSVIE ELK W+                     ED K EVV
Subjt:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWI---------------------EDGKKEVV

Query:  NLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAG
        NLVD NEI++E+ QLLIENNVL VEKK AE+EAELWK KFK LEIQ SKMDERKG GKPVL GKFEG IGLS S E  A DGVASNHYA SKEI      
Subjt:  NLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAG

Query:  TPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGS
        TPPN S PN T+SF +G++  TIES+FEH NRVRKQL FEEESP KKMAPLTPG SRPLHGVINIDDSDDEVII KLPSH ++KSEACVSA QDLS C S
Subjt:  TPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGS

Query:  ICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPA
        IC EEKIDNGNK+++SDC TNN ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTNCTQV S+LN+DPTS CDKVVVSA P 
Subjt:  ICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPA

Query:  RRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW
        RRRLVSLRN   KG  KENS SQI  IKNGM D EDELEDV S   +++DS EGFIV+ SD SD GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKW
Subjt:  RRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW

Query:  EYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLF
        EYEADMLA+FGKDPELCMKAVCALYRQQTSEEK+SKETLFSNKRGFSKFDAIRGT LAEFLT+GD QGNL KSVKELE YDPKAI+LCR LATHYSKQLF
Subjt:  EYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLF

Query:  EIYGNKEDPFFLPA
        EIYGNKEDPFFL A
Subjt:  EIYGNKEDPFFLPA

A0A5D3BWC5 Protein IWS1-like protein A isoform X13.4e-24576.55Show/hide
Query:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWIE-------------DGKK--------EVV
        MDVEEVRHKVE  DVCKCSELKLRCE+AE  CEELELKIQIKKIE+ELLQ QM+KLESEKSVIE ELK W+E             DG+K        EVV
Subjt:  MDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWIE-------------DGKK--------EVV

Query:  NLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAG
        NLVD NEI++E+ +LLIENNVL VEKK AE+EAELWK KFK LEIQ SKMDERKG GKPVL GKFEG IGLS S E  A DGVASNHYA           
Subjt:  NLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAG

Query:  TPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGS
        TPPN S  N TVSF +G++  TIES+FEH NRVRKQL FEEESP KKMAPLTPG SRPLHGVINIDDSDDEVII KLPSH ++KSEACVSA QDLS C S
Subjt:  TPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGS

Query:  ICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPA
        IC EEKIDNGNK+++SDC TNN ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTNCTQV S+LN+DPTS CDKVVVSA P 
Subjt:  ICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPA

Query:  RRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW
        RRRLVSLRN   KG  KENS SQI  IKNGM D EDELEDV S   +++DS EGFIV+ SD SD GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKW
Subjt:  RRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW

Query:  EYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLF
        EYEADMLA+FGKDPELCMKAVCALYRQQTSEEK+SKETLFSNKRGFSKFDAIRGT LAEFLT+GD QGNL KSVKELE YDPKAI+LCR LATHYSKQLF
Subjt:  EYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLF

Query:  EIYGNKEDPFFLPA
        EIYGNKEDPFFL A
Subjt:  EIYGNKEDPFFLPA

A0A6J1HU91 uncharacterized protein LOC1114675156.9e-23873.11Show/hide
Query:  MLDMKMDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWI---------------------EDG
        MLD+KMDVEEVRH+VE +DVCKC+ELKLRCERAEK CEELEL IQ KKIE ELLQ Q++ LESEKS IE EL VW+                     EDG
Subjt:  MLDMKMDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWI---------------------EDG

Query:  KKEVVNLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDER--KGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKE
         K  VNL+ K ++E+EV+QLLIENNVL VEKKKAESEAE+WK KFK+LEI+I K+D+        PVLSG+ EGG G  SSPEAR LD V SNH++L K+
Subjt:  KKEVVNLVDKNEIEDEVVQLLIENNVLTVEKKKAESEAELWKDKFKQLEIQISKMDER--KGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKE

Query:  ISNLEAGTPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQ
        I +L+AGTPPN SP +HTVS ERGK+ R IESYF+H N+VRKQL FE+ESP K+MAPLTP  SRPLHG+INIDDSDDE II K  S+ +KK+E CVSAAQ
Subjt:  ISNLEAGTPPNVSPPNHTVSFERGKRLRTIESYFEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQ

Query:  DLSGCGSICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDP---TSAC
        DLSGCG   SEEKI N NKL +S C T N+E +EAFSD FIFNSTPKRKRASNII SDTETDDDDNVPICKLKR P+QKTN TQV  ELND P   TSAC
Subjt:  DLSGCGSICSEEKIDNGNKLSKSDCCTNNVENVEAFSDDFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDP---TSAC

Query:  DKVVVSATPARRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQSEDLSNGSEGYDEIISKLGR
        DKVV SATPARRRLVSLR+ Y K  G +   SQI  I NGMKDVEDELED  SE +TES+SL+ FIVD SD SD GS +QS+DLSNGSEGY+EIISKLGR
Subjt:  DKVVVSATPARRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQSEDLSNGSEGYDEIISKLGR

Query:  RKKELKWEYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLAT
        RKKELKWEYEADMLAAFGKDPELCMKAVCALYRQQTSEEKV KE+LFSN RGFSKFDAI+GT LAEFLTDGD QGNL KSV+ELE YDPKAI+LCRKLAT
Subjt:  RKKELKWEYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLAT

Query:  HYSKQLFEIYGNKEDPFFLPA
        HYSKQLFEIYGNKEDPFFLPA
Subjt:  HYSKQLFEIYGNKEDPFFLPA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53220.1 unknown protein7.2e-5445.64Show/hide
Query:  NVENVEAFSDDFIFNSTPKRKRASNIITS----DTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPARRRLVSLRNNYGKGIGK
        N E+ +  + D   +S   R++   +I S    D + DD+DN+PI  LK   L+ TN  Q  S+L D P     +   S   + +R VS R N  K + +
Subjt:  NVENVEAFSDDFIFNSTPKRKRASNIITS----DTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPARRRLVSLRNNYGKGIGK

Query:  ENSLS--QITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQSE---DLSNGSEGYDEIISKLGRRKK--ELKWEYEADMLAAFGKD
        E S S  ++  I       +DE E+ GS  E+E +SL+GFI+DD D  ++ S    E   + S+G  GY +++S+L R KK  + KWEYEADMLA FGKD
Subjt:  ENSLS--QITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQSE---DLSNGSEGYDEIISKLGRRKK--ELKWEYEADMLAAFGKD

Query:  PELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFF
        PELCM+AVC L+R QT +EKV + +  SN RGFSK DA+RGTS+A FLTDGD  G++KKSV+EL+ +D K ++ C +LA  YSKQLF+IY N+EDPFF
Subjt:  PELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFF

AT5G53220.2 unknown protein7.2e-5445.64Show/hide
Query:  NVENVEAFSDDFIFNSTPKRKRASNIITS----DTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPARRRLVSLRNNYGKGIGK
        N E+ +  + D   +S   R++   +I S    D + DD+DN+PI  LK   L+ TN  Q  S+L D P     +   S   + +R VS R N  K + +
Subjt:  NVENVEAFSDDFIFNSTPKRKRASNIITS----DTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPARRRLVSLRNNYGKGIGK

Query:  ENSLS--QITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQSE---DLSNGSEGYDEIISKLGRRKK--ELKWEYEADMLAAFGKD
        E S S  ++  I       +DE E+ GS  E+E +SL+GFI+DD D  ++ S    E   + S+G  GY +++S+L R KK  + KWEYEADMLA FGKD
Subjt:  ENSLS--QITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQSE---DLSNGSEGYDEIISKLGRRKK--ELKWEYEADMLAAFGKD

Query:  PELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFF
        PELCM+AVC L+R QT +EKV + +  SN RGFSK DA+RGTS+A FLTDGD  G++KKSV+EL+ +D K ++ C +LA  YSKQLF+IY N+EDPFF
Subjt:  PELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFF

AT5G53220.3 unknown protein7.2e-5445.64Show/hide
Query:  NVENVEAFSDDFIFNSTPKRKRASNIITS----DTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPARRRLVSLRNNYGKGIGK
        N E+ +  + D   +S   R++   +I S    D + DD+DN+PI  LK   L+ TN  Q  S+L D P     +   S   + +R VS R N  K + +
Subjt:  NVENVEAFSDDFIFNSTPKRKRASNIITS----DTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPARRRLVSLRNNYGKGIGK

Query:  ENSLS--QITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQSE---DLSNGSEGYDEIISKLGRRKK--ELKWEYEADMLAAFGKD
        E S S  ++  I       +DE E+ GS  E+E +SL+GFI+DD D  ++ S    E   + S+G  GY +++S+L R KK  + KWEYEADMLA FGKD
Subjt:  ENSLS--QITEIKNGMKDVEDELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQSE---DLSNGSEGYDEIISKLGRRKK--ELKWEYEADMLAAFGKD

Query:  PELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFF
        PELCM+AVC L+R QT +EKV + +  SN RGFSK DA+RGTS+A FLTDGD  G++KKSV+EL+ +D K ++ C +LA  YSKQLF+IY N+EDPFF
Subjt:  PELCMKAVCALYRQQTSEEKVSKETLFSNKRGFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGATATGAAGATGGATGTGGAGGAAGTTAGACATAAAGTAGAACAAACCGATGTATGCAAGTGCAGTGAACTGAAATTGAGGTGTGAAAGAGCTGAAAAA
ATGTGTGAAGAGTTAGAATTGAAGATTCAGATAAAGAAGATCGAGTTTGAATTACTTCAGGCCCAAATGCGGAAGTTGGAGTCAGAAAAGTCAGTCATTGAAGCA
GAGCTAAAAGTTTGGATTGAGGATGGAAAGAAAGAAGTTGTGAATTTAGTTGATAAGAATGAAATAGAAGATGAGGTTGTTCAGCTTCTGATTGAGAATAATGTT
TTGACAGTTGAGAAGAAGAAGGCTGAAAGTGAGGCTGAGCTGTGGAAAGATAAGTTCAAACAATTGGAGATTCAGATCTCAAAGATGGATGAAAGGAAAGGTGGT
GGCAAGCCAGTGCTTTCTGGGAAGTTTGAAGGGGGAATTGGATTATCATCGAGTCCTGAAGCTCGTGCTTTGGATGGTGTGGCCTCTAACCATTATGCTCTTAGT
AAGGAAATTAGTAACTTGGAAGCAGGAACACCTCCAAATGTCTCACCGCCCAATCATACTGTTTCCTTTGAGCGAGGAAAAAGACTTCGTACGATTGAATCTTAT
TTTGAGCATCAAAATCGAGTTAGAAAACAGTTGAGATTCGAGGAAGAGAGCCCCTGCAAGAAGATGGCACCATTAACTCCAGGCGCTTCTAGACCTTTACATGGC
GTCATTAACATTGATGATAGTGATGATGAAGTAATTATTGGCAAATTGCCCTCTCATATTCGGAAAAAGAGTGAGGCTTGTGTTTCAGCCGCTCAAGATTTAAGT
GGATGTGGATCTATTTGTTCTGAGGAGAAAATAGACAATGGAAACAAATTGAGCAAGTCTGATTGCTGTACAAACAATGTGGAAAATGTGGAAGCTTTTAGTGAT
GATTTTATCTTTAATTCAACACCCAAAAGGAAACGAGCTTCTAATATTATAACCAGCGACACTGAAACTGATGATGATGATAATGTTCCAATCTGCAAACTGAAG
AGATCGCCTCTTCAGAAAACAAACTGTACACAAGTAGGTTCAGAATTGAATGATGACCCTACTTCTGCGTGTGACAAAGTTGTGGTTAGTGCAACCCCTGCAAGG
AGGCGTTTAGTGTCGTTAAGAAATAACTATGGGAAAGGAATCGGAAAGGAAAATTCTTTAAGTCAGATCACTGAGATCAAGAATGGAATGAAGGATGTTGAAGAT
GAACTGGAAGATGTTGGGTCAGAGCTAGAGACCGAGAGTGATAGCTTGGAAGGATTTATTGTTGACGACTCTGATGTTTCGGATGCTGGAAGCACTTCTCAGTCA
GAAGATTTATCAAATGGCAGTGAAGGATATGATGAGATAATATCTAAGCTAGGAAGGAGAAAAAAAGAGTTGAAATGGGAGTATGAGGCTGATATGCTTGCAGCT
TTTGGCAAGGATCCTGAATTATGTATGAAGGCTGTGTGTGCTCTATATAGACAGCAAACTTCTGAGGAGAAAGTAAGTAAAGAAACGTTGTTCTCTAACAAGAGA
GGATTCAGCAAGTTCGATGCTATCAGGGGCACCAGTTTGGCTGAGTTTCTCACCGATGGCGATCTGCAAGGTAATCTCAAAAAATCTGTTAAAGAGTTGGAGGCC
TATGACCCTAAGGCCATTGACCTCTGCAGAAAATTAGCCACACATTACTCCAAGCAGTTGTTCGAGATATATGGAAATAAAGAAGATCCTTTCTTTTTACCTGCA
TGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGATATGAAGATGGATGTGGAGGAAGTTAGACATAAAGTAGAACAAACCGATGTATGCAAGTGCAGTGAACTGAAATTGAGGTGTGAAAGAGCTGAAAAA
ATGTGTGAAGAGTTAGAATTGAAGATTCAGATAAAGAAGATCGAGTTTGAATTACTTCAGGCCCAAATGCGGAAGTTGGAGTCAGAAAAGTCAGTCATTGAAGCA
GAGCTAAAAGTTTGGATTGAGGATGGAAAGAAAGAAGTTGTGAATTTAGTTGATAAGAATGAAATAGAAGATGAGGTTGTTCAGCTTCTGATTGAGAATAATGTT
TTGACAGTTGAGAAGAAGAAGGCTGAAAGTGAGGCTGAGCTGTGGAAAGATAAGTTCAAACAATTGGAGATTCAGATCTCAAAGATGGATGAAAGGAAAGGTGGT
GGCAAGCCAGTGCTTTCTGGGAAGTTTGAAGGGGGAATTGGATTATCATCGAGTCCTGAAGCTCGTGCTTTGGATGGTGTGGCCTCTAACCATTATGCTCTTAGT
AAGGAAATTAGTAACTTGGAAGCAGGAACACCTCCAAATGTCTCACCGCCCAATCATACTGTTTCCTTTGAGCGAGGAAAAAGACTTCGTACGATTGAATCTTAT
TTTGAGCATCAAAATCGAGTTAGAAAACAGTTGAGATTCGAGGAAGAGAGCCCCTGCAAGAAGATGGCACCATTAACTCCAGGCGCTTCTAGACCTTTACATGGC
GTCATTAACATTGATGATAGTGATGATGAAGTAATTATTGGCAAATTGCCCTCTCATATTCGGAAAAAGAGTGAGGCTTGTGTTTCAGCCGCTCAAGATTTAAGT
GGATGTGGATCTATTTGTTCTGAGGAGAAAATAGACAATGGAAACAAATTGAGCAAGTCTGATTGCTGTACAAACAATGTGGAAAATGTGGAAGCTTTTAGTGAT
GATTTTATCTTTAATTCAACACCCAAAAGGAAACGAGCTTCTAATATTATAACCAGCGACACTGAAACTGATGATGATGATAATGTTCCAATCTGCAAACTGAAG
AGATCGCCTCTTCAGAAAACAAACTGTACACAAGTAGGTTCAGAATTGAATGATGACCCTACTTCTGCGTGTGACAAAGTTGTGGTTAGTGCAACCCCTGCAAGG
AGGCGTTTAGTGTCGTTAAGAAATAACTATGGGAAAGGAATCGGAAAGGAAAATTCTTTAAGTCAGATCACTGAGATCAAGAATGGAATGAAGGATGTTGAAGAT
GAACTGGAAGATGTTGGGTCAGAGCTAGAGACCGAGAGTGATAGCTTGGAAGGATTTATTGTTGACGACTCTGATGTTTCGGATGCTGGAAGCACTTCTCAGTCA
GAAGATTTATCAAATGGCAGTGAAGGATATGATGAGATAATATCTAAGCTAGGAAGGAGAAAAAAAGAGTTGAAATGGGAGTATGAGGCTGATATGCTTGCAGCT
TTTGGCAAGGATCCTGAATTATGTATGAAGGCTGTGTGTGCTCTATATAGACAGCAAACTTCTGAGGAGAAAGTAAGTAAAGAAACGTTGTTCTCTAACAAGAGA
GGATTCAGCAAGTTCGATGCTATCAGGGGCACCAGTTTGGCTGAGTTTCTCACCGATGGCGATCTGCAAGGTAATCTCAAAAAATCTGTTAAAGAGTTGGAGGCC
TATGACCCTAAGGCCATTGACCTCTGCAGAAAATTAGCCACACATTACTCCAAGCAGTTGTTCGAGATATATGGAAATAAAGAAGATCCTTTCTTTTTACCTGCA
TGA
Protein sequenceShow/hide protein sequence
MLDMKMDVEEVRHKVEQTDVCKCSELKLRCERAEKMCEELELKIQIKKIEFELLQAQMRKLESEKSVIEAELKVWIEDGKKEVVNLVDKNEIEDEVVQLLIENNV
LTVEKKKAESEAELWKDKFKQLEIQISKMDERKGGGKPVLSGKFEGGIGLSSSPEARALDGVASNHYALSKEISNLEAGTPPNVSPPNHTVSFERGKRLRTIESY
FEHQNRVRKQLRFEEESPCKKMAPLTPGASRPLHGVINIDDSDDEVIIGKLPSHIRKKSEACVSAAQDLSGCGSICSEEKIDNGNKLSKSDCCTNNVENVEAFSD
DFIFNSTPKRKRASNIITSDTETDDDDNVPICKLKRSPLQKTNCTQVGSELNDDPTSACDKVVVSATPARRRLVSLRNNYGKGIGKENSLSQITEIKNGMKDVED
ELEDVGSELETESDSLEGFIVDDSDVSDAGSTSQSEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPELCMKAVCALYRQQTSEEKVSKETLFSNKR
GFSKFDAIRGTSLAEFLTDGDLQGNLKKSVKELEAYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFLPA